It Engineering - Nutanix
  • Cork, Munster, Ireland
  • via BeBee.com
-
Job Description

Job Description: We are seeking an experienced Operations Support Engineer with advanced expertise in Linux, virtualization technologies, enterprise storage, and Splunk queries. The role will support operational infrastructure and troubleshoot complex issues across various technical domains, ensuring high availability and performance of enterprise systems. Key responsibilities include developing and implementing advanced monitoring solutions, designing and deploying automation workflows, creating and refining escalation patterns, and maintaining and troubleshooting Linux-based systems. The ideal candidate will have 5+ years of experience in operations support, expertise in developing and deploying monitoring tools and automation frameworks, and proficiency in Splunk for querying and monitoring complex systems. Additionally, the candidate should have basic to intermediate experience with HW engineering design flows, ETX, grid computing, and GPU computing, as well as strong analytical and problem-solving skills and excellent communication skills.

    Key Responsibilities:
  • Develop and implement advanced monitoring solutions to track system health, detect anomalies, and proactively address issues before they impact operations.
  • Design and deploy automation workflows to streamline incident response, system recovery, and routine maintenance, reducing manual intervention.
  • Create and refine escalation patterns and processes, ensuring efficient resolution paths and minimizing operational disruption.
  • Maintain and troubleshoot Linux-based systems, ensuring optimal performance, security, and availability across the infrastructure.
  • Implement automation and scripting to enhance operational efficiency.
  • Manage and support virtualization platforms (e.g., VMware, KVM), ensuring resource allocation, scaling, and system stability.
  • Monitor virtual environments and resolve issues promptly to minimize downtime.
  • Develop and run advanced Splunk queries to monitor system performance, generate reports, and troubleshoot operational issues.
  • Use Splunk insights for proactive system improvements.
  • Diagnose and resolve issues related to ETX display technologies, grid computing environments, GPU computing setups, and licensing backend systems.
  • Provide support for infrastructure integration with hardware design flows.
  • Coordinate with Global COs and COOs to align operational strategies, ensure consistent system performance, and contribute to global initiatives.
  • Act as a point of escalation for critical infrastructure issues.

;