Staff Engineer Applications Site Reliability Engineering Job In Limerick

Staff Engineer, Applications, Site Reliability Engineering - General Motors
  • Limerick, Munster, Ireland
  • via BeBee.com
Job Description

Senior Technical Leadership Role:
Design and implement reliability patterns for client applications, communication protocols, and back-office services. Operate and influence data collection, processing, and delivery systems that are scalable, resilient, and capable of operating on millions of vehicles at a global scale. Leverage monitoring and observability tools to ensure system health and reliability, lead automation efforts to reduce toil, and drive postmortem processes to identify and remediate systemic issues. Participate in mentorship programs to grow the team.

Key Responsibilities:
Design and implement reliability patterns for client applications, communication protocols, and back-office services.
Operate and influence data collection, processing, and delivery systems that are scalable, resilient, and capable of operating on millions of vehicles at a global scale.
Leverage monitoring and observability tools (e.g. Prometheus, Grafana, Datadog) to ensure system health and reliability.
Lead automation efforts to reduce toil and keep our systems running efficiently.
Lead incident response efforts, ensure root cause analysis is carried out, and drive continuous improvement after incidents.
Drive postmortem processes, focusing on identifying and remediating systemic issues to prevent recurrence.
Participate in mentorship programs to help grow the team by sharing your experience.

Requirements:
8+ years of experience in reliability engineering.
Experience building and operating enterprise cloud applications.
Experience with cloud platforms such as AWS, Azure, or GCP, and container orchestration technologies like Kubernetes.
Familiarity with security practices such as DevSecOps, and with ensuring secure and compliant infrastructure as part of reliability engineering.
Ability to lead technical decision-making, balancing reliability, performance, and cost.
Ability to make challenging decisions under pressure.
Ability to learn complex systems in order to identify and mitigate incidents.
Ability to work well cross-functionally and navigate through ambiguity.
Strong written and verbal communication skills for both technical and non-technical audiences.
BS/MS/PhD in Computer Science/Engineering preferred.
