Sr Site Reliability Engineer Remote Ireland Based 4 Day Week Swing Shift Job In Dublin

Sr Site Reliability Engineer, Remote (Ireland based), 4 day week (Swing Shift) - ServiceNow
  • Dublin, County Dublin, Ireland
Job Description

The Site Reliability Engineering (SRE) team is a group of highly technical engineers tasked with maintaining and developing the reliability, scalability and performance of the ServiceNow platform and infrastructure. The SRE is empowered to drive technical resolutions across the technology stack, from application through to hardware and all stops in between. The ultimate goal of the SRE is to completely own the resolution of incidents and never have to escalate an issue to an engineering or development team. SREs are also tasked with improving the operability of the platform to drive down incident numbers and reduce MTTR. To accomplish this, the team combines Software Development, Networking and Systems Engineering expertise with a strong desire to be challenged by problems of scale and complexity and to make services better for our customers.

What you get to do in this role

As an Engineer in the SRE team you will:
  • Provide relief and sustainable resolution to issues within our infrastructure.
  • Use your experience in software development, systems engineering, and networking to proactively prevent repeatable issues.
  • Drive initiatives with partner teams to improve the reliability and performance of the infrastructure through improved system design.
  • Drive a culture of intolerance to manual activity, resulting in a highly automated environment that delivers scalable solutions.
  • Drive monitoring and automation initiatives.

Please note this is a Swing shift role with a Wed-Sat working week, and includes a shift allowance to compensate. Candidates for this role must be based in Ireland.

Qualifications

To be successful in this role you have:
  • Deep knowledge of Linux systems.
  • Experience working with relational databases: MySQL, MariaDB or PostgreSQL.
  • Experience working with systems at scale, supporting critical services with a focus on automation, observability, availability, and performance.
  • Experience with Kubernetes to orchestrate the deployment, scaling, and management of containers.
  • Experience coding in various languages, preferably Python, JavaScript, and Ruby.
  • Networking skills, including IP addressing and routing.
  • Team-first attitude and an uncompromising attention to detail.
  • An eye for proactively anticipating potential issues, expertise in performing root cause analysis, and a mindset focused on building effective solutions to prevent recurrence.
  • Good collaboration and communication skills.

Good to have:
  • Expertise in Observability and Monitoring of applications, services, and networks at scale.
  • Experience with DevOps automation, CI/CD pipelines (such as GitLab CI/CD), and agile methodologies.
  • Experience writing test specifications and understanding the fundamentals of test automation.
  • Experience working with Cloud technologies such as Azure and AWS.
  • Experience in configuration management of infrastructure using Ansible or Puppet.
  • Experience developing on the ServiceNow Platform.
