Job Description
SRE Observability Specialism Financial Services Dublin/WFH Hybrid
Market leader in Financial Services powered by Technology seeks Site Reliability Engineers with infrastructure experience to join their Dublin based team on a permanent basis.
The role is to design and deliver Observability and Monitoring processes to ensure ongoing performance, reliability and scalability of global platforms.
Responsibilities include:
- Quickly get up to speed on current Observability processes and audit to focus on areas of immediate improvement
- Work with global team to design updates and implement same
- Utilise experience on Dashboards, Reporting and Performance Analysis to adapt to automation processes, taking responsibility for developing various metrics
Requirements:
- 8+ years of commercial Reliability experience
- Log Retention processes knowledge
- Strength in Prometheus specifically with hands on knowledge of Go/Python Scripting
- In depth experience in Linux based OS, kernels etc
- Config ideally will include Terraform/Ansible
- Opinions on scaling, distributing systems
Preferred skills: Observability, Site Reliability Engineering, Python