Site Reliability Engineering Leader - Security
Apple Services Engineering is seeking a passionate and talented Site Reliability Engineering Leader to lead a new SRE team dedicated to security services.
The successful candidate will oversee critical security infrastructure services, improve their reliability, observability, and manageability, and establish a European SRE team to support these services.
This role requires a strong SRE leader with experience managing SRE teams, mission-critical production services, and infrastructure development engineers.
The ideal candidate will have a deep understanding of SRE principles, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts.
They will also have experience working in a standard SDLC, understanding key Infrastructure Security concepts and principles, and proficiency in at least one of Python, Golang, Java, or Rust.
Key Responsibilities:
- Lead a new SRE team dedicated to security services
- Oversee critical security infrastructure services
- Improve reliability, observability, and manageability of security services
- Establish a European SRE team to support security services
- Collaborate with multi-functional teams to design, implement, and maintain security measures
Requirements:
- 8+ years of engineering management experience
- 5+ years of experience managing SRE teams and mission-critical production services
- Demonstrated success leading SRE teams and managing infrastructure development engineers
- Understanding of SRE principles and key Infrastructure Security concepts
- Proficiency in at least one of Python, Golang, Java, or Rust
Preferred Qualifications:
- Proven experience with large-scale, highly available, distributed, and fault-tolerant systems
- Excellent understanding of operating systems concepts, including multi-threading, memory management, networking, and storage
- Experience with Kubernetes, Docker, and containerization
- Deep knowledge of Linux security primitives, systems, packaging, container security, and SELinux