Join Apple's Cloud Service Infrastructure team as a Site Reliability Engineer to support and scale cloud services for millions of Apple users.
The team builds and supports critical infrastructure systems and frameworks that provide services such as storage, caching, queueing, and search at hyperscale.
The platform supports a variety of services based on open-source software such as Kubernetes, Cassandra, ZooKeeper, Kafka, and Redis, as well as internally developed services.
As a member of this group, you will have substantial individual responsibility and influence over the direction the core platform takes for years to come.
You will work with domain experts in fleet management, systems, and software engineering to build automation, instrument reliability tooling, and respond to alerts and incidents.
The team focuses on infrastructure capabilities and processes that improve the reliability and efficiency of these systems at scale.
Key responsibilities include:
Requirements:
Preferred qualifications include experience with large bare-metal infrastructure, release management, and development within the Kubernetes ecosystem.