Senior Site Reliability Engineer, Object Storage
Job Description
Summary
The Apple Services Engineering (ASE) team is one of the most exciting examples of Apple’s long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. And they do it on a massive scale, meeting Apple’s high expectations with high performance to deliver a huge variety of entertainment in over 35 languages to more than 150 countries.
These engineers build secure, end-to-end solutions. They develop the custom software used to process all the creative work, the tools that providers use to deliver that media, all the server-side systems, and the APIs for many Apple services.
Thanks to Apple’s unique integration of hardware, software, and services, engineers here partner to get behind a single unified vision. That vision always includes a deep commitment to strengthening Apple’s privacy policy, one of Apple’s core values. Although services are a bigger part of Apple’s business than ever before, these teams remain small, nimble, and cross-functional, offering greater exposure to the array of opportunities here.
These engineers build secure, end-to-end solutions. They develop the custom software used to process all the creative work, the tools that providers use to deliver that media, all the server-side systems, and the APIs for many Apple services.
Thanks to Apple’s unique integration of hardware, software, and services, engineers here partner to get behind a single unified vision. That vision always includes a deep commitment to strengthening Apple’s privacy policy, one of Apple’s core values. Although services are a bigger part of Apple’s business than ever before, these teams remain small, nimble, and cross-functional, offering greater exposure to the array of opportunities here.
Description
Support and maintain object store orchestration service measuring and monitoring availability, latency and overall system health.
- Develop, run and support SRE tools and applications.
- Engage in improving the whole lifecycle of services from inception through deployment, operations and refinement.
- Analyze logs and telemetry data by writing monitoring and automation code.
- Participate in on-call and release manager rotations.
- Provide technical expertise and troubleshooting during service level impacting events.
- Participate in code review, internal infrastructure improvements and process enhancements.
- Operate our application at scale, across multiple geographically dispersed public and private clouds, to support Apple’s mission critical internal efforts.
- Collaborate with dependent teams and customers through clear communications
- Develop, run and support SRE tools and applications.
- Engage in improving the whole lifecycle of services from inception through deployment, operations and refinement.
- Analyze logs and telemetry data by writing monitoring and automation code.
- Participate in on-call and release manager rotations.
- Provide technical expertise and troubleshooting during service level impacting events.
- Participate in code review, internal infrastructure improvements and process enhancements.
- Operate our application at scale, across multiple geographically dispersed public and private clouds, to support Apple’s mission critical internal efforts.
- Collaborate with dependent teams and customers through clear communications
Minimum Qualifications
- BS degree in computer science or equivalent field with 5+ years of experience
- At least 5 years in a Site Reliability Engineering, DevOps or infrastructure focused role
Preferred Qualifications
- Lower level understanding of the Linux Operating System, standard networking protocols, and components
- Experience with containers and orchestration via Kubernetes in public / private clouds
- Hands-on experience managing large numbers of diverse systems with configuration management, infrastructure provisioning tools or software delivery platforms (such as Terraform and Spinnaker)
- Excellent troubleshooting and problem solving skills