Site Reliability Engineer (SRE)

Apple Inc

Tokyo, Japan

Job posting number: #7150404 (Ref:apl-200479615)

Posted: May 15, 2023

Application Deadline: Open Until Filled

Job Description

Summary
The Site Reliability Engineer role resides within the Software Delivery organization, which is at the core of the Apple software release process. This role is responsible for creating and maintaining services, which are used by software engineers to compile source and assemble Apple software for delivery to customers around the world!

The Build Services team drives reliability and performance engineering, instruments observability of services, responds to incident alerts, and reports on SLI/SLO metrics for visibility across the organization. This SRE role is essential in maintaining cloud-based services that are responsible for delivering the state-of-the-art operating systems (macOS, iOS, watchOS, tvOS) and more to Apple customers!
Key Qualifications
  • Strong passion in building reliable, scalable, and performant cloud-based services
  • Provisioning and maintaining distributed systems using Kubernetes and containers
  • Strong understanding of distributed systems w.r.t. storage, networking, and security
  • SRE or Dev/Ops experience in managing customer-facing systems in 24/7 environment
  • Excellent judgment and integrity with ability to make timely and sound decisions
  • Ability to anticipate the needs of others and adapt to changing conditions
  • Excellent verbal and written communication skills
Description
As a Site Reliability Engineer, responsibilities include:
- Configuration management and integrating container platforms
- Automating continuous integration (CI) and continuous delivery (CD) pipelines
- Analyze requirements and migrate legacy workloads to containerized solutions
- Develop and deploy reliable, large-scale, services across multiple technology stacks
- Implement observability of system for monitoring, alerting, and metrics reporting
- Generate reports regarding service metrics on performance, availability, and reliability
- Provide feedback and insights of metrics and data analysis to engineering teams
- Champion practices regarding change control management and incident response

A successful Build Services Site Reliability Engineer will be expected to:
- Proactive communication of status to stakeholders and follow through on time-sensitive tasks
- Demonstrate willingness to ask for clarification and increase awareness of the larger context
- Explore multiple solutions to a problem, evaluate tradeoffs, then execute best approach
- Communicate asynchronously with a global team across multiple timezones
- Document new processes or update existing documentation pages
- Eager to learn about Apple software products and schedules
Education & Experience
BSCS, MSCS, or equivalent experience in Computer Science
Additional Requirements
  • Apple is an equal opportunity employer that is committed to inclusion and diversity. We also take affirmative action to offer employment and advancement opportunities to all applicants, including minorities, women, protected veterans, and individuals with disabilities. Apple will not discriminate or retaliate against applicants who inquire about, disclose, or discuss their compensation or that of other applicant.
  • We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.




Apply Now

Please mention to the employer that you saw this ad on Sciencejobs.org