Job Details
Job Description
Role Number: 200635034-3543
Summary
Imagine a world where every video stream starts instantly, every live event plays flawlessly, and the perfect movie finds you before you even search. As a Video SRE at Apple, you'll turn this vision into reality. You'll tackle unprecedented scale challenges, engineer systems that self-heal, and ensure our video infrastructure remains rock-solid when millions tune in simultaneously. If you thrive on solving complex distributed systems problems and want your work to directly impact how the world experiences entertainment, this is your opportunity.
Description
As a Video SRE at Apple, you will be responsible for the reliability, scalability, and performance of our video streaming infrastructure that serves content to millions of users globally. You will work closely with software engineers, video encoding specialists, and content delivery teams to design resilient systems that handle massive scale while maintaining exceptional quality of experience. Your day-to-day work will include developing automation to reduce operational toil, building sophisticated monitoring and observability solutions, leading incident response efforts, and driving post-incident reviews that result in meaningful reliability improvements. You will participate in on-call rotations and take ownership of critical infrastructure components, using data-driven approaches to identify and eliminate single points of failure. This role offers the opportunity to work with cutting-edge video technologies and influence architectural decisions that shape how Apple delivers streaming content worldwide.
Minimum Qualifications
Bachelor's degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience
5+ years of experience in Site Reliability Engineering, DevOps, or Systems Engineering
Production ownership at scale, including on-call and incident response, post-incident reviews, and driving operational improvements
Strong understanding of Linux fundamentals and networking principles, with experience operating and debugging production systems
Proficiency in at least one programming language (Shell, Python, Go, or similar) to reduce toil, build SRE tooling, and improve operability
Hands-on experience with cloud infrastructure and container orchestration
Excellent troubleshooting and root-cause analysis skills across the full technology stack
Preferred Qualifications
Strong understanding of distributed systems fundamentals, failure modes, and resilience patterns that prevent cascading outages
Track record of building and continuously improving observability (metrics/logs/traces), alert quality, and incident response processes for complex, high-traffic environments
Experience with performance optimization, capacity planning, and reliability engineering (load testing, bottleneck analysis, degradation strategies)
Proven ability to build and operate Infrastructure as Code and CI/CD pipelines, including safe deployment practices and change risk controls
Experience with video streaming technologies, codecs, protocols, and media delivery infrastructure
Strong communicator who can align and influence cross-functional partners to drive reliability outcomes
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant (https://www.eeoc.gov/sites/default/files/2023-06/22-088_EEOC_KnowYourRights6.12ScreenRdr.pdf).