Job Details
Job Information
Other Information
Job Description
Weekly Hours: 40
Role Number: 200641184-3337
Summary
The Apple Services Engineering team is one of the most exciting examples of Apple’s long-held passion for combining art and technology. We are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. And we do it on a massive scale, meeting Apple’s high expectations with high performance, to deliver a huge variety of entertainment in over 35 languages to more than 150 countries. Our scientists and engineers build secure, end-to-end solutions powered by machine learning. Thanks to Apple’s unique integration of hardware, software, and services, designers, scientists and engineers here partner to get behind a single unified vision. That vision always includes a deep commitment to strengthening Apple’s privacy policy, one of Apple’s core values. Although services are a bigger part of Apple’s business than ever before, these teams remain small, flexible, and multi-functional, offering greater exposure to the array of opportunities here. Come join us to build large-scale personalized recommender systems for Apps & Games, Video, Fitness+, Podcast and Books Recommendations. See your work touch the lives of billions of Apple users worldwide.
Description
In this role, you will be responsible for operationalizing machine learning models—from building real-time and batch inference pipelines to optimizing system performance, reliability, and experimentation velocity. You’ll help bridge the gap between research and production by developing the infrastructure, tooling, and monitoring required to ship ML-driven features safely and efficiently.
If you are an engineer who enjoys scaling ML solutions, building production-grade services, and driving experimentation across billions of users, this is your opportunity to make a meaningful impact.
Minimum Qualifications
MS or PhD in Computer Science, Software Engineering, or related field.
2+ years of experience in production machine learning systems, especially for personalization or recommendations.
Experience with big data and stream processing frameworks like Spark, Flink, or Kafka.
Proficiency in object-oriented programming languages such as Java, Scala, or C++.
Experience building and maintaining large-scale distributed systems for ML workloads.
Deep understanding of ML model deployment pipelines, runtime optimization, and system integration.
Familiarity with A/B testing frameworks, experimental design, and online evaluation.
Strong focus on system reliability, latency, and observability in production environments.
Preferred Qualifications
Experience in batch and real-time inference serving, including autoscaling and traffic management.
Background in content recommendation systems, search ranking, or user engagement optimization.
Experience with CI/CD workflows for ML systems, including safe model rollouts and shadow testing.
Exposure to containerized deployments and orchestration (Kubernetes, Docker).
Experience building and deploying production-grade applications using LLMs, including expertise in prompt engineering, RAG pipelines, and framework orchestration.
Proven track record of developing autonomous agents capable of multi-step reasoning, external tool integration, and complex task decomposition to solve open-ended problems.
Prior experience working on consumer-scale media products (apps, games, books, music, or video).
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant (https://www.eeoc.gov/sites/default/files/2023-06/22-088_EEOC_KnowYourRights6.12ScreenRdr.pdf) .
Other Details

