Job Details
Job Information
Other Information
Job Description
Role Number: 200634806-3337
Summary
Are you passionate about Generative AI and excited to work on groundbreaking modeling technologies that will enrich the lives of billions? The Intelligence System Experience (ISE) team within Apple’s software organization is a multidisciplinary group operating at the intersection of Multimodal Foundation Models, Efficient and Scalable ML Infrastructure, and Personalized Intelligent Experiences. As a senior machine learning engineer on our team, you will design software systems and algorithms that enable performant, scalable training and inference for Apple’s AI-driven experiences across both on-device and server environments. This role also includes opportunities to open source your work. Join our team of highly skilled, impact-focused engineers!
Description
We’re seeking strong senior machine learning engineers to help build next-generation tools for large-scale deep learning. You’ll join a team focused on accelerating training and inference speed, improving scalability, and advancing Apple’s centralized ML platform. Candidates should bring polished coding skills and a passion for machine learning and computational science. We offer a respectful work environment, flexible responsibilities, and access to world-class experts and growth opportunities.
In this role, you will develop core components for our scalable ML platform, push the limits of existing training technologies, and create new techniques to overcome system constraints. Your work will be deployed on high-impact tasks across teams building Apple Intelligence products, with opportunities to open-source your contributions. We are especially looking for a PyTorch-focused ML efficiency expert to optimize training and inference performance, improve distributed training throughput, and drive system-level efficiency for large-scale models. If you have deep experience with PyTorch internals and high-performance ML infrastructure, we’d love to hear from you.
Minimum Qualifications
PhD or Master's degree in the area of Computer Science, or equivalent years of industry experience
3+ years working with AI/ML technologies in production or research settings
Strong Python programming skills
Understanding software design principles, and algorithms
Experience with deep learning frameworks, such as PyTorch
Experience building large-scale distributed systems
Familiarity with parallelization algorithms for large model training
Familiarity with recent developments in foundation model architectures
Preferred Qualifications
Experience developing model parallel and data parallel training solutions and other training optimizations
Experience with parallel training libraries such as torch.distributed, DeepSpeed, or FairScale
Experience with CUDA-level optimization
Experience building ML models targeting Apple Silicon
Experience building large-scale deep learning infrastructure or platforms for distributed model training
Publication record at Machine Learning conferences such as MLSys, NeurIPS, etc.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant (https://www.eeoc.gov/sites/default/files/2023-06/22-088_EEOC_KnowYourRights6.12ScreenRdr.pdf) .
Other Details

