Job Details
Job Information
Other Information
Job Description

ML Framework (MetalLM) Engineer
Cupertino, California, United States
Software and Services
Summary
Posted: May 28, 2025
Role Number: 200572773
Apple’s ML Frameworks (MetalLM) team in GPU, Graphics and Machine Learning works on enabling Apple Intelligence through high-performance, distributed inference of GenAI applications (such as LLMs) on Private Cloud Compute. You will get to work on custom-built server hardware that brings the power and security of Apple silicon to the data center.
Team also works on GPU acceleration of ML Training frameworks such as PyTorch and JAX using Metal runtime and device backend. We are looking for engineers with systems background who are deeply passionate about building scalable, efficient, and production-grade solutions tailored for high-throughput GPU execution.
Description
Our team is seeking extraordinary machine learning and GPU programming engineers who are passionate about providing robust compute solutions for accelerating Machine learning libraries on Apple Silicon. Role has the opportunity to influence the design of compute and programming models in next generation GPU architectures.
Responsibilities:
Work on cutting-edge ML inference framework project and optimize code for efficient and scalable ML inference using distributed compute strategies such as data, tensor, pipeline and expert parallelism.
Develop kernel and compiler level optimizations and perform in-depth analysis to ensure the best possible performance across Server hardware families.
Apply advanced model optimization techniques including speculation, quantization, compression, and others to maximize throughput and minimize latency.
Collaborate closely with hardware, compiler, and systems teams to align software performance with hardware capabilities.
Analyze and improve performance metrics such as end-to-end latency, TTFT, TBOT, memory footprint, and compute efficiency.
Implement features of Metal device backend for ML training acceleration technologies
If this sounds of interest, we would love to hear from you!
Minimum Qualifications
3+ years of programming and problem-solving experience with C/C++/ObjC
Experience with GPU kernel development & optimizations using compute programming models such as Metal, CUDA etc.
Experience with system level programming and computer architecture
Experience with Distributed training or inference techniques
Preferred Qualifications
Experience with graph compilers such as Triton, OpenXLA or LLVM/MLIR is a plus
Contributions to an AI framework such as PyTorch, JAX or Tensorflow is a plus
Good understanding of machine learning fundamentals
Pay & Benefits
At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $143,100 and $264,200, and your base pay will depend on your skills, qualifications, experience, and location.
Apple employees also have the opportunity to become an Apple shareholder through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. You’ll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses — including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation.Learn more about Apple Benefits. (https://www.apple.com/careers/us/benefits.html)
Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics.Learn more about your EEO rights as an applicant (https://www.eeoc.gov/sites/default/files/2023-06/22-088_EEOC_KnowYourRights6.12ScreenRdr.pdf) .
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics.Learn more about your EEO rights as an applicant (https://www.eeoc.gov/sites/default/files/2023-06/22-088_EEOC_KnowYourRights6.12ScreenRdr.pdf) .
Apple will not discriminate or retaliate against applicants who inquire about, disclose, or discuss their compensation.
Apple participates in the E-Verify program in certain locations as required by law.Learn more about the E-Verify program (https://www.apple.com/jobs/pdf/EverifyPosterEnglish.pdf) .
Apple is committed to working with and providing reasonable accommodation to applicants with physical and mental disabilities. Reasonable Accommodation and Drug Free Workplace policy Learn more .
Apple is a drug-free workplace. Reasonable Accommodation and Drug Free Workplace policy Learn more .
Apple will consider for employment all qualified applicants with criminal histories in a manner consistent with applicable law. If you’re applying for a position in San Francisco, review the San Francisco Fair Chance Ordinance guidelines applicable in your area.
It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.
Other Details
