Job Details

Job Information

AI Evaluation Engineer – Siri AI Agents
AWM-4049-AI Evaluation Engineer – Siri AI Agents
9/13/2025
9/18/2025
Negotiable
Permanent

Other Information

www.apple.com
Cupertino, CA, 95015, USA
Cupertino
California
United States
95015

Job Description

No Video Available
 

AI Evaluation Engineer – Siri AI Agents

Cupertino, California, United States

Machine Learning and AI

Summary

Posted: Sep 11, 2025

Weekly Hours: 40

Role Number: 200620244-0836

We are seeking talented engineers to join our team and push the boundaries of evaluations for Siri AI Agents. Evaluation lies at the heart of our model development strategy—it shapes architectural choices, guides launch decisions, and ultimately ensures a world-class user experience.
Our team is highly innovative and fast-moving, leveraging auto-evaluators and LLM-based judges to measure, validate, and continuously improve the core Siri AI engine. If you’re excited by the challenge of building trusted evaluation systems that directly impact the quality of a groundbreaking AI product used by millions worldwide, this role is for you.

Description

As an AI Evaluation Engineer, you will:
- Design, build, and maintain auto-evaluators that measure the quality of Siri’s core AI engine.
- Identify and triage issues and implement changes to improve auto-evaluator trustworthiness.
- Work with both simulators and real devices to ensure high-fidelity evaluation and a superior user experience.
- Collaborate with scientists and engineers across software and ML teams, contributing to products shipped across our portfolio of devices.

Minimum Qualifications

  • M.S. degree in Computer Science, Machine Learning, or a related technical field, or equivalent practical experience.

  • Strong skills in data analysis and statistical methods utilized in a problem solving environment

  • Passion for debugging, testing, and triaging issues in complex AI + software systems.

  • Proficiency in Python and experience developing production-quality code.

  • Understanding of large language models (LLMs) and awareness of their strengths and limitations.

Preferred Qualifications

  • Experience with large-scale ML model evaluation, testing pipelines, and triage.

  • Knowledge of data generation, training workflows, or context engineering.

  • Familiarity with real-world deployment challenges for AI/ML products

  • Knowledge of latest methodologies in LLM evaluations

Pay & Benefits

At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $147,400 and $272,100, and your base pay will depend on your skills, qualifications, experience, and location.

Apple employees also have the opportunity to become an Apple shareholder through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. You’ll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses — including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation.Learn more about Apple Benefits. (https://www.apple.com/careers/us/benefits.html)

Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics.Learn more about your EEO rights as an applicant (https://www.eeoc.gov/sites/default/files/2023-06/22-088_EEOC_KnowYourRights6.12ScreenRdr.pdf) .

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics.Learn more about your EEO rights as an applicant (https://www.eeoc.gov/sites/default/files/2023-06/22-088_EEOC_KnowYourRights6.12ScreenRdr.pdf) .

Apple will not discriminate or retaliate against applicants who inquire about, disclose, or discuss their compensation.

Apple participates in the E-Verify program in certain locations as required by law.Learn more about the E-Verify program (https://www.apple.com/jobs/pdf/EverifyPosterEnglish.pdf) .

Apple is committed to working with and providing reasonable accommodation to applicants with physical and mental disabilities. Reasonable Accommodation and Drug Free Workplace policy Learn more .

Apple is a drug-free workplace. Reasonable Accommodation and Drug Free Workplace policy Learn more .

Apple will consider for employment all qualified applicants with criminal histories in a manner consistent with applicable law. If you’re applying for a position in San Francisco, review the San Francisco Fair Chance Ordinance guidelines applicable in your area.

It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.

Other Details

No Video Available
--

About Organization

 
About Organization