Job Details

Back to Search

Job Information

Job Title :

AIML - Machine Learning Engineer, Technical Lead, Evaluation

Job Code :

AWM-9342-AIML - Machine Learning Engineer, Technical Lead, Evaluation

Job Announced :

2/25/2026

Job Closed :

3/2/2026

Pay Rate:

Negotiable

Duration:

Permanent

Other Information

Organization Name:

Apple

Organization Url:

www.apple.com

Address :

Cupertino, CA, 95015, USA

City :

Cupertino

State :

California

Country :

United States

Zip Code :

95015

Job Description

Weekly Hours: 40

Role Number: 200646769-0836

Summary

Do you want to play a part in building a groundbreaking technology for large scale systems, generative AI experiences and shipping next generation of Apple products?
You will improve and enrich lives of Apple users across the globe.
Join the Evaluation team at Apple AIML.

Description

You will play a critical role in scaling up global evaluation of Apple AIML products, with the primary focus on next generation Siri and Apple Intelligence features.
You will drive and scale our evaluation work to enable high-velocity development and shipping of Generative AI features globally, in every country and language where Apple AIML features are available.
You will drive LLM-based evaluation as a product, delivering simulation-based evaluation of personalized user experiences, reflective of cultural and language diversity of our customers.

The focus of this technical lead role is strategy and execution of high quality evaluation datasets grounded in “personas”, acquired and synthetically generated, by language and region.
This role requires a combination of engineering experience working with GenAI and ML based products, an ability to drive scale, and a relentless drive for improving signal-to-noise ratio.
This role’s success will be driven by building deep cross-functional partnerships and by embracing and leading breakthrough technologies.

Minimum Qualifications

Extensive experience with data-driven evaluation of complex systems (ML/AI/agents) and the ability to translate findings into product improvements.
Strong analytical and statistical foundations with the ability to design metrics and interpret noisy or ambiguous signals.
Demonstrated ability to collaborate with cross-functional partners (engineering, product, research) to drive impact at scale.
Excellent communication skills, including articulating technical insights to both technical and non-technical audiences.
BS/MS in Computer Science, Machine Learning, Statistics, Applied Math, or a related quantitative field.

Preferred Qualifications

Experience with evaluation of large language models, tool-enabled agents, or multi-component AI systems.
Background in simulation-based evaluation, user behavior modeling, or experimental design.
Experience designing evaluation that scales across languages, cultures, or diverse user populations.
Familiarity with instrumentation, telemetry, and large-scale measurement systems.
Prior involvement in shipping customer-facing products where measurement informed strategic decisions.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant (https://www.eeoc.gov/sites/default/files/2023-06/22-088_EEOC_KnowYourRights6.12ScreenRdr.pdf) .

Other Details

About Organization

Other Jobs

View other jobs from this employer

Apply Back