Job Details

Back to Search

Job Information

Job Title :

Evaluation & Insights Engineer

Job Code :

AWM-6253-Evaluation & Insights Engineer

Job Announced :

12/4/2025

Job Closed :

12/9/2025

Pay Rate:

Negotiable

Duration:

Permanent

Other Information

Organization Name:

Apple

Organization Url:

www.apple.com

Address :

Seattle, WA, 98194, USA

City :

Seattle

State :

Washington

Country :

United States

Zip Code :

98194

Job Description

Weekly Hours: 40

Role Number: 200632687-3337

Summary

Imagine what you could do here. At Apple, great new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish!

Are you passionate about music, movies, and the world of Artificial Intelligence and Machine Learning? So are we! Join our Human-Centered AI team for Apple Products. In this role, you'll represent the user perspective on new features, review and analyze data, and evaluate AI models powering everything from search and recommendations to other innovative features. Collaborate with Data Scientists, Researchers, and Engineers to drive improvements across our platforms.

Description

We are looking for a Evaluation & Insights Engineer Human-Centered AI team to help evaluate and improve AI systems by combining data science, model behavior analysis, and qualitative insights. In this role, you will analyze AI outputs, develop evaluation frameworks, design qualitative, and translate findings into actionable improvements for product and engineering teams. This role blends deep technical expertise with strong analytical judgment to assess, interpret, and improve the behavior of advanced AI models. You will work cross-functionally with the Engineering and Project Managers, Product, and Research teams to ensure that AI experience is reliable, safe, and aligned with human expectations.

Minimum Qualifications

Bachelor’s or Master’s degree in Data Science, Computer Science, Linguistics, Cognitive Science, HCI, Psychology, or a related field.
Experience: 5+ years in data science, machine learning evaluation, ML ops, annotation quality, safety evaluation, or a similar applied role.
Technical Skills:
Proficiency in Python for data analysis (pandas, NumPy, Jupyter, etc.).
Experience working with large datasets, annotation tools, or model-evaluation pipelines.
Ability to design taxonomies, categorization schemes, or structured rating frameworks.
Analytical Strength: Ability to interpret unstructured data (text, transcripts, user sessions) and derive meaningful insights.
Communication: Strong ability to stitch together qualitative and quantitative findings into actionable guidance.

Preferred Qualifications

Experience working directly with LLMs, generative AI systems, or NLP models.
Familiarity with evaluations specific to AI safety, hallucination detection, or model alignment.
Experience designing annotation tasks or working with human labelers.
Understanding of mixed-method analysis (qualitative + quantitative).
Experience building internal tools, scripts, or dashboards for evaluation workflows.
Familiarity with prompt engineering, RAG systems, or model fine-tuning.
Experience evaluating LLMs, multimodal models, or other generative AI systems at scale.
Expertise in designing annotation guidelines and managing large annotation teams or vendors.
Background in human factors, social science, or qualitative assessment methodologies.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant (https://www.eeoc.gov/sites/default/files/2023-06/22-088_EEOC_KnowYourRights6.12ScreenRdr.pdf) .

Other Details

About Organization

Other Jobs

View other jobs from this employer

Apply Back