Job Details
Job Information
Other Information
Job Description
Role Number: 200622800-0836
Summary
As part of the work on machine-generated dialog we are developing novel measurements of its quality. These include cutting-edge llm-judges for aspects like groundedness (lack of hallucinations), Siri Tone and Style (a suite of Design requirements), Safety, and others.
Description
To measure our progress on this front, we need to track the state of our dataset composition, accuracy of llm-judges, human expert review results in a central and visual representation. A DRI for Metrics and Reporting will:
Minimum Qualifications
M.S. or Ph.D. in Computer Science, Data Science, Data Engineering
3+ years in data-science and/or data-engineering (iceberg, pandas python, Tableau or equivalent, data collection and visualization)
2+ years of python coding
Good understanding of metrics, crowd science, annotation analysis, statistics
Ability to work independently and cross-functionally to integrate in partner team reporting systems and pipelines
Excellent communication skills and the ability to thrive in a highly collaborative work environment
Good engineering practices to create sustainable and easy to use metric reporting pipelines
Preferred Qualifications
Experience with writing and architecting production level code
Deep understanding of Machine Learning concepts
Experience in Model training and/or evaluation
Good engineering practices to create sustainable and easy to use metric reporting pipelines
Attunement to computational linguistics, language quality is a plus
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant (https://www.eeoc.gov/sites/default/files/2023-06/22-088_EEOC_KnowYourRights6.12ScreenRdr.pdf) .
Other Details

