Singapore, Singapore Software and Services
Our team is comprised of very talented individuals who are passionate about LLM and ensuring Apple services are at their best.
As part of the Human-Centered AI team, you'll play a central role in enhancing the user experience.Responsibilities include:- Collaborate with Engineering, Products, Research, Operations, and Editorial teams to evaluate algorithms and AI models powering various features, identifying opportunities for improvement.- Build data products (feature datasets, analyses, models, etc.) and scalable tools (typically in Python or Scala) to drive hypothesis generation and support collaborative decision-making with our partner teams in engineering and product management.- Create structured evaluations to assess the quality of AI-generated responses, ensuring they align with company standards and customer expectations.- Create evaluation task design and guidelines; identify a relevant data annotation platform to run evaluations at scale.- Implement metrics to measure the effectiveness and accuracy of models to ensure they meet performance standards.- Establish data quality thresholds and reporting on metrics & insights to inform feature business decisions.- Monitor LLM performance in production environments through human evaluations, identifying trends, and raising alerts when quality degradation occurs.- Perform detailed failure analysis to understand model weaknesses and identify areas for improvement, offering actionable insights to engineers- Maintain high standards for data quality and continuously enhance processes based on both quantitative and qualitative feedback