Back

AI Evaluation Engineer

AppleApple·Consumer Electronics

Apply effort

~12 min

Company site

Posted

124 days

01

About the role

Summary

Imagine what you could do here. At Apple, new ideas have a way of becoming outstanding products, services, and customer experiences very quickly. Bring passion and dedication to your job, and there's no telling what you could accomplish. Apple’s Sales organization generates the revenue needed to fuel our ongoing development of products and services. This, in turn, enriches the lives of hundreds of millions of people around the world. We are, in many ways, the face of Apple to our largest customers. Apple's US Decision Intelligence (DI) team is looking for a talented individual who is passionate about crafting, implementing, and operating AI solutions that have a direct and measurable impact on Apple Sales and its customers.

Description

We’re seeking a visionary AI Evaluations Engineer to own the end-to-end evaluation pipeline for our AI products. This role will focus on implementing and maintaining evaluation frameworks, instrumentation, and workflows that help us understand how well our AI systems perform, where they fail, and how they improve over time. This role will operate in both capacities, to augment existing AI roadmap, as well as innovate and trailblazing new frontier tech projects, crafting AI experiences that reduce time to insights and catalyze decision making.

Minimum Qualifications

4+ years of experience in data and AI-related fields such AI engineering, software development, ML engineering, data science, or QA roles. We’re looking for someone with an eagerness and ability to learn new skills and solve dynamic problems in an encouraging and expansive environment. Working across global teams to ensure alignment of product development. Strong Python skills. Applied knowledge of GenAI and RAG strategies, micro-services, recommendation systems, and context engineering. Familiarity of AI evaluation techniques, such as Golden datasets, LLM-as-judge, or rubric-based scoring. Experience with different LLM ecosystems (OpenAI, Anthropic, Gemini, etc.), RAG pipelines, vector databases (e.g., Pinecone, FAISS, Milvus, PostgreSQL). Proficiency in SQL and experience with at least one major data analytics platform, such as Hadoop, Spark, or Snowflake. Experience with CI/CD or release validation workflows. Familiarity with telemetry and evaluation frameworks for AI agents. Experience working with data science teams on insights generation leveraging LLMs. Knowledge of project management, and productivity tools such as Wrike and Miro. Strong time management skills with the ability to collaborate across multiple teams. Able to balance competing priorities, long-term projects, and ad hoc requirements. Ability to work in a fast-paced, dynamic, constantly evolving business environment. B.S. Degree in Computer Science/Engineering, or equivalent work experience

Preferred Qualifications

Hands-on experience with Langfuse or similar tools for LLMs observability. Sound communication skills - expert at messaging domain and technical content, at a level appropriate for the audience. Strong ability to gain trust with stakeholders and senior leadership. Familiarity with embeddings, retrieval algorithms, agents, and data modeling for vector development graphs.  Other complementary technologies for distributed systems architecture and asynchronous messaging, agent communication, and catching like RabbitMQ, Redis, and Valkey are preferred. Advanced Degree (MS or Ph.D.) in Economics, Electrical Engineering, Statistics, Data Science, or a similar quantitative field is preferred.

Skills & Tags

02

Aplyr's read

Apple is a tech giant known for its sleek design and innovation, attracting top talent in engineering, design, and business operations.

Synthesized from recent postings & public sources

What's promising

  • Apple consistently leads in tech innovation with a strong focus on design and user experience.
  • The company's global brand recognition offers employees a prestigious platform for career growth.
  • Apple's robust ecosystem integrates hardware, software, and services, creating diverse job opportunities.

What to watch

  • High-pressure work environment with demanding deadlines can impact work-life balance.
  • Apple's secretive culture may limit transparency and cross-departmental communication.
  • Dependence on hardware sales makes the company vulnerable to market saturation risks.

Why Apple

  • Apple's design philosophy emphasizes simplicity and elegance, setting it apart in the tech industry.
  • The company has a unique retail presence with its own stores enhancing customer experience.
  • Apple's closed ecosystem creates a seamless integration across its products, unmatched by competitors.

Aplyr’s read is generated by AI from public sources. Was it useful?

03

About Apple

AAPL$298.01+0.70%

Apple Inc. is a leading technology company known for its innovative consumer electronics, software, and services. The company designs and manufactures products such as the iPhone, iPad, Mac computers, and wearables, significantly influencing the tech industry and consumer behavior worldwide.

04

Similar roles