Mid-Level
Software Development Engineer (Level 5) - GENAI/ML, Amazon Selection and Catalog Systems (ASCS)
Amazon.com Services LLC
Seattle, WA, USA
On-site
Posted January 8, 2026
Job Description
Join the Veritas team within Amazon's Selection and Catalog Systems (ASCS) organization as a Software Development Engineer focused on GENAI/ML initiatives. The Veritas team owns Amazon's premier LLM benchmarking and evaluation platform, which is critical for measuring and improving AI performance across the world's largest e-commerce product catalog.
In this role, you'll work directly with Large Language Models (LLMs) and build agents to enhance catalog data quality and customer experience at large scale. You'll have extensive opportunities to work with in-house LLM hosting and inference systems, Amazon Bedrock, prompt translation, prompt tuning techniques, and agentic solutions. As part of the team that evaluates AI performance across billions of products and attributes, you'll help build and leverage AI agents at scale to assess LLMs, their applications, and the customer experiences they power.
The Veritas team provides a unique opportunity to combine advanced generative AI development with large-scale distributed systems engineering, while working on benchmarking and evaluation frameworks that teams across Amazon depend on for their AI development and deployment decisions.
Key job responsibilities
As a Software Development Engineer (SDE) in Veritas, you will develop systems and agents powered by LLMs and multi-modal LLMs to enhance benchmarking and evaluation across Amazon's catalog ecosystem. Build GenAI-driven solutions that improve evaluation quality and automation for both Bedrock models and open source LLMs for Starfish and other Store Agent systems. Design AI-driven workflows for various data to enhance LLM benchmarking and performance measurement. Work extensively with Amazon Bedrock, in-house LLM hosting, and inference systems to build scalable evaluation pipelines for models and applications. Partner with scientists and AI experts to integrate advanced developments in Generative AI, LLM evaluation, and prompt translation/optimization. Create comprehensive datasets, evaluation methodologies, and standardized metrics to generalize benchmarking use cases and accelerate foundation model switch decisions for various applications.
The ideal candidate brings experience in distributed systems, designing and implementing high-scale software services, and agile, continuous delivery practices. You are a Software Development Engineer who takes ownership of services, puts customers first, and is committed to delivering high-quality solutions.
About the team
The Veritas team is a specialized, innovation-focused group within Amazon Selection and Catalog Systems (ASCS) - Amazon's Catalog System Services (CSS) organization. We own Amazon's premier LLM evaluation and benchmarking platform (Veritas), used by teams across ASCS and the broader company to measure and improve model performance for catalog applications. We evaluate AI across billions of products and attributes, collaborating with science teams on catalog-specific AI research, prompt translation/optimization, and model evaluation. You'll work with the latest generative AI technology, including custom model hosting infrastructure and advanced prompt engineering tools. Growth opportunities include leading industry-defining benchmarking standards for e-commerce AI and taking on leadership roles across Amazon's catalog ecosystem.
We foster a collaborative environment where innovation thrives, technical excellence is celebrated, and every team member shapes the future of AI-powered catalog systems while maintaining work-life balance and continuous learning.
Basic Qualifications
- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience programming with at least one software programming language
Preferred Qualifications
- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.