Back to Search






Mid-Level
Sr. Software Dev Engineer, Stores Foundational AI -SFAI
Confirmed live in the last 24 hours
Amazon.com Services LLC
Seattle, WA, USA
On-site
Posted March 31, 2026
Job Description
We’re building foundational large language model capabilities for Amazon Stores that combine general world knowledge with Amazon’s e-commerce domain expertise to create more intuitive, conversational, and personalized shopping experiences for our customers. We’re looking for pioneers who are passionate about technology, innovation, and customer experience, and who want to make a lasting impact in a rapidly evolving space. You’ll work alongside talented scientists and engineers to invent on behalf of customers and unlock the next generation of LLM-powered shopping experiences.
If you’re excited about working at the intersection of large-scale ML systems, post-training and inference optimization, and customer-facing innovation, this is a unique opportunity to join a dynamic team shaping the future of AI at Amazon.
Key job responsibilities
In this role, you will leverage your engineering expertise to develop and optimize generative AI systems for shopping. On a day-to-day basis, you will:
* Design and optimize high-performance kernels, custom operators, and low-level acceleration techniques that maximize hardware utilization and reduce computational overhead for LLM training and inference.
* Drive improvements in memory management, parallel computing, kernel fusion, attention optimization, and matrix multiplication efficiency to reduce latency and increase throughput at scale.
* Partner closely with applied scientists, engineering teams and product managers to define requirements, support experimentation, and deliver production-ready systems.
* Move quickly in ambiguous environments, make thoughtful short- and long-term trade-offs, and deliver incrementally across a wide range of technologies, from distributed data processing to ML infrastructure and kernel-level optimization.
* Develop tooling to accelerate experimentation, improve observability, and generate insights across model quality, latency, throughput, and efficiency metrics.
- 5+ years of programming with at least one software programming language experience
- 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience as a mentor, tech lead or leading an engineering team
- Experience with one of the following areas: machine learning technologies, Reinforcement Learning, Deep Learning, Computer Vision, Natural Language Processing (NLP) or related applications
- Bachelor's degree in computer science or equivalent
- Experience with Machine Learning and Large Language Model fundamentals, including architecture, training/inference lifecycles, and optimization of model execution, or experience in computer architecture
- Experience with CUDA kernels or ML/low-level kernels
- Experience with vLLM, SGLang, TensorRT or similar platforms in production environments, or experience working with PyTorch or JAX software
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company’s reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
If you’re excited about working at the intersection of large-scale ML systems, post-training and inference optimization, and customer-facing innovation, this is a unique opportunity to join a dynamic team shaping the future of AI at Amazon.
Key job responsibilities
In this role, you will leverage your engineering expertise to develop and optimize generative AI systems for shopping. On a day-to-day basis, you will:
* Design and optimize high-performance kernels, custom operators, and low-level acceleration techniques that maximize hardware utilization and reduce computational overhead for LLM training and inference.
* Drive improvements in memory management, parallel computing, kernel fusion, attention optimization, and matrix multiplication efficiency to reduce latency and increase throughput at scale.
* Partner closely with applied scientists, engineering teams and product managers to define requirements, support experimentation, and deliver production-ready systems.
* Move quickly in ambiguous environments, make thoughtful short- and long-term trade-offs, and deliver incrementally across a wide range of technologies, from distributed data processing to ML infrastructure and kernel-level optimization.
* Develop tooling to accelerate experimentation, improve observability, and generate insights across model quality, latency, throughput, and efficiency metrics.
Basic Qualifications
- 5+ years of non-internship professional software development experience- 5+ years of programming with at least one software programming language experience
- 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience as a mentor, tech lead or leading an engineering team
- Experience with one of the following areas: machine learning technologies, Reinforcement Learning, Deep Learning, Computer Vision, Natural Language Processing (NLP) or related applications
Preferred Qualifications
- 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience- Bachelor's degree in computer science or equivalent
- Experience with Machine Learning and Large Language Model fundamentals, including architecture, training/inference lifecycles, and optimization of model execution, or experience in computer architecture
- Experience with CUDA kernels or ML/low-level kernels
- Experience with vLLM, SGLang, TensorRT or similar platforms in production environments, or experience working with PyTorch or JAX software
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company’s reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
rustawsmachine learningaidataproductdesign
Similar Jobs
GE HealthCare
AI Algorithm and Development Software Engineer
Mid-LevelBeiJing
GE HealthCare
Staff Data Scientist
StaffIND19-01-Bengaluru-E...
UPS
Procurement Process Specialist -Grade 101
Mid-LevelIN - PUNE III GLOBAL...
Citigroup
AI Capable Java Engineer
Mid-LevelPune Maharashtra Ind...€52,400 - €82,915/year
Wells Fargo
Senior Software Engineer - Gen AI
SeniorBengaluru, India
Wells Fargo
Financial Crimes Associate Manager
Lead / ManagerHyderabad, India