Back to Search
As a Model Optimization & Deployment Engineer, you will focus on bringing highly efficient, production-ready large-scale models to our on-vehicle stack. We are looking for experts with hands-on experience in compressing, accelerating, and deploying complex models (LLMs, VLMs, or FMs) for power- and thermal-constrained vehicle SOCs. You will optimize the ML models, write custom CUDA kernels, and build highly concurrent inference code to ensure real-time, deterministic execution on edge devices.


Senior
Senior AI Inference Engineer - Model Optimization & Deployment
Confirmed live in the last 24 hours
Zoox
Compensation
$242k - $290k/per-year-salary
Foster City, CA
On-site
Posted April 11, 2026
Job Description
The Perception team is pioneering the development of a multi-modality foundation model to drive the next generation of autonomous system intelligence.
As a Model Optimization & Deployment Engineer, you will focus on bringing highly efficient, production-ready large-scale models to our on-vehicle stack. We are looking for experts with hands-on experience in compressing, accelerating, and deploying complex models (LLMs, VLMs, or FMs) for power- and thermal-constrained vehicle SOCs. You will optimize the ML models, write custom CUDA kernels, and build highly concurrent inference code to ensure real-time, deterministic execution on edge devices.
aiproduct
Similar Jobs
Microsoft
Product Manager II - Foundry Model Inference (CoreAI)
Lead / ManagerUnited States, Washi...
Weights & Biases
Principal Product Manager, W&B Inference - Weights & Biases
PrincipalLivingston, NJ / New...$206,000 - $303,000/year
Microsoft
Senior Product Manager - Foundry Model Inference (CoreAI)
SeniorUnited States, Washi...