Mid-Level

Machine Learning Engineer, Data Mining

Motional

Compensation

$144,000 - $192,000/year

Remote U.S.
Hybrid
Posted April 14, 2026

Job Description

Mission Summary:

At Motional, we're transforming how autonomous vehicles discover critical intelligence hidden within petabytes of multimodal sensor data. Our next-generation autonomous driving stack depends on finding the rare edge cases, long-tail scenarios, and model errors that matter most. Omnitag, our ML-powered multimodal data mining framework, is the engine that powers this discovery.

As a Machine Learning Engineer on the Data Mining team, your mission is to help build the "Brain" of this engine. You will use state-of-the-art foundation models to extract insights from Motional’s driving data, working at the intersection of large-scale representation learning and data retrieval. By building smarter mining tools and efficient data pipelines, you will accelerate the model improvement lifecycle for teams working on post-training analysis, error diagnosis, and dataset curation.

What You'll Do:

  • Build and Train ML Pipelines: Develop, train, and fine-tune machine learning models for multimodal sensor data (e.g., vision, LiDAR). Focus on implementing supervised and self-supervised learning approaches to improve data search and retrieval.
  • Support Model Deployment: Implement scalable data preprocessing and augmentation pipelines. Assist in applying standard optimization techniques (e.g., batch inference, quantization) to ensure models run efficiently in production environments.
  • Data Mining & Analysis: Help develop embedding-based search tools and "active learning" workflows to identify critical driving scenarios.
  • Monitor Production Performance: Help build and maintain dashboards to monitor model health, data drift, and system performance. Identify regressions and assist in the operational support of our data mining services.
  • Learn and Apply Best Practices: Follow software engineering standards (version control, CI/CD, unit testing) for ML code. Participate in code reviews and contribute to technical documentation.
  • Collaborate Across Teams: Work closely with senior engineers and machine learning engineers to translate model prototypes into maintainable, scalable engineering solutions.

What We're Looking For (Must-Haves):

  • BS or MS in Computer Science, Machine Learning, or a related field.
  • Hands-on experience with PyTorch (preferred) or TensorFlow/JAX. You should be comfortable training models and evaluating them using standard metrics.
  • Strong proficiency in Python with the ability to write clean, modular, and well-documented code.
  • Working knowledge of version control, unit testing, and basic software design patterns.
  • Experience working with large datasets, including proficiency in SQL and data libraries like Pandas and NumPy.
  • A solid grasp of the full ML lifecycle, from data cleaning and feature engineering to validation and deployment basics.
  • A proactive learner who thrives on constructive feedback and is eager to grow within a high-stakes engineering environment.

Bonus Points (Nice-to-Haves):

  • MS/PhD in Computer Science, Machine Learning, or a related field.
  • Experience with agentic systems, autonomous reasoning, chain-of-thought models, or LLM-based planning.
  • Background in autonomous driving, robotics, or real-time decision-making systems.
  • Familiarity with multimodal learning, sensor fusion, or embodied AI.
  • Experience building active learning loops that use the model to find the data that breaks it.
  • Experience with ML-based data mining, active learning, or contrastive learning.
  • Knowledge of model serving tools (TF Serving, Triton, TorchServe) and MLOps platforms.
  • Publications in top-tier conferences (e.g., ICCV, CVPR, ECCV).

We encourage a hybrid schedule, with in-office time at one of our locations in Boston, Pittsburgh, or Las Vegas to support collaboration, but this role can also be fully remote.
