About the role
By applying, you agree to our Applicant Privacy Policy.
You will help us move toward a future of decoupled control and data planes, scaling big data compute and storage platforms while ensuring secure and governed data access for MLOps and research. You will take full lifecycle ownership: from architecting the migration away from legacy orchestrators to implementing production-grade pipelines and participating in on-call rotations for critical training jobs.
- Build & Scale: Help us reach our goal of operating massive distributed compute and storage systems
- Global Orchestration: Architect and maintain multi-cluster orchestration layers to optimize workload placement across diverse hardware and regions.
- Design Future-Proof Storage: Architect our transition to modern storage formats to handle fine-tuning datasets at a scale that anticipates exabyte growth.
- Platform Engineering: Contribute to the development of our internal training platform, ensuring seamless model training and fine-tuning capabilities across Kubernetes and SLURM based environments.
- Metadata & Lineage: Implement and manage systems to provide clear visibility and lineage as our data and model pipelines grow in complexity.
- Operational Excellence: Use modern deployment workflows to manage cloud-native deployments, ensuring our data platform can scale by orders of magnitude while remaining reliable and efficient.
You might thrive in this role if you:
-
Have 4+ years of experience in Data Infrastructure, MLOps, or Infrastructure Engineering.
-
Have experience or a strong interest in supporting foundational compute and storage platforms.
-
Are proficient in Python and enjoy solving the "brittle data lake" problem with modern, columnar storage standards.
-
Are well-versed in Kubernetes-native tooling and excited to debug large-scale distributed systems across multi-cluster environments.
-
Take pride in building and operating scalable, reliable, and secure systems from the ground up.
-
Are comfortable with ambiguity and the challenges of building high-scale infrastructure in a rapid-growth AI environment.
Aplyr's read
Mistral AI is at the forefront of AI innovation, attracting talent keen on advancing natural language processing and machine understanding.
What's promising
- •Mistral AI focuses on cutting-edge natural language processing technology.
- •The company has a diverse range of roles, indicating growth and expansion.
- •Mistral AI's work aims to improve automation across various industries.
What to watch
- •Limited public information about the company's financial stability.
- •Potentially high competition in the AI sector could impact market share.
- •Rapid technological changes may require constant adaptation and learning.
Why Mistral AI
- •Mistral AI specializes in enhancing machine understanding of human language.
- •The company hires globally, indicating a commitment to diverse perspectives.
- •Mistral AI's focus on natural language processing sets it apart in the AI field.
Aplyr’s read is generated by AI from public sources. Was it useful?
About Mistral AI
Mistral AI is an innovative company focused on developing advanced AI models and solutions, particularly in the realm of natural language processing. Their work aims to enhance machine understanding and generation of human language, impacting various industries by improving automation and efficiency.
Similar roles
Senior Research Engineer, Data Engine
Intrinsic (Alphabet)
Lead Lab Support Engineer
Graphcore
Research Engineer, Frontier Speculative Decoding
Together AI
Research Engineer, Data Infrastructure
Mistral AI
Research Engineer / Scientist, Alignment Science - London
Anthropic
Research Engineer / Scientist, Alignment Science
Anthropic