Back
Verified active · 12h ago

Senior Machine Learning Engineer- LLMs & Self-Hosted AI

NavanNavan·Travel Technology

Apply effort

~12 min

Company site

Posted

5 days

01

About the role

We are looking for a highly skilled Senior ML Engineer to lead our transition from third-party LLM APIs to a fully self-hosted ecosystem by fine-tuning high-performance, domain-specific models.

Our core product is an advanced, agentic support chatbot capable of complex reasoning, API tool calling, database lookups, and orchestrating specialized LLMs for specific tasks.

What You’ll Do:

  • Model Fine-Tuning: Design and execute fine-tuning strategies to improve model accuracy on specific domain tasks and tool-calling execution.
  • Agentic Workflows: Develop and refine the chatbot's agentic capabilities, ensuring reliable tool-use, routing, and interactions between massive LLMs and specialized SLMs.
  • Inference Optimization: Deploy and manage large-scale models using high-performance inference engines (like vLLM) to ensure low latency and high throughput for our agentic chatbot.
  • Rigorous Evaluation: Build comprehensive offline and online evaluation frameworks to constantly measure model performance and business impact through structured A/B testing.

What We’re Looking For:

Core Engineering & AI Frameworks

  • Deep experience with PyTorch and the Hugging Face ecosystem.
  • Strong Data Engineering skills: data manipulation, synthetic data generation, and active learning/margin-sampling.
  • High proficiency with AI-assisted development workflows (e.g., Claude Code, Cursor, Codex) to accelerate development.

LLMs & Agents

  • Strong fundamental understanding of LLM architectures, attention mechanisms, and generation parameters.
  • Hands-on experience building Agentic systems (ReAct, function/tool calling, RAG).
  • Expertise in fine-tuning strategies (e.g., SFT, RLHF, DPO) and parameter-efficient techniques (PEFT/LoRA).

Bonus Points

  • Alignment Techniques: Experience with RLHF and DPO strategies for future reasoning-model development.
  • Containerization & Orchestration: Experience with Ray for orchestrating large-scale model deployments across multi-GPU clusters.
  • Model Quantization: Experience with memory optimization techniques like AWQ, GPTQ, or GGUF to fit 70B models efficiently onto hardware.
  • API Development: Proficiency in building robust, asynchronous microservices using FastAPI to serve model requests.
  • Experience with core MLOps practices, including dataset versioning (e.g., DVC), experiment tracking (e.g., Weights & Biases, MLflow), and model registries.
02

Aplyr's read

Navan streamlines business travel and expense management, attracting tech-savvy professionals seeking to innovate in travel technology.

Synthesized from recent postings & public sources

What's promising

  • Navan's platform simplifies complex travel and expense processes, enhancing user experience.
  • Recent hiring of engineering and strategy roles indicates growth and investment in innovation.
  • Focus on specialty travel shows adaptability to diverse business needs.

What to watch

  • Highly competitive travel tech market may pressure Navan's growth.
  • Limited public information about financial health raises questions for potential employees.
  • Rapid expansion could strain company culture and resources.

Why Navan

  • Navan integrates travel booking and expense management into a single platform.
  • Strong emphasis on user-friendly technology differentiates Navan in the travel sector.
  • Specialty travel roles suggest a focus on niche market segments.

Aplyr’s read is generated by AI from public sources. Was it useful?

03

About Navan

Navan is a travel and expense management platform that simplifies business travel and expense reporting for companies.

04

Similar roles