About the role

Aplyr's Quick Take

This role is for a Senior Machine Learning Engineer focused on transitioning from third-party LLM APIs to a self-hosted model ecosystem. You'll be responsible for architecting and optimizing infrastructure for deploying and managing multiple large-scale models, fine-tuning them for specific tasks, and ensuring high performance for a chatbot product.

Good fit

Ideal candidates have extensive experience with LLMs, model deployment, and fine-tuning, particularly in Python and PyTorch. A hands-on approach and familiarity with high-performance inference engines will help you thrive in this role.

Worth noting

The position emphasizes building a scalable infrastructure for a large number of models, which may be challenging given the current deployment issues. The focus on agentic systems and advanced chatbot capabilities suggests a cutting-edge environment but may require a steep learning curve.

We are looking for a highly skilled Senior ML Engineer to lead our transition from third-party LLM APIs to a fully self-hosted ecosystem by fine-tuning high-performance, domain-specific models.

Our core product is an advanced, agentic support chatbot capable of complex reasoning, API tool calling, database lookups, and orchestrating specialized LLMs for specific tasks.

What You’ll Do:

Model Fine-Tuning: Design and execute fine-tuning strategies to improve model accuracy on specific domain tasks and tool-calling execution.
Agentic Workflows: Develop and refine the chatbot's agentic capabilities, ensuring reliable tool-use, routing, and interactions between massive LLMs and specialized SLMs.
Inference Optimization: Deploy and manage large-scale models using high-performance inference engines (like vLLM) to ensure low latency and high throughput for our agentic chatbot.
Rigorous Evaluation: Build comprehensive offline and online evaluation frameworks to constantly measure model performance and business impact through structured A/B testing.

What We’re Looking For:

Core Engineering & AI Frameworks

Deep experience with PyTorch and the Hugging Face ecosystem.
Strong Data Engineering skills: data manipulation, synthetic data generation, and active learning/margin-sampling.
High proficiency with AI-assisted development workflows (e.g., Claude Code, Cursor, Codex) to accelerate development.

LLMs & Agents

Strong fundamental understanding of LLM architectures, attention mechanisms, and generation parameters.
Hands-on experience building Agentic systems (ReAct, function/tool calling, RAG).
Expertise in fine-tuning strategies (e.g., SFT, RLHF, DPO) and parameter-efficient techniques (PEFT/LoRA).

Bonus Points

Alignment Techniques: Experience with RLHF and DPO strategies for future reasoning-model development.
Containerization & Orchestration: Experience with Ray for orchestrating large-scale model deployments across multi-GPU clusters.
Model Quantization: Experience with memory optimization techniques like AWQ, GPTQ, or GGUF to fit 70B models efficiently onto hardware.
API Development: Proficiency in building robust, asynchronous microservices using FastAPI to serve model requests.
Experience with core MLOps practices, including dataset versioning (e.g., DVC), experiment tracking (e.g., Weights & Biases, MLflow), and model registries.

Skills & Tags

react go machine learning ai data product design

Aplyr's read

Navan streamlines business travel and expense management, attracting tech-savvy professionals seeking to innovate in travel technology.
Synthesized from recent postings & public sources

What's promising

•Navan's platform simplifies complex travel and expense processes, enhancing user experience.
•Recent hiring of engineering and strategy roles indicates growth and investment in innovation.
•Focus on specialty travel shows adaptability to diverse business needs.

What to watch

•Highly competitive travel tech market may pressure Navan's growth.
•Limited public information about financial health raises questions for potential employees.
•Rapid expansion could strain company culture and resources.

Why Navan

•Navan integrates travel booking and expense management into a single platform.
•Strong emphasis on user-friendly technology differentiates Navan in the travel sector.
•Specialty travel roles suggest a focus on niche market segments.

Aplyr’s read is generated by AI from public sources. Was it useful?