Back to Search
Overview
Lead / Manager

Senior Data Scientist

Confirmed live in the last 24 hours

Sauce Labs Inc.

Sauce Labs Inc.

Gurgaon, india
Hybrid
Posted April 8, 2026

Job Description

Location Preference: Gurgaon, India

This is an in office position 5 days a week.

About Us:

Sauce Labs is the world’s largest full-lifecycle, test automation platform, and the company behind Selenium. Trusted by 80% of the world’s top ten largest financial institutions and over 300,000 enterprise users, Sauce Labs provides the only AI platform capable of turning business intent into autonomous testing and quality assurance. With a proprietary dataset of 8.7 billion test runs, Sauce Labs empowers the Fortune 2000 to bridge the gap between AI-driven code generation and enterprise-grade software quality. Learn more at saucelabs.com.

The Role:

At Sauce Labs, we’re looking for a Data Scientist / GenAI Engineer to join our team and work directly with our engineering crew on the next generation of AI-powered products. You’ll be right in the mix of building, evaluating, and refining our new AI Assistant, helping our customers unlock deeper, smarter insights from their testing data. If you love collaborating across teams to turn complex data into helpful AI features, we’d love to meet you!


Responsibilities:

  • Collaborate with the engineering team to execute experiments and provide insights
    • Prompt engineering and optimization for accuracy, relevance, and hallucination reduction
    • Research new use cases for AI-powered features
    • Monitor the accuracy of AI solutions over time
  • Collect and analyze data across Sauce Labs
    • Manage the data directory across Sauce Labs - work with the data engineering team
    • Analyze time-series testing datasets to identify patterns and insights
    • Analyze telemetry data for performance and usage patterns
    • Analyze logs and traces for root cause analysis
    • Discover actionable insights from the data
  • Evaluate model performance using GenAI evaluation frameworks
    • Design and maintain golden datasets for GenAI evaluation
    • Build evaluation pipelines using MLflow and LLM-as-judge frameworks
    • Develop deterministic and LLM-based scoring rubrics for answer validation

Required Skills:

  • 5+ years of experience
  • Strong Python skills (Pandas, data manipulation, LLM frameworks)
  • Experience with GenAI evaluation metrics (recall@k, MRR, faithfulness, F1)
  • Proficiency in prompt engineering (few-shot, grounding, structured outputs)
  • Familiarity with RAG techniques (hybrid retrieval, re-ranking, chunking strategies)
  • SQL proficiency (Snowflake or PostgreSQL)
  • Understanding of LLM-as-judge evaluation and scoring rubrics
  • Knowledge of data governance (bronze/silver/gold data tiers)
  • Experience with experiment tracking tools (MLflow, Weights & Biases, LangSmith)
  • Experience with agentic frameworks (MCP, tool calling, ReAct patterns)

Nice to Have:

  • Knowledge of fine-tuning techniques (SFT, LoRA, DPO)
  • Familiarity with vector databases (Pinecone, Weaviate, Chroma)
  • Understanding of LLM security (prompt injection defense, tool safety)
  • Experience with advanced RAG (Graph-RAG, Self-RAG, Corrective RAG)
  • Knowledge of Snowflake Cortex AI features

Please note our privacy terms when applying for a job at Sauce Labs.

Sauce Labs is proud to be an Equal Opportunity employee and values diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender identity/expression/status, sexual orientation, age, marital status, veteran status or disability status.

Security responsibilities at Sauce

At Sauce, we will commit to supporting the health and safety of employees and properties, partnering with internal stakeholders to learn and act on ever-evolving security protocols and procedures. You’ll be expected to fully comply with all

reactpythongorustaidataproductdesign