About the role
About Sentry
Bad software is everywhere, and we’re tired of it. Sentry is on a mission to help developers write better software faster so we can get back to enjoying technology.
With more than $217 million in funding and 100,000+ organizations that believe we’re on to something, we're building performance and error monitoring tools that help companies like Disney, Microsoft, and Atlassian spend less time fixing bugs and more time building products.
Sentry embraces a hybrid work model across our global hubs, with Mondays, Tuesdays, and Thursdays set as in-office anchor days to encourage meaningful collaboration. If you like to selfishly build things that make your digital life better, come help us build the next generation of software monitoring tools.
About the role
As a Senior Software Engineer on Sentry’s AI/ML team, you’ll be responsible for building the evaluation infrastructure that measures the accuracy, reliability, and real-world performance of our AI systems. This role is critical to ensuring that our debugging agents and AI-powered features behave correctly, safely, and predictably as they scale. You’ll design datasets, benchmarks, and test harnesses that turn ambiguous AI behavior into measurable signals, helping the team ship AI with confidence.
In this role you will
Design and build robust evaluation frameworks to measure accuracy, reliability, regressions, and edge cases in AI systems
Create and curate high-quality datasets, golden test cases, and benchmarks grounded in real production data
Build automated test harnesses and metrics pipelines to continuously evaluate models, prompts, and agentic workflows
Partner closely with applied AI engineers and product leaders to define what “good” looks like and translate it into measurable criteria
Own the evaluation lifecycle for major AI initiatives, from early experimentation through production monitoring
You’ll love this job if you
Care deeply about correctness, rigor, and measurement in AI systems
Enjoy turning fuzzy product goals and model behavior into concrete tests and metrics
Like building foundational infrastructure that unlocks faster iteration and higher confidence for the entire AI team
Thrive in cross-functional environments and enjoy influencing model design through better evaluation
Qualifications
Minimum 5+ years of professional experience with a Bachelor’s degree in computer science, machine learning, or a related field
Experience building testing, evaluation, or data infrastructure for complex systems (AI/ML experience strongly preferred)
Comfort writing production-quality code (we use Python and TypeScript)
Experience working with structured and unstructured datasets, labeling workflows, or data quality pipelines
Familiarity with modern ML systems and evaluation techniques (e.g., offline metrics, online evaluation, regression testing for models or prompts)
Bonus: experience evaluating LLMs, agentic systems, or AI-assisted developer tools
The base salary range (or hourly wage range, if applicable) that Sentry reasonably expects to pay for this position is $240,000 to $280,000 USD. A successful candidate’s actual base salary (or hourly wage) amount will be determined by a variety of relevant factors including, without limitation, the candidate’s work location, education, work and other relevant experience, skills, and job-related knowledge. A successful candidate will be eligible to participate in Sentry’s employee benefit plans/programs applicable to the candidate’s position (including incentive compensation, equity grants, paid time off, and group health insurance coverage). See Sentry Benefits for more details about the Company’s benefit plans/programs.
Equal Opportunity at Sentry
Sentry is committed to providin
Aplyr's read
Sentry empowers developers with real-time error tracking and performance monitoring, attracting tech professionals focused on enhancing software quality and efficiency.
What's promising
- •Sentry's platform helps developers quickly identify and resolve software errors, enhancing application performance.
- •The company is actively hiring across various technical roles, indicating growth and demand for its services.
- •Sentry's focus on real-time monitoring supports developers in delivering high-quality applications efficiently.
What to watch
- •The competitive landscape in application monitoring may challenge Sentry's market position.
- •Limited public information about Sentry's long-term financial stability and growth trajectory.
- •Potential for high-pressure environments due to the real-time nature of error tracking.
Why Sentry
- •Sentry offers detailed error tracking that is crucial for improving user experience in software applications.
- •The company supports a diverse range of roles, from engineering to technical program management.
- •Sentry's real-time monitoring capabilities set it apart in the application performance sector.
Aplyr’s read is generated by AI from public sources. Was it useful?
About Sentry
Sentry is an application monitoring platform that helps developers identify and fix errors in real-time, improving software performance and user experience. By providing detailed error tracking and performance monitoring, Sentry empowers teams to deliver high-quality applications more efficiently.