About the role

Aplyr's Quick Take

This role is for a Senior Software Engineer focused on creating evaluation frameworks for AI systems at Sentry. You'll be building tools to measure the performance and reliability of AI features, working closely with other engineers and product leaders to ensure quality in AI outputs.

Good fit

Ideal candidates have at least 5 years of experience in software engineering, particularly in testing or evaluation infrastructure. A strong background in AI or machine learning, along with a collaborative work style, will help you thrive here.

Worth noting

The role emphasizes building foundational infrastructure for AI, which could be a unique opportunity for those interested in influencing AI development from the ground up. The hybrid work model requires in-office attendance three days a week, which might not suit everyone.

About Sentry

Bad software is everywhere, and we’re tired of it. Sentry is on a mission to help developers write better software faster so we can get back to enjoying technology.

With more than $217 million in funding and 100,000+ organizations that believe we’re on to something, we're building performance and error monitoring tools that help companies like Disney, Microsoft, and Atlassian spend less time fixing bugs and more time building products.

Sentry embraces a hybrid work model across our global hubs, with Mondays, Tuesdays, and Thursdays set as in-office anchor days to encourage meaningful collaboration. If you like to selfishly build things that make your digital life better, come help us build the next generation of software monitoring tools.

About the role

As a Senior Software Engineer on Sentry’s AI/ML team, you’ll be responsible for building the evaluation infrastructure that measures the accuracy, reliability, and real-world performance of our AI systems. This role is critical to ensuring that our debugging agents and AI-powered features behave correctly, safely, and predictably as they scale. You’ll design datasets, benchmarks, and test harnesses that turn ambiguous AI behavior into measurable signals, helping the team ship AI with confidence.

In this role you will

Design and build robust evaluation frameworks to measure accuracy, reliability, regressions, and edge cases in AI systems
Create and curate high-quality datasets, golden test cases, and benchmarks grounded in real production data
Build automated test harnesses and metrics pipelines to continuously evaluate models, prompts, and agentic workflows
Partner closely with applied AI engineers and product leaders to define what “good” looks like and translate it into measurable criteria
Own the evaluation lifecycle for major AI initiatives, from early experimentation through production monitoring

You’ll love this job if you

Care deeply about correctness, rigor, and measurement in AI systems
Enjoy turning fuzzy product goals and model behavior into concrete tests and metrics
Like building foundational infrastructure that unlocks faster iteration and higher confidence for the entire AI team
Thrive in cross-functional environments and enjoy influencing model design through better evaluation

Qualifications

At least 5 years of professional experience and a Bachelor’s degree in computer science, machine learning, or a related field
Experience building testing, evaluation, or data infrastructure for complex systems (AI/ML experience strongly preferred)
Comfort writing production-quality code (we use Python and TypeScript)
Experience working with structured and unstructured datasets, labeling workflows, or data quality pipelines
Familiarity with modern ML systems and evaluation techniques (e.g., offline metrics, online evaluation, regression testing for models or prompts)
Bonus: experience evaluating LLMs, agentic systems, or AI-assisted developer tools

The base salary range (or hourly wage range, if applicable) that Sentry reasonably expects to pay for this position is $240,000 to $280,000 USD. A successful candidate’s actual base salary (or hourly wage) amount will be determined by a variety of relevant factors including, without limitation, the candidate’s work location, education, work and other relevant experience, skills, and job-related knowledge. A successful candidate will be eligible to participate in Sentry’s employee benefit plans/programs applicable to the candidate’s position (including incentive compensation, equity grants, paid time off, and group health insurance coverage). See Sentry Benefits for more details about the Company’s benefit plans/programs.

Equal Opportunity at Sentry

Sentry is committed to providin

Skills & Tags

python typescript go machine learning ai data product design

Aplyr's read

Sentry empowers developers with real-time error tracking and performance monitoring, attracting tech professionals focused on enhancing software quality and efficiency.
Synthesized from recent postings & public sources

What's promising

• Sentry's platform helps developers quickly identify and resolve software errors, enhancing application performance.
• The company is actively hiring across various technical roles, indicating growth and demand for its services.
• Sentry's focus on real-time monitoring supports developers in delivering high-quality applications efficiently.

What to watch

• The competitive landscape in application monitoring may challenge Sentry's market position.
• Limited public information about Sentry's long-term financial stability and growth trajectory.
• Potential for high-pressure environments due to the real-time nature of error tracking.

Why Sentry

• Sentry offers detailed error tracking that is crucial for improving user experience in software applications.
• The company supports a diverse range of roles, from engineering to technical program management.
• Sentry's real-time monitoring capabilities set it apart in the application performance sector.

Aplyr’s read is generated by AI from public sources.

About Sentry

Sentry

sentry.io

Sentry is an application monitoring platform that helps developers identify and fix errors in real-time, improving software performance and user experience. By providing detailed error tracking and performance monitoring, Sentry empowers teams to deliver high-quality applications more efficiently.