Back to Search
Overview
Mid-Level

AI Research Engineer - AI Safety

Confirmed live in the last 24 hours

Helsing

Helsing

Berlin; London; Munich
Remote
Posted March 17, 2026

Job Description

Who we are

At Helsing we deliver AI-based capabilities and the enabling foundation that allow machines to perceive and assist human decision-making. You will have the unique opportunity to shape AI capabilities in one of the most challenging sectors, where high generalisation capabilities need to be paired with hardware constraints and robustness against adversarial attacks.

You will join a team focused on AI Assurance, where you will develop cutting-edge techniques for scalable evaluation of AI products across the company, design data collection and experimentation strategies to extract causal insights, and enhance responsible decision-making via uncertainty quantification and safety mechanisms.

The day-to-day

You will be responsible for defining operational domains and evaluating the reliability of the AI capabilities developed in-house. You will develop and extend the state-of-the-art in uncertainty quantification and uncertainty calibration. This will involve understanding the AI systems we build, interfacing with them, and evaluating their robustness in real-world and adversarial scenarios. You will contribute to impactful projects and will collaborate with people across several teams and backgrounds.

You should apply if you

  • Hold a STEM MSc with solid mathematical and statistical background.
  • Have excellent communication skills and the ability to report and present research findings clearly and efficiently both internally and externally

  • Are passionate about keeping up to date with current research and enjoy reimplementing / extending papers on state of the art Deep Learning-based approaches

  • Possess solid software engineering skills, writing clean and well-structured code in Python and/or languages like Rust, Java, or modern C++, and experience deploying AI software to production including testing, QA, and monitoring

Note: We operate at an intersection where women, as well as other minority groups, are systemically under-represented. We encourage you to apply even if you don’t meet all the listed qualifications;  ability and impact cannot be summarised in a few bullet points.

Nice to have

  • PhD in either model evaluation and robustness, uncertainty quantification, experimental design, causal inference or related fields.

  • Have authored publications in top-tier jou

pythonjavagorustaiiosdataproductdesign