About the role

Aplyr's Quick Take

This role is for a Research Engineer focused on developing AI coding agents at JetBrains. You'll be designing and implementing training and evaluation pipelines for large language models (LLMs) that assist developers in coding tasks. It's a hands-on technical position that requires deep expertise in machine learning and coding frameworks.

Good fit

Ideal candidates have extensive experience with LLMs and deep learning frameworks like PyTorch, and are comfortable working independently on complex projects. A strong background in AI research or production environments is essential.

Worth noting

The position involves collaboration across multiple teams and requires a solid grasp of both theoretical and practical aspects of LLMs. The role is remote-friendly across several European locations, which may appeal to those seeking flexibility.

At JetBrains, code is our passion. Ever since we started, back in 2000, we’ve been striving to make the strongest, most effective developer tools on earth. Today, AI-powered assistance and agents are becoming a core part of how developers work in our IDEs.

We’re building multi-step coding agents that can understand large codebases, plan changes, call tools, and iterate with the user. As a Research Engineer in the Agentic Models team, you’ll be responsible for the models, training loops, and evaluation pipelines that power these agents.

You’ll work at the intersection of SFT and RL-style post-training, and product-driven evaluation, using our distributed GPU and MapReduce clusters to ship models into JetBrains products.

As part of our team, you will:

Design, implement, and maintain SFT and RL post-training pipelines for multi-step coding agents.
Train and adapt LLMs for agent workflows, including planning, tool use, and multi-step interactions inside JetBrains IDEs.
Build and develop evaluation and simulation environments where coding agents can act, be measured, and compared on realistic developer tasks.
Design evaluation frameworks and metrics for agent behavior, analyze traces and logs, and close the loop from evaluation back into training, data, and reward design.
Analyze training and evaluation results to propose and implement improvements to model architectures, training recipes, and datasets.
Work with large-scale infrastructure, including distributed training on GPU clusters and large MapReduce-style data processing for pre-training and fine-tuning datasets.
Collaborate closely with research, product, and infrastructure teams to turn high-level product visions into concrete models, experiments, and shipped features.

We’ll be happy to bring you on board if you have:

Extensive hands-on experience training LLMs (pre-training, fine-tuning, or post-training) in a research or production setting.
Deep expertise in modern deep learning frameworks such as PyTorch, and specialized LLM training stacks (e.g. Megatron, NeMo, verl, or similar).
Strong theoretical and practical understanding of LLM fundamentals: architectures, tokenization, data pipelines, batching, mixed precision, distributed training, and debugging unstable runs.
The ability to own projects end to end, starting from a high-level problem or product pain point and overseeing it through the design, experimentation, implementation, and iteration phases.
A product-aware mindset – you care about how developers actually use agents and can translate product needs and failure modes into modeling and evaluation work.
At least 3 years of Python experience writing clean, maintainable code in modern ML codebases.

Our ideal candidate would have experience with:

ML orchestrators and workflow tools such as Kubeflow, Dagster, Airflow, ZenML, and/or job schedulers like Kubernetes or SLURM.
Large-scale data and training pipelines, e.g. MapReduce-style clusters, multi-node GPU training, or workloads on the order of 1M+ CPU/GPU hours.
Designing and maintaining evaluation pipelines for LLMs or agents, including metrics, dashboards, experiment tracking, and automated regression checks.
AI agent development, such as tool-using agents, planners, or multi-step coding workflows, and familiarity with agentic frameworks or patterns.
Experiment tracking and observability using tools like Weights & Biases, MLflow, Langfuse, or similar.
Inference optimization and serving optimized models in production.

#LI-KP1

We are an equal opportunity employer

We know great ideas can come from anyone, anywhere. That’s why we do our best to create an open and inclusive workplace – one that welcomes everyone regardless of their background, identity, religion, age, accessibility needs, or orientation.

We process the data provided in your job application in accordance with the Recruitment Privacy Policy.

Skills & Tags

node python kubernetes ai data product design

Aplyr's read

JetBrains is a leader in intelligent development tools, attracting tech-savvy professionals passionate about enhancing developer productivity and innovation.
Synthesized from recent postings & public sources

What's promising

•JetBrains' tools like IntelliJ IDEA are industry standards, widely adopted by developers.
•The company is at the forefront of AI integration in development tools.
•JetBrains offers a diverse range of roles, from AI to product management.

What to watch

•JetBrains faces intense competition from other development tool providers.
•The company's rapid growth may lead to internal communication challenges.
•Limited public information about JetBrains' workplace culture and employee satisfaction.

Why JetBrains

•JetBrains emphasizes intelligent automation in developer tools, setting it apart.
•The company has a strong focus on AI-driven product development.
•JetBrains supports a wide array of programming languages, enhancing developer flexibility.

Aplyr’s read is generated by AI from public sources. Was it useful?

About JetBrains

JetBrains

jetbrains.com

View company

JetBrains is a software development company known for creating intelligent development tools that streamline programming and enhance productivity. Their products, such as IntelliJ IDEA and PyCharm, are widely used by developers around the world, significantly impacting the software development landscape.