Reinforcement Learning Engineer
Confirmed live in the last 24 hours
Weights & Biases
Job Description
Our Team
The OpenPipe team at CoreWeave is building tools to help agents learn from experience. This is a critical step to make agents reliable enough to perform long tasks autonomously, in the same way human employees are. We’re systematically identifying and solving the major bottlenecks between today’s tech and those future self-improving agents. So far, we’ve:
- Released ART, the easiest library for getting started with RL.
- Developed RULER, a general-purpose reward function that works across many diverse tasks.
- Built Serverless RL, an elegant API that gives RL practitioners full control over their data, environment and reward function while letting them outsource the headaches of managing GPU infrastructure.
These releases have a theme: we’re systematically tackling each major roadblock to successfully training self-improving agents. Several serious challenges remain. Building simulated environments often requires substantial human labor, and existing training methods are not data efficient enough. We're laser-focused on solving these problems and making self-improvement a reality for agent developers.
In startup terms, this is a classic hard-tech bet. Our roadmap involves substantial techn
Similar Jobs
Figure AI
Staff Reinforcement Learning Engineer – Whole Body Control
Anthropic
Research Engineer, Machine Learning (Reinforcement Learning)
Anthropic
Research Engineer, Machine Learning (Reinforcement Learning)
Anthropic
Research Engineer, Cybersecurity Reinforcement Learning
DoorDash
Senior/Staff Deep Reinforcement Learning Engineer
XPENG Motors