Full-Time

Research Engineer, Machine Learning (RL Velocity) at Anthropic

Company Anthropic
Salary $500,000-$850,000 USD
How You'll Work hybrid
Level senior
Sector Technology
Posted Posted 0 days ago

Job Description

About the role

The RL Velocity team owns the efficiency and reliability of our RL Science stack – the infrastructure, tooling, and systems that let researchers iterate quickly on training runs. As a Research Engineer on the team, you'll build and improve the core platform that underpins how we do RL at Anthropic, removing bottlenecks that slow down research and making it easier for the broader org to ship better models faster.

Responsibilities

  • Build and improve the RL training infrastructure that researchers depend on day-to-day
  • Identify and remove bottlenecks across the RL stack: debugging, profiling, and rearchitecting where needed
  • Partner closely with researchers and with adjacent engineering teams (inference, sandboxing, and many more) to understand pain points and ship tooling that makes them faster
  • Own the reliability and performance of research runs end-to-end
  • Contribute to design decisions that shape how Anthropic does RL at scale

You may be a good fit if you

  • Have strong software engineering fundamentals and a track record of building performant, reliable systems
  • Have worked on ML infrastructure, distributed systems, or research tooling
  • Care about enabling other people's work and find leverage through platforms rather than individual experiments
  • Are comfortable operating across the stack, from low-level performance work to RL algorithms
  • Have a bias toward shipping and iterating quickly, with a mix of high agency and low ego

Strong candidates may also have

  • Experience with large-scale distributed training (RL, pre-training, or post-training)
  • Familiarity with JAX, PyTorch, or similar ML frameworks
  • A track record of operating at the edge of research and infra in a fast-moving environment

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work.

Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links,visit anthropic.com/careers directly for confirmed position openings.

XML job scraping automation by YubHub

Similar Jobs

Full-Time

Applied AI Engineer

Mistral AI
Paris
More Info
Full-Time

Communications Manager

ElevenLabs
More Info
Full-Time

AI Creative Producer – Ads

ElevenLabs
More Info
Full-Time

B2B Marketing Lead – ElevenCreative

ElevenLabs
London
More Info
Full-Time

Deployment Strategist

ElevenLabs
Spain
More Info
Full-Time

Enterprise Solutions Engineer – North America

ElevenLabs
United States
More Info

Receive the latest articles in your inbox

Join the Houtini Newsletter

Practical AI tools, local LLM updates, and MCP workflows straight to your inbox.