Menu
Full-Time

Research Engineer, Production Model Post-Training at Anthropic

Company Anthropic
Location London
Salary Competitive salary
Posted Posted 1 days ago

Job Description

Opening. This role exists to enhance the capabilities, alignment, and safety of Anthropic's production models.

What you'll do

As a Research Engineer on our Post-Training team, you'll train our base models through the complete post-training stack to deliver the production Claude models that users interact with. You'll work at the intersection of cutting-edge research and production engineering, implementing, scaling, and improving post-training techniques like Constitutional AI, RLHF, and other alignment methodologies. Your work will directly impact the quality, safety, and capabilities of our production models.

  • Implement and optimize post-training techniques at scale on frontier models
  • Conduct research to develop and optimize post-training recipes that directly improve production model quality

What you need

  • Strong software engineering skills with experience building complex ML systems
  • Experience with training, fine-tuning, or evaluating large language models
  • Ability to balance research exploration with engineering rigor and operational reliability

Why this matters

This role has a significant impact on the quality, safety, and capabilities of our production models, which are used by users to interact with our AI systems. By working on this role, you'll be contributing to the development of beneficial AI systems that can positively impact society.

Similar Jobs

Full-Time

Website Engineering

ElevenLabs
London
More Info
Full-Time

Audio Engineer

ElevenLabs
empty string
More Info
Full-Time

Affiliate & Influencer Marketing Manager

ElevenLabs
United States
More Info
Full-Time

Sales Development Representative

ElevenLabs
San Francisco
More Info
Full-Time

Enterprise Solutions Engineer

ElevenLabs
San Francisco
More Info
Full-Time

Full-Stack Engineer (Back-End Leaning)

ElevenLabs
United Kingdom
More Info
Apply Now