Opening. This role exists to enhance the capabilities, alignment, and safety of Anthropic's production models.
What you'll do
As a Research Engineer on our Post-Training team, you'll train our base models through the complete post-training stack to deliver the production Claude models that users interact with. You'll work at the intersection of cutting-edge research and production engineering, implementing, scaling, and improving post-training techniques like Constitutional AI, RLHF, and other alignment methodologies.
- Implement and optimize post-training techniques at scale on frontier models
- Conduct research to develop and optimize post-training recipes that directly improve production model quality
What you need
- Strong software engineering skills with experience building complex ML systems
- Experience with training, fine-tuning, or evaluating large language models
- Ability to balance research exploration with engineering rigor and operational reliability
Why this matters
Your work will directly impact the quality, safety, and capabilities of our production models, making a significant difference in the lives of our users and the broader AI ecosystem.