Opening. This role exists to enhance the capabilities, alignment, and safety of Anthropic's production models.
What you'll do
As a Research Engineer on our Post-Training team, you'll train our base models through the complete post-training stack to deliver the production Claude models that users interact with. You'll work at the intersection of cutting-edge research and production engineering, implementing, scaling, and improving post-training techniques like Constitutional AI, RLHF, and other alignment methodologies. Your work will directly impact the quality, safety, and capabilities of our production models.
- Implement and optimize post-training techniques at scale on frontier models
- Conduct research to develop and optimize post-training recipes that directly improve production model quality
What you need
- Strong software engineering skills with experience building complex ML systems
- Experience with training, fine-tuning, or evaluating large language models
- Ability to balance research exploration with engineering rigor and operational reliability
Why this matters
This role has a significant impact on the quality, safety, and capabilities of our production models, which are used by users to interact with our AI systems. By working on this role, you'll be contributing to the development of beneficial AI systems that can positively impact society.