This role exists to develop the next generation of large language models. We're seeking a Research Engineer to join our Pretraining team, responsible for developing safe, steerable, and trustworthy AI systems.
What you'll do
- Conduct research and implement solutions in areas such as model architecture, algorithms, data processing, and optimizer development
- Independently lead small research projects while collaborating with team members on larger initiatives
- Design, run, and analyze scientific experiments to advance our understanding of large language models
- Optimize and scale our training infrastructure to improve efficiency and reliability
- Develop and improve dev tooling to enhance team productivity
- Contribute to the entire stack, from low-level optimizations to high-level model design
What you need
- Advanced degree (MS or PhD) in Computer Science, Machine Learning, or a related field
- Strong software engineering skills with a proven track record of building complex systems
- Expertise in Python and experience with deep learning frameworks (PyTorch preferred)
- Familiarity with large-scale machine learning, particularly in the context of language models
- Ability to balance research goals with practical engineering constraints
- Strong problem-solving skills and a results-oriented mindset
- Excellent communication skills and ability to work in a collaborative environment
Why this matters
At Anthropic, we are committed to fostering a diverse and inclusive workplace. We strongly encourage applications from candidates of all backgrounds, including those from underrepresented groups in tech.