Opening. We are looking for a Machine Learning Engineer to help us train Claude specifically for virtual collaborator workflows. This role will be responsible for designing and implementing reinforcement learning pipelines, building and scaling data creation platforms, and training Claude on advanced document manipulation.
What you'll do
- Designing and implementing reinforcement learning pipelines specifically targeted at virtual collaborator use cases (productivity, organizational navigation, vertical domains)
- Building and scaling our data creation platform for generating high-quality, open-ended tasks with domain experts and crowdworkers
- Integrating real organizational data to create authentic training environments
- Developing robust rubric-based evaluation systems that maintain quality while avoiding reward hacking
- Training Claude on advanced document manipulation, including understanding, enhancing, and co-creating
- Partnering directly with product teams to ensure training aligns with shipped features
What you need
- Strong machine learning experience
- Ability to design and implement reinforcement learning pipelines
- Experience with data creation platforms and data integration
- Strong programming skills in Python
- Excellent communication and collaboration skills
Why this matters
This role will play a critical part in helping us achieve our mission of creating reliable, interpretable, and steerable AI systems. By training Claude on advanced document manipulation, we can improve the accuracy and effectiveness of our virtual collaborator workflows, ultimately leading to better outcomes for our users and society as a whole.