Summary
Microsoft AI is looking for a talented Member of Technical Staff – Post Training – MAI Superintelligence Team at its Mountain View office. This role sits at the heart of post-training: improving pretrained models to advance the state of the art on a wide variety of internal and external benchmarks. You'll work on the bleeding edge and leverage the most powerful pretrained models and algorithms for your needs.
About the Role
This role involves contributions to all stages of the post-training process: driving data collection and acquisition, building evaluations of model capabilities, and applying advanced reward modeling and RL techniques to develop and improve the post-training recipe. You will design hypotheses and experiment plans for rapidly iterating on model performance.
Accountabilities
- Develop data collection, evaluation, and post-training methods for models.
- Design hypotheses and experiment plans for rapidly iterating on model performance.
The Candidate we're looking for
Experience:
- Bachelor’s Degree in Computer Science, Machine Learning, Mathematics, or a related technical discipline AND 4+ years of technical engineering experience coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python, OR equivalent experience.
Technical skills:
- Experience with reward modeling, RL, or other post-training techniques.
Personal attributes:
- Passionate about advancing the state of post-training research.
- Will thrive in a highly collaborative, fast-paced environment.
Benefits
- Starting January 26, 2026, MAI employees are expected to work from a designated Microsoft office at least four days a week if they live within 50 miles (U.S.) or 25 miles (non-U.S., country-specific) of that location.
- The Microsoft Superintelligence team’s mission is to empower every person and every organization on the planet to achieve more.