We are seeking a Machine Learning Research Engineer to join our Enterprise ML Research Lab. As an Agent MLRE, you will apply our Agent RL Training and Building algorithms to real-world enterprise datasets from our clients, as well as to benchmarks. This will involve creating best-in-class agents that achieve state-of-the-art results through a combination of post-training and agent-building algorithms.
Responsibilities:
- Train state-of-the-art models, both internally developed and from the community, for deployment to our enterprise customers.
- Research cutting-edge algorithms to integrate directly into our training stack.
- Build agents that leverage our proprietary agent-building algorithms to automatically hill-climb on datasets, including defining highly performant tools, multi-agent systems, and complex rewards.
Ideally, you'd have:
- 1-3 years of experience building with LLMs in a production environment
- Experience with post-training methods such as RLHF and RLVR, and related algorithms such as PPO and GRPO
- Publications in top conferences such as NeurIPS, ICLR, or ICML within the last two years
- PhD or Master's degree in Computer Science or a related field
Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position, determined by work location and additional factors, including job-related skills, experience, interview performance, and relevant education or training.
Please refer to the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in San Francisco, New York, and Seattle is:
$218,400 – $273,000 USD
About Us:
At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact.