We are seeking a Machine Learning Research Engineer to join our Enterprise ML Research Lab. As an Agent MLRE, you will work on applying our Agent RL Training + Building algorithms to real-life enterprise datasets across our clients + benchmarks. This will involve creating best-in-class Agents that achieve state-of-the-art results through a combination of post-training + agent-building algorithms.

Responsibilities:

Train state-of-the-art models, developed both internally and from the community, to deploy to our enterprise customers.
Research cutting-edge algorithms to integrate directly into our training stack.
Build agents that leverage our proprietary agent-building algorithms to automatically hill climb datasets – including defining highly performant tools, multi-agent systems, and complex rewards.

Ideally, you'd have:

1-3 years of building with LLMs in a production environment
Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc.
Publications in top conferences such as NEURIPS, ICLR, or ICML within the last two years
PhD or Masters in Computer Science or a related field

Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position, determined by work location and additional factors, including job-related skills, experience, interview performance, and relevant education or training.

Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is:

$218,400 – $273,000 USD

About Us:

At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact.

XML job scraping automation by YubHub

Machine Learning Research Engineer, Agents – Enterprise GenAI at Scale

Job Description

Model Behavior Tutor – Social Cognition & EQ

Model Behavior Tutor – Epistemic Rigor & Truthfulness

Member of Technical Staff – Grok Chat Model

Member of Technical Staff – X Platform Security

IT Systems Engineer

Site Reliability Engineer – Cybersecurity

Job Description

Similar Jobs

Model Behavior Tutor – Social Cognition & EQ

Model Behavior Tutor – Epistemic Rigor & Truthfulness

Member of Technical Staff – Grok Chat Model

Member of Technical Staff – X Platform Security

IT Systems Engineer

Site Reliability Engineer – Cybersecurity

Receive the latest articles in your inbox

Join the Houtini Newsletter

Building the Agentic Stack for Work.