Full-Time

Machine Learning Research Engineer, Agents – Enterprise GenAI at Scale

Company Scale
Sector Technology
Posted Posted 1 days ago

Job Description

We are seeking a Machine Learning Research Engineer to join our Enterprise ML Research Lab. As an Agent MLRE, you will work on applying our Agent RL Training + Building algorithms to real-life enterprise datasets across our clients + benchmarks. This will involve creating best-in-class Agents that achieve state-of-the-art results through a combination of post-training + agent-building algorithms.

Responsibilities:

  • Train state-of-the-art models, developed both internally and from the community, to deploy to our enterprise customers.
  • Research cutting-edge algorithms to integrate directly into our training stack.
  • Build agents that leverage our proprietary agent-building algorithms to automatically hill climb datasets – including defining highly performant tools, multi-agent systems, and complex rewards.

Ideally, you'd have:

  • 1-3 years of building with LLMs in a production environment
  • Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc.
  • Publications in top conferences such as NEURIPS, ICLR, or ICML within the last two years
  • PhD or Masters in Computer Science or a related field

Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position, determined by work location and additional factors, including job-related skills, experience, interview performance, and relevant education or training.

Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is:

$218,400 – $273,000 USD

About Us:

At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact.

XML job scraping automation by YubHub

Similar Jobs

Full-Time

Model Behavior Tutor – Social Cognition & EQ

xAI
Remote
More Info
Full-Time

Model Behavior Tutor – Epistemic Rigor & Truthfulness

xAI
Remote
More Info
Full-Time

Member of Technical Staff – Grok Chat Model

xAI
Palo Alto, CA
More Info
Full-Time

Member of Technical Staff – X Platform Security

xAI
Palo Alto, CA
More Info
Full-Time

IT Systems Engineer

xAI
Palo Alto, CA
More Info
Full-Time

Site Reliability Engineer – Cybersecurity

xAI
Palo Alto, CA
More Info

Receive the latest articles in your inbox

Join the Houtini Newsletter

Practical AI tools, local LLM updates, and MCP workflows straight to your inbox.