Full-Time

Staff Machine Learning Research Engineer, Agent Post-training – Enterprise GenAI at Scale

Company Scale
Sector Technology
Posted Posted 1 days ago

Job Description

Job Title: Staff Machine Learning Research Engineer, Agent Post-training – Enterprise GenAI

About the Role: We are seeking a Staff Machine Learning Research Engineer to join our Enterprise ML Research Lab. As a key member of our team, you will build out our next-gen Agent RL training platform, integrating cutting-edge research into our training stack.

Responsibilities:

  • Train state-of-the-art models, developed both internally and from the community, to deploy to our enterprise customers.
  • Research cutting-edge algorithms to integrate directly into our training stack.
  • Design solutions that enable complex multi-agent systems to directly learn from both process + outcome-based rewards.

Ideal Candidate:

  • 5+ years of LLM training in a production environment.
  • Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc.
  • Publications in top conferences such as NEURIPS, ICLR, or ICML within the last two years.
  • PhD or Masters in Computer Science or a related field.

Compensation: Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position, determined by work location and additional factors, including job-related skills, experience, interview performance, and relevant education or training.

Benefits: Comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend.

XML job scraping automation by YubHub

Similar Jobs

Full-Time

Model Behavior Tutor – Social Cognition & EQ

xAI
Remote
More Info
Full-Time

Model Behavior Tutor – Epistemic Rigor & Truthfulness

xAI
Remote
More Info
Full-Time

Member of Technical Staff – Grok Chat Model

xAI
Palo Alto, CA
More Info
Full-Time

Member of Technical Staff – X Platform Security

xAI
Palo Alto, CA
More Info
Full-Time

IT Systems Engineer

xAI
Palo Alto, CA
More Info
Full-Time

Site Reliability Engineer – Cybersecurity

xAI
Palo Alto, CA
More Info

Receive the latest articles in your inbox

Join the Houtini Newsletter

Practical AI tools, local LLM updates, and MCP workflows straight to your inbox.