Full-Time

Member of Technical Staff – Post-Training and RL at xAI

Company xAI
Location Palo Alto, CA
Salary $180,000 - $600,000 USD
How You'll Work onsite
Level staff
Sector Technology
Posted Posted 0 days ago

Job Description

About the Role

You will work on the most critical post-training and reinforcement learning challenges at any given time , including reward modeling, preference optimisation (RLHF/DPO), and RL for improving reasoning, truthfulness, and real-world capabilities.

You will get clarity on your first project before an offer.

Responsibilities

  • Work on post-training and reinforcement learning challenges
  • Develop and implement reward models and preference optimisation techniques
  • Improve reasoning, truthfulness, and real-world capabilities using RL

Qualifications

  • Believe truth-seeking AI is the most important and challenging problem
  • Obsessed about building incredibly useful models through post-training and RL techniques
  • Power user of AI models and eager to push the boundaries of what's possible with reinforcement learning and alignment methods

Compensation and Benefits

$180,000 – $600,000 USD

Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

XML job scraping automation by YubHub

Similar Jobs

Full Time|part Time|contract

AI Tutor – Swahili

xAI
Remote
More Info
Full Time|part Time|contract

AI Tutor – Russian

xAI
Remote
More Info
Full Time|part Time|contract

AI Tutor – Hungarian

xAI
Remote
More Info
Full-Time

Supervisor, Logistics – Data Center Operations

xAI
Memphis, TN
More Info
Full Time|part Time|contract

AI Tutor – Greek

xAI
Remote
More Info
Full-Time

Construction Manager (Structural)

xAI
Memphis, TN
More Info

Receive the latest articles in your inbox

Join the Houtini Newsletter

Practical AI tools, local LLM updates, and MCP workflows straight to your inbox.