Full-Time

Member of Technical Staff – Voice Model at xAI

Company xAI
Location Palo Alto, CA
Salary $150,000 - $450,000 USD
How You'll Work onsite
Level staff
Sector Technology
Posted Posted 0 days ago

Job Description

Join the Grok Voice Model team to help build the world's best voice AI. You will design and execute large-scale speech data curation and processing pipelines, work on pre-training and post-training of speech-language models, and build a comprehensive evaluation framework. As a member of this team, you will work closely with product teams to integrate voice models into applications and real-time environments.

We're seeking exceptionally smart, execution-oriented engineers to help us get there. You will have the opportunity to work on challenging projects, collaborate with a highly motivated team, and contribute directly to the company's mission.

Responsibilities:

  • Design and execute large-scale speech data curation and processing pipelines, including collection of diverse real-world audio, synthetic data generation, and automated annotation workflows to enable high-quality model training and evaluation.
  • Work on pre-training and post-training of speech-language models, with targeted enhancements through supervised fine-tuning, reinforcement learning, and other techniques to ensure Grok Voice responses are accurate, factually grounded, natural and idiomatic in spoken style, conversational in tone, and fluent across multiple languages.
  • Build and iterate a comprehensive evaluation framework covering objective metrics (accuracy, quality, latency, expressiveness), human preference studies, content factuality assessments, real-time interaction quality, and experimentation infrastructure to measure and improve performance.
  • Work closely with product teams to integrate voice models into applications and real-time environments, define spoken interaction specifications, and handle the full lifecycle from prototype to global-scale deployment for stable, low-latency, delightful voice experiences.

Basic Qualifications:

  • Python expert with deep proficiency in writing clean, efficient code for AI/ML systems.
  • Hands-on experience processing large-scale datasets using tools like Spark and Ray for cleaning, augmentation, and feature extraction.
  • Proficiency in pre-training and post-training speech-language models using JAX/PyTorch, including supervised fine-tuning, reinforcement learning, and optimizations for accuracy, factuality, natural spoken style, detail, and multilingual fluency.
  • Ability to set up and run rigorous evaluation pipelines: objective metrics, human preference studies, content factuality checks, and iterative A/B testing to drive model improvements.
  • Experience building or working with large-scale distributed training and inference systems on Kubernetes.
  • Proactive, self-driven attitude , ready to grind in a fast-paced, high-caliber team to deliver outstanding voice AI experiences.

Compensation and Benefits:

$150,000 – $450,000 USD

Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

XML job scraping automation by YubHub

Similar Jobs

Full-Time

Applied AI Engineer

Mistral AI
Paris
More Info
Full-Time

Communications Manager

ElevenLabs
More Info
Full-Time

AI Creative Producer – Ads

ElevenLabs
More Info
Full-Time

B2B Marketing Lead – ElevenCreative

ElevenLabs
London
More Info
Full-Time

Deployment Strategist

ElevenLabs
Spain
More Info
Full-Time

Customer Success Lead – LATAM

ElevenLabs
Mexico
More Info

Receive the latest articles in your inbox

Join the Houtini Newsletter

Practical AI tools, local LLM updates, and MCP workflows straight to your inbox.