Full-Time

Member of Technical Staff, Inference at xAI

Company xAI
Location Palo Alto
Salary Competitive salary
Posted Posted 1 days ago

Job Description

As a Member of Technical Staff, Inference, you will be responsible for optimizing the latency and throughput of model inference, building reliable and performant production serving systems to serve billions of users, and accelerating research on scaling test-time compute and rollout in reinforcement learning training.

What you'll do

  • Optimizing the latency and throughput of model inference.
  • Building reliable and performant production serving systems to serve billions of users.

What you need

  • Experience with system optimizations for model serving, such as batching, caching, load balancing, and parallelism.

Similar Jobs

Full-Time

Application Security Engineer

ElevenLabs
United Kingdom
More Info
Full-Time

Head of Engineering

Fifth Dimension
London
More Info
Full-Time

Enterprise Account Executive

Fifth Dimension
New York
More Info
Full-Time

Product Engineer (Staff/Principal)

Fifth Dimension
London
More Info
Full-Time

AI Solutions Architect

Fifth Dimension AI
London
More Info
Full-Time

Founding Technical Pre-Sales Engineer

Fifth Dimension
Singapore
More Info

Receive the latest articles in your inbox

Join the Houtini Newsletter

Practical AI tools, local LLM updates, and MCP workflows straight to your inbox.