Full-Time

Member of Technical Staff, Applied Inference at xAI

Company xAI
Location Palo Alto
Salary Competitive salary
Posted Posted 0 days ago

Job Description

We are seeking a highly skilled Member of Technical Staff, Applied Inference to join our team. As a key member of our team, you will be responsible for designing and implementing scalable distributed infrastructure for model serving, ensuring the reliability of inference services, and creating custom tools to trace, replay, and fix issues or crashes across the entire stack.

What you'll do

  • Architect and implement scalable distributed infrastructure for model serving, such as load balancing, auto scaling, batch scheduling, and global KVcache systems.
  • Ensure the reliability of inference services, targeting 100% uptime, a 0% error rate, and good tail performance, through proactive monitoring, fault-tolerant designs, and rigorous testing.

What you need

  • Experience with large-scale, high-concurrent production serving.
  • Experience with GPU inference engines.
  • Experience with testing, benchmarking, and the reliability of inference services.
  • Experience with designing and implementing CI/CD infrastructure.

Similar Jobs

Full-Time

Facilities Maintenance Assistant

xAI
Memphis
More Info
Full-Time

Power Generation Engineer

xAI
Memphis
More Info
Full-Time

Facilities Operations Manager

xAI
Southaven, MS
More Info
Full-Time

Receiving and Logistics Clerk

xAI
Memphis
More Info
Full-Time

Electrical Engineer (EIT)

xAI
Memphis
More Info

Receive the latest articles in your inbox

Join the Houtini Newsletter

Practical AI tools, local LLM updates, and MCP workflows straight to your inbox.