Member of Technical Staff, Applied Inference at xAI

Company xAI

Location Palo Alto

Salary Competitive salary

Posted Posted 0 days ago

Job Description

We are seeking a highly skilled Member of Technical Staff, Applied Inference to join our team. As a key member of our team, you will be responsible for designing and implementing scalable distributed infrastructure for model serving, ensuring the reliability of inference services, and creating custom tools to trace, replay, and fix issues or crashes across the entire stack.

What you'll do

Architect and implement scalable distributed infrastructure for model serving, such as load balancing, auto scaling, batch scheduling, and global KVcache systems.
Ensure the reliability of inference services, targeting 100% uptime, a 0% error rate, and good tail performance, through proactive monitoring, fault-tolerant designs, and rigorous testing.

What you need

Experience with large-scale, high-concurrent production serving.
Experience with GPU inference engines.
Experience with testing, benchmarking, and the reliability of inference services.
Experience with designing and implementing CI/CD infrastructure.

Similar Jobs

Full-Time

Site Ops Lead

xAI

Memphis

More Info

Full-Time

Facilities Maintenance Assistant

xAI

Memphis

More Info

Full-Time

Power Generation Engineer

xAI

Memphis

More Info

Full-Time

Facilities Operations Manager

xAI

Southaven, MS

More Info

Full-Time

Receiving and Logistics Clerk

xAI

Memphis

More Info

Full-Time

Electrical Engineer (EIT)

xAI

Memphis

More Info

Job Description

Similar Jobs

Site Ops Lead

Facilities Maintenance Assistant

Power Generation Engineer

Facilities Operations Manager

Receiving and Logistics Clerk

Electrical Engineer (EIT)

Receive the latest articles in your inbox

Join the Houtini Newsletter

Building the Agentic Stack.