Opening. This role is to build and maintain critical systems that serve Claude to millions of users worldwide. We bring Claude to life by serving our models via the industry's largest compute-agnostic inference deployments.
What you'll do
Our Inference team is responsible for building and maintaining the critical systems that serve Claude to millions of users worldwide. We bring Claude to life by serving our models via the industry's largest compute-agnostic inference deployments.
- Responsibility 1: Designing intelligent routing algorithms that optimize request distribution across thousands of accelerators
- Responsibility 2: Autoscaling our compute fleet to dynamically match supply with demand across production, research, and experimental workloads
What you need
- Required skill 1: High-performance, large-scale distributed systems
- Required skill 2: Implementing and deploying machine learning systems at scale