Full-Time

AI Infra Engineer at Perplexity

Company Perplexity
Location San Francisco, Palo Alto
Salary Competitive salary
Posted 0 days ago

Job Description

We are looking for an AI Infra Engineer to join our growing team. We work with Kubernetes, Slurm, Python, C++, and PyTorch, primarily on AWS. As an AI Infrastructure Engineer, you will partner closely with our Inference and Research teams to build, deploy, and optimize our large-scale AI training and inference clusters.

What you'll do

  • Design, deploy, and maintain scalable Kubernetes clusters for AI model inference and training workloads
  • Manage and optimize Slurm-based HPC environments for distributed training of large language models

What you need

  • Strong expertise in Kubernetes administration, including custom resource definitions, operators, and cluster management
  • Hands-on experience with Slurm workload management, including job scheduling, resource allocation, and cluster optimization

