Perplexity is excited to announce the Internship Program for exceptional Master’s or PhD students studying Computer Science or Engineering in the UK, enrolled in the 2025-2026 academic year. This is an intensive program in which you will work directly with our AI Inference team.
What you'll do
- Work with the inference team to improve serving latency and throughput
- Bring up support for new models and state-of-the-art inference optimizations or quantization schemes
- Optimize inference across the entire stack, from GPU kernels to serving endpoints
What you need
- Strong engineering track record with proven knowledge of fundamentals and programming languages (multi-threaded programming, networking, compilation, systems programming, etc)
- Pursuing a Master's or PhD in Computer Science with a focus on performance-related subjects (HPC, Compilers, Distributed Systems)