Full-Time

Applied AI, Evaluation Engineer at Mistral AI

Company Mistral AI
Location Paris
Salary Competitive salary
Posted Posted 0 days ago

Job Description

Opening. This role exists to design and implement evaluation systems that help our customers understand model performance across their specific use cases.

What you'll do

You will design and implement comprehensive evaluation frameworks to measure LLM capabilities across diverse customer use cases, including text generation, reasoning, code, and domain-specific applications.

  • Design and implement comprehensive evaluation frameworks to measure LLM capabilities across diverse customer use cases, including text generation, reasoning, code, and domain-specific applications
  • Build scalable evaluation infrastructure and pipelines that enable rapid, reproducible assessment of model performance

What you need

  • 3+ years of experience in ML evaluation, benchmarking for LLM or agentic systems
  • Deep understanding of concepts and algorithms underlying machine learning and LLMs
  • Strong technical coding skills in Python

Similar Jobs

Full-Time

Customer Success Associate (Comet Browser)

Perplexity
New York City, Belgrade, London
More Info
Full-Time

Data Scientist, Evals

Perplexity
London
More Info
Full-Time

Tech Lead Manager – Agents

Perplexity
San Francisco
More Info
Full-Time

Forward-Deployed Engineer – API Platform

Perplexity AI
New York City, London, San Francisco, Seattle
More Info
Full-Time

Business Development Representative

Perplexity
San Francisco, New York City
More Info
Full-Time

Engineering Site Lead

Perplexity
London
More Info

Receive the latest articles in your inbox

Join the Houtini Newsletter

Practical AI tools, local LLM updates, and MCP workflows straight to your inbox.