Applied AI, Evaluation Engineer at Mistral AI

Company: Mistral AI

Location: Paris

Salary: Competitive

Posted: 0 days ago

Job Description

This role exists to design and implement evaluation systems that help our customers understand model performance across their specific use cases.

What you'll do

Design and implement comprehensive evaluation frameworks to measure LLM capabilities across diverse customer use cases, including text generation, reasoning, code, and domain-specific applications
Build scalable evaluation infrastructure and pipelines that enable rapid, reproducible assessment of model performance

What you need

3+ years of experience in ML evaluation and benchmarking for LLMs or agentic systems
Deep understanding of concepts and algorithms underlying machine learning and LLMs
Strong technical coding skills in Python

