What the team is looking for.

At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity.

As a Model Behavior Architect, you are at the forefront of defining and measuring LLM behaviour. We are looking for people who have built a career in engineering, machine learning, and large language models and are experts in model evaluation, policy writing, and creating eval pipelines for complicated tasks.

What you will do

Interact with models to identify where model behavior can be improved
Gather internal and external feedback on model behavior to scope areas for improvement
Design and implement evals, data guidelines, data generation, and synthetic testing environments
Identify and fix edge case behaviors through rigorous testing
Develop robust evaluation pipelines for our model candidates
Work collaboratively with AI Scientists

About you

You have a deep understanding of either 1) linguistics, language, and translation, 2) engineering and code behavior, 3) LLM agents at work, including reasoning and tool use
You have prior knowledge in training and optimising model behaviour
You are an expert at building robust evaluations
You thrive in dynamic and technically complex environments
You have a track record of delivering innovative, out-of-the-box solutions to address real-world constraints

Skills mentioned

large language models
model evaluation
policy writing
eval pipelines
linguistics
language
translation
engineering
code behavior
LLM agents
reasoning
tool use

Model Behavior Architect

What the team is looking for.

What you will do

About you

Other roles you might consider.

Enterprise Product Manager

Solutions Architect - Japan

Technical Program Manager, API Platform

Member of Technical Staff (Software Engineer, Cloud Infrastructure)

Engineering Manager, Enterprise

Technical Program Manager, Launches

New to AI work? Start with these.

Claude Desktop, from zero.

The best MCPs for Claude Desktop.

Claude Code, the complete beginners' guide.

How to set up LM Studio.

Beginner's guide to AI hardware.

MCP catalogue.