Job Title

This role is for a Principal level Research Engineer to lead the strategic development and execution of robust data pipelines, evaluation frameworks, and metric systems for the Gemini family of models and their associated product applications.

About Us

Artificial Intelligence could be one of humanity's most useful inventions. At Google DeepMind, we're a team of scientists, engineers, machine learning experts, and more, working together to advance the state of the art in artificial intelligence.

The Role

As a Principle Research Engineer, you will operate as a technical expert and leader within the Gemini Data and Evaluation team. Your primary focus will be to architect and execute the rigorous evaluation and data systems that underpin all major model release and product launch decisions for Gemini.

Key Responsibilities

Technical Leadership & Strategy

Work on post-training evaluation and fine-tuning of large-scale models to improve performance and safety.
Define and champion the technical roadmap for large-scale data and evaluation supporting the Gemini model family and its real-world applications.
Drive the research of novel, high-signal evaluation methods (automated, human-in-the-loop, and adversarial) to measure model capabilities, alignment, safety, and trustworthiness.
Actively contribute to the broader scientific community by presenting findings on cutting-edge AI evaluation and safety methods.

About You

To set you up for success as a researcher at Google DeepMind, we look for the following skills and experience:

10+ years of experience in researching engineering, with at least 5 years in a technical leadership role.
Experience with large-scale machine learning systems, data processing pipelines and evaluation methodologies.
Experience with large language models (LLMs) and their evaluation.
Experience in post-training evaluation research.

XML job scraping automation by YubHub

Principal Research Engineer, Gemini Evals at Google DeepMind

Job Description

Model Behavior Tutor – Social Cognition & EQ

Model Behavior Tutor – Epistemic Rigor & Truthfulness

Member of Technical Staff – Grok Chat Model

Member of Technical Staff – X Platform Security

IT Systems Engineer

Senior IT Systems Engineer

Job Description

Similar Jobs

Model Behavior Tutor – Social Cognition & EQ

Model Behavior Tutor – Epistemic Rigor & Truthfulness

Member of Technical Staff – Grok Chat Model

Member of Technical Staff – X Platform Security

IT Systems Engineer

Senior IT Systems Engineer

Receive the latest articles in your inbox

Join the Houtini Newsletter

Building the Agentic Stack for Work.