Full-Time

Principal Research Engineer, Gemini Evals at Google DeepMind

Company Google DeepMind
Sector Technology
Posted 1 day ago

Job Description

Job Title

This role is for a Principal-level Research Engineer to lead the strategic development and execution of robust data pipelines, evaluation frameworks, and metric systems for the Gemini family of models and their associated product applications.

About Us

Artificial Intelligence could be one of humanity's most useful inventions. At Google DeepMind, we're a team of scientists, engineers, machine learning experts, and more, working together to advance the state of the art in artificial intelligence.

The Role

As a Principal Research Engineer, you will operate as a technical expert and leader within the Gemini Data and Evaluation team. Your primary focus will be to architect and execute the rigorous evaluation and data systems that underpin all major model release and product launch decisions for Gemini.

Key Responsibilities

Technical Leadership & Strategy

  • Work on post-training evaluation and fine-tuning of large-scale models to improve performance and safety.
  • Define and champion the technical roadmap for large-scale data and evaluation supporting the Gemini model family and its real-world applications.
  • Drive research into novel, high-signal evaluation methods (automated, human-in-the-loop, and adversarial) to measure model capabilities, alignment, safety, and trustworthiness.
  • Actively contribute to the broader scientific community by presenting findings on cutting-edge AI evaluation and safety methods.

About You

To set you up for success as a researcher at Google DeepMind, we look for the following skills and experience:

  • 10+ years of experience in research engineering, with at least 5 years in a technical leadership role.
  • Experience with large-scale machine learning systems, data processing pipelines and evaluation methodologies.
  • Experience with large language models (LLMs) and their evaluation.
  • Experience in post-training evaluation research.
