Research Scientist, Science of Post-Training and Reinforcement Learning at Google DeepMind

Company Google DeepMind

Location London

Salary Competitive salary

Posted Posted 0 days ago

Job Description

Opening. This role is a hands-on opportunity to contribute to the development of a real science of post-training for agents. The core loop is: form a hypothesis, implement it, run strong experiments, analyze what happened, and decide what to do next.

What you'll do

You will work closely with Ian Osband and the team on research around post-training for agents and LLMs, including practical RL methods and evaluation. This is not a theory-only role; you should expect to implement code, run experiments, and own results end-to-end.

Propose and test research hypotheses in post-training and RL for agents/LLMs.
Implement algorithm ideas and run end-to-end experiments, including setup, execution, analysis, and iteration.

What you need

A research track record in ML/RL, demonstrated through publications or high-quality projects.
Strong implementation ability and comfort working in research codebases.
Evidence of owning experiments end-to-end, including analysis and interpretation.

Similar Jobs

Full-Time

Analytics Engineer

Constructor

More Info

Full-Time

Customer Success Advocate

Constructor.io

More Info

Full-Time

Lead Counsel, Network Infrastructure

Meta

Menlo Park, CA

More Info

Full-Time

Quality Measurement Specialist – Bengali

Meta

Dublin, Ireland

More Info

Full-Time

Associate General Counsel, Product (Business AI)

Meta

Los Angeles

More Info

Full-Time

CapEx Sourcing Manager

Meta

Sunnyvale, CA

More Info

Job Description

Similar Jobs

Analytics Engineer

Customer Success Advocate

Lead Counsel, Network Infrastructure

Quality Measurement Specialist – Bengali

Associate General Counsel, Product (Business AI)

CapEx Sourcing Manager

Receive the latest articles in your inbox

Join the Houtini Newsletter

Building the Agentic Stack.