About the role:
You will contribute to exploratory experimental research on AI safety, with a focus on risks from powerful future systems. As a Research Engineer on the Alignment Science team, you will develop methods to ensure that advanced AI systems remain safe and harmless in unfamiliar or adversarial scenarios.
Responsibilities:
- Conduct research on AI control and alignment stress-testing
- Develop and implement new techniques for ensuring AI safety
- Collaborate with other teams, including Interpretability, Fine-Tuning, and the Frontier Red Team
- Test and evaluate the effectiveness of AI safety techniques
Requirements:
- Significant software, ML, or research engineering experience
- Familiarity with technical AI safety research
- Experience contributing to empirical AI research projects
Preferred qualifications:
- Experience authoring research papers in machine learning, NLP, or AI safety
- Experience with LLMs
- Experience with reinforcement learning
Benefits:
- Competitive compensation and benefits
- Optional equity donation matching
- Generous vacation and parental leave
- Flexible working hours
Note:
This role requires candidates to be based in London at least 25% of the time, with occasional travel to San Francisco.