Full-Time

Research Engineer, Model Evaluations at Anthropic

Company: Anthropic
Location: San Francisco, CA | New York City, NY
Salary: Competitive salary
Posted: 0 days ago

Job Description

As a Research Engineer on the Model Evaluations team, you'll lead the design and implementation of Anthropic's evaluation platform, a critical system that shapes how we understand, measure, and improve our models' capabilities and safety.

What you'll do

  • Design novel evaluation methodologies to assess model capabilities across diverse domains including reasoning, safety, helpfulness, and harmlessness
  • Lead the design and architecture of Anthropic's evaluation platform, ensuring it scales with our rapidly evolving model capabilities and research needs

What you need

  • Experience designing and implementing evaluation systems for machine learning models, particularly large language models
  • Strong programming skills in Python and experience with distributed computing frameworks
