People
Researchers, authors, and figures discussed across the essays.
Evan Hubinger
AnthropicEvan Hubinger leads the Alignment Stress-Testing team at Anthropic and is the most wide-ranging technical figure in this line of…
Chris Olah
AnthropicChris Olah is a co-founder of Anthropic and has led, since 2021, the team dedicated to mechanistic interpretability. The…
Dario Amodei
AnthropicDario Amodei is co-founder and CEO of Anthropic, which he co-founded in 2021 after leaving OpenAI. His earlier trajectory places…
Amanda Askell
AnthropicAmanda Askell is a philosopher at Anthropic and the declared lead author of the Claude Constitution, in its January 2026 version.…
Jonathan Birch
London School of EconomicsJonathan Birch is a philosopher at the London School of Economics and the author of The Edge of Sentience (Oxford University…
Paul Christiano
US Center for AI Standards and Innovation (NIST)Paul Christiano leads AI safety at the Center for AI Standards and Innovation, a unit of the United States National Institute of…
Jan Leike
AnthropicJan Leike co-led, until mid-2024, OpenAI's Superalignment team with Ilya Sutskever, in a programme that carried a public…
Kyle Fish
AnthropicKyle Fish is the first dedicated model welfare researcher hired by Anthropic, in 2024. On 24 April 2025, in an interview…