AI Watch

Welcome! This is a website to track people and organizations in the AI safety/alignment/AI existential risk communities. Inclusion of a position or organization on AI Watch does not indicate an assessment that it is actually making AI safer or that it is good for the world in any way. Inclusion is mostly a sociological indication that the position or organization is associated with these communities, as well as an indication that it claims to be working on AI safety or alignment. (There are some plans to eventually introduce such assessments on AI Watch, but for now there are none.) See the code repository for the source code and data of this website.

This website is developed by Issa Rice with data contributions from Sebastian Sanchez, Amana Rice, and Vipul Naik, and has been partially funded by Vipul Naik and Mati Roy (who in July 2023 paid for the time Issa had spent answering people’s questions about AI Watch up until that point).

Last updated on 2025-12-31; see here for a full list of recent changes.

Agendas

Agenda name | Associated people | Associated organizations
Iterated amplification | Paul Christiano, Buck Shlegeris, Dario Amodei | OpenAI
Embedded agency | Eliezer Yudkowsky, Scott Garrabrant, Abram Demski | Machine Intelligence Research Institute
Comprehensive AI services | Eric Drexler | Future of Humanity Institute
Ambitious value learning | Stuart Armstrong | Future of Humanity Institute
Factored cognition | Andreas Stuhlmüller | Ought
Recursive reward modeling | Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg | Google DeepMind
Debate | Paul Christiano | OpenAI
Interpretability | Christopher Olah |
Inverse reinforcement learning | |
Preference learning | |
Cooperative inverse reinforcement learning | |
Imitation learning | |
Alignment for advanced machine learning systems | Jessica Taylor, Eliezer Yudkowsky, Patrick LaVictoire, Andrew Critch | Machine Intelligence Research Institute
Learning-theoretic AI alignment | Vanessa Kosoy |
Counterfactual reasoning | Jacob Steinhardt |

Positions grouped by person

Showing 5 people with positions.

Name | Number of organizations | List of organizations
Paul Christiano | 4 | AI Impacts, Machine Intelligence Research Institute, OpenAI, Ought
Jaan Tallinn | 3 | Berkeley Existential Risk Initiative, Centre for the Study of Existential Risk, Machine Intelligence Research Institute
Oliver Habryka | 3 | Center for Applied Rationality, Lightcone Infrastructure, Machine Intelligence Research Institute
Ryan Carey | 3 | Centre for the Study of Existential Risk, Machine Intelligence Research Institute, Ought
Stuart Russell | 3 | Berkeley Existential Risk Initiative, Centre for the Study of Existential Risk, Machine Intelligence Research Institute

Positions grouped by organization

Showing 36 organizations.

Organization | Number of people | List of people
Machine Intelligence Research Institute | 151 | Martin Lucas, David Abecassis, Aaron Scher, Brittany Ferrero, Joe Rogero, Mitchell Howe, Lisa Thiergart, Harlan Stewart, Jimmy Rintjema, Alex Vermeer, Gretta Duleba, Peter Barnett, Edward Kmett, James Payor, Victoria Krakovna, Blake Borgeson, Christine Peterson, Carson Jones, David Simmons, Daniel Demski, Ben Weinstein-Raun, Alex Zhu, Alex Mennen, Alex Appel, Linda Linsefors, Evan Hubinger, Buck Shlegeris, Nick Tarleton, Kurt Brown, Jesse Liptrap, Benjamin Mann, Sam Eisenstat, Jeremy Schlatter, Andrew Critch, Jan Leike, Matthew Graves, Ryan Carey, Connor Flexman, Colm Ó Riain, Aaron Silverbook, Gary Drescher, Andrew Lapinski-Barker, Robin Hanson, Kaya Stechly, Anna Salamon, Bart Selman, Stuart Russell, Ramana Kumar, Jack Gallagher, Jaan Tallinn, Vanessa Kosoy, Abram Demski, Stuart Armstrong, Nate Thomas, Scott Garrabrant, Rob Bensinger, Jessica Taylor, Jed McCaleb, Jake Moskowitz, Jesse Galef, Tsvi Benson-Tilsen, Matthew Fallshaw, Eliezer Yudkowsky, Benya Fallenstein, Peter Thiel, Liron Shapira, Nicolas Gagné, Elizabeth Morningstar, Lila Rieber, Vipul Naik, Nate Soares, Daniel Lewis, Richard Neal, Robert Mushkatblat, Dávid Natingga, Steve Omohundro, Roman Yampolskiy, Nick Bostrom, Nathan Clark, Moshe Looks, Kaj Sotala, Sebastian Nickel, Evan Erickson, Oliver Habryka, Bill Hibbard, Alex Altair, Paul Christiano, Patrick LaVictoire, Nisan Stiennon, Mihaly Barasz, Jeremy Miller, Stephen Barnes, Louie Helm, Daniel Roth, Patrick Robotham, Pedro Chaves, Topher Brennan, Carl Shulman, Cameron Taylor, Nickolai Leschov, Jonathan Wang, Tomer Kagan, Malo Bourgon, Jake Miller, Gwern Branwen, Erica Edelman, Luke Muehlhauser, Lincoln Quirk, Diego Caleiro, Will Newsome, Nevin Freeman, Minda Myers, Keefe Roedersheimer, Peter Scheyer, Peter de Blanc, Jasen Murray, Daniel Dewey, Abraham Wolk, Thomas Colthurst, Stanislas Sochacki, Luke Grecki, Dennis Fan, Janos Kramar, Ben Hoskin, Tim Czech, Jason Levin, Frank Adamek, Amy Willey, Kevin Fischer, Ray Kurzweil, Ben Goertzel, Edwin Evans, Steve Rayhawk, Michael Anissimov, Harrison Willey, Michael Blume, Kemal Eren, Andrew Rettek, Andriy Brodskyy, Andrew Hay, Vincent Fagot, Thomas McCabe, Steven Kaas, Roko Mijic, Katja Grace, Justin Shovelain, Henrik Jonsson, Bryan Bishop, Alyssa Vance, Marcello Herreshoff, Jeff Alexander
Centre for the Study of Existential Risk | 100 | Pablo Suarez, Reuben Makomere, Aarathi Krishnan, Thomas Moynihan, Elizabeth Cooper, Alexandra Klein, Laura Elmer, Shoshana Dahdi, Clare Arnstein, Kennedy Mbeva, Julian Huppert, Taniel Yusef, Cecil Abungu, Madhulika Srikumar, Zoe Hemsley, Matthew Connelly, Constantin Arnscheidt, Coleman Snell, Clarissa Rios Rojas, Dennis Müller, Sarah Dryhurst, Maurice Chiodo, Sam Clarke, Fazl Barez, Ross Gruetzemacher, Abdullahi Alim, Nathaniel Cooke, Paul Ingram, José Hernández-Orallo, Freya Jephcott, Jessica Bland, Catherine Rhodes, Matthijs M. Maas, Charlotte Christiane Hammer, Tom Hobson, Shin-Shin Hua, Lara Mani, Shahar Avin, Rumtin Sepasspour, S. J. Beard, Carla Zoe Cremer, Adrian Weller, Chris Lowe, Adrian Kent, Sean Holden, Stephen Hawking, Hermann Hauser, Tim Crane, David Cleevely, Jonathan Wiener, Max Tegmark, Peter Singer, Murray Shanahan, Dana Scott, Stuart Russell, Peter Piot, Tim Palmer, Elon Musk, Robert May, David Chalmers, Nick Bostrom, Margaret Boden, Martina Kunz, Beth Barnes, Yang Liu, Simon Goldhill, Jane Heal, Partha Dasgupta, Lalitha Sundaram, Haydn Belfield, Seán Ó hÉigeartaigh, Huw Price, Jaan Tallinn, Martin Rees, Alison Gopnik, Ryan Carey, William Sutherland, Susan Owens, Mami Mizutori, Piers Millett, Thomas Homer-Dixon, Robert Doubleday, Beatrice Crona, Belinda Cleeland, Des Browne, Yuval Noah Harari, Olaf Corry, Seth Baum, Caroline Baylon, Sophie Dannreuther, Charlotte Hammer, Jaime Sevilla, David Krueger, Elizabeth Seger, Di Cooke, Mike Cassidy, James Ginns, Andrew Tanentzap, Allan Dafoe, Rachel Burgess
FAR.AI | 51 | Jessica Lim, Edward Yee, Lilian Hughes, Chris Cundy, Taylor Boyle, Lindsay Murachver, Jeremy Rich, Saad Siddiqui, Oskar Hollinsworth, Anastasiia Gaidashenko, Philip Quirke, Brendan Murphy, Isabella Duan, Ian McKenzie, Dillon Bowen, Michał Zając, Siao Si Looi, Chris MacLeod, Aaron Tucker, Claudia Shi, Moritz von Knebel, Tony Wang, Fynn Heide, Kellin Pelrine, Ben Goldhaber, Lev McKinney, Adrià Garriga-Alonso, Pablo Moreno, Tomasz Korbak, Pedro Freire, Nora Belrose, Jérémy Scheurer, Juan Rocamonde, Alex Tamkin, Niki Howe, Ethan Perez, ChengCheng Tan, Adam Gleave, Nino Scherrer, Sawyer Bernath, Karl Berzins, Hannah Betts, Edmund Mills, Josh Jacobson, Alyse Spiehler, Tom Tseng, Mohammad Taufeeque, Joseph Miller, Euan McLean, Lawrence Chan, Scott Emmons
Berkeley Existential Risk Initiative | 41 | Michael Jemison, Sarah Otis, Cierra Johnson, Elisabeth Siegel, Karuna Nandkumar, Krystal Jackson, Deepika Raman, Andreas Pashos, Sawyer Bernath, Lara Lincoln, Jess Reidel, Elizabeth Cooper, Gary Menezes, Scott Singer, Kayla Blomquist, Joseph Castellano, Nada Madkour, Ian Baker, James Paul Gonzales, Jess Riedel, Stuart Russell, Sofia Davis-Fogel, Alex Flint, Josh Jacobson, Sam Bankman-Fried, Matt Fallshaw, Colleen Gleason, Jeremy Schlatter, Jaan Tallinn, Kyle Scott, Rebecca Raible, Kenzi Amodei, Jacob Tsimerman, Seán Ó hÉigeartaigh, Malo Bourgon, Qiaochu Yuan, Eric Rogstad, Andrew Snyder-Beattie, Michael Keenan, Gina Stuessy, Andrew Critch
GoodAI | 30 | Joseph Davidson, Šárka Krejčová, Shantesh Patil, Martin Poliak, Karolína H., Ryan Camilleri, Jose Solorzano, Alex Angelini, Dominik Čech, Reham Bukhari, Stephanie Wendler, Lucia Šicková, Šimon Šicko, Nicholas Guttenberg, Viktorie Knezkova, Steffen Eichler, Isabeau Premont-Schwarz, Filip Hauptfleisch, Jan Štafa, Michal Dvořák, Christine Lee, Will Millership, Petr Hlubuček, Marek Havrda, Jan Feyereisl, Olga Afanasjeva, Marek Rosa, Simon Andersson, Přemek Paška, Jaroslav Vitku
Ought | 25 | Lukas Finnveden, Owain Evans, Owen Cotton-Barratt, Luke Stebbing, Ian McKenzie, Justin Reppert, Eli Lifland, Amanda Ngo, Aparna Ashok, Jungwon Byun, Paul Christiano, Ozzie Gooen, Neal Jean, Milan Griffes, Girish Sastry, Chris Cundy, Ben Weinstein-Raun, Ben Goldhaber, Andrew Schreiber, Ben West, Ben Rachbach, Zachary Miller, Tom McGrath, Noah Goodman, Ryan Carey
AI Impacts | 15 | Jeffrey Heninger, Jimmy Rintjema, Richard Korzekwa, Katja Grace, Daniel Kokotajlo, Asya Bergal, Ronja Lutz, Tegan McCaslin, Paul Christiano, Ben Hoffman, Justis Mills, Connor Flexman, Finan Adamson, John Salvatier, Stephanie Zolayvar
Lightcone Infrastructure | 11 | Robert Mushkatblat, Rafe Kennedy, Oliver Habryka, Raymond Arnold, Ben Albert Pace, Ruben Bloom, Jacob Lagerros, Matthew Graves, Harmanas Chopra, Eric Rogstad, James Babcock
OpenAI | 9 | Mor Katz, Christopher Olah, Jeffrey Wu, Ethan Knight, Daniel Ziegler, Joshua Achiam, Geoffrey Irving, Paul Christiano, Dario Amodei
Road to AI Safety Excellence | 8 | Remmelt Ellen, Trent Fowler, Erik Istre, Rupert McCallum, Robert Miles, Johannes Heidecke, Veerle de Goederen, Toon Alfrink
AI Safety Camp | 7 | Remmelt Ellen, Jessica Cooper, Kristina Nemcova, Jirí Nadvorník, Anne Wissemann, Jan Kulveit, Johannes Heidecke
Center for Applied Rationality | 5 | Xavier Prospero, Logan Brienne Strohl, Luke Raskopf, Adom Hartell, Oliver Habryka
Foundational Research Institute | 5 | Brian Tomasik, Kaj Sotala, Caspar Oesterheld, Lukas Gloor, Tobias Baumann
Future of Humanity Institute | 5 | Sam Clarke, Tom McGrath, Sören Mindermann, Tamay Besiroglu, Anders Sandberg
EthicsNet | 4 | Aleksandra Orchowska, Remco Bloemen, Anish Mohammed, Nell Watson
Center for Human-Compatible AI | 3 | Christopher Cundy, Beth Barnes, Dmitrii Krasheninnikov
Google DeepMind | 3 | Vishal Maini, Pedro A. Ortega, Chris Maddison
AIDEUS | 2 | Sergey Rodionov, Alexey Potapov
Learning Intelligent Distribution Agent | 2 | Tamas Madl, Stan Franklin
University of Oxford | 2 | Ruth Fong, Chris Maddison
(no organization listed) | 1 | Angela P.
Australian National University | 1 | Jarryd Martin
Carnegie Mellon University | 1 | Noam Brown
Centre for Effective Altruism | 1 | Johannes Treutlein
ETH Zurich | 1 | Felix Berkenkamp
Institute of Ethics and Emerging Technologies | 1 | Steven Umbrello
Massachusetts Institute of Technology | 1 | Jon Gauthier
Oregon State University | 1 | Thomas Dietterich
Phenomenological AI Safety Research Institute | 1 | G Gordon Worley III
Sorbonne University | 1 | Michaël Trazzi
Stanford University | 1 | Aditi Raghunathan
The Australian National University | 1 | Michael Cohen
The Consortium on the Landscape of AI Safety | 1 | Alexis Carlier
University of Amsterdam | 1 | Dmitrii Krasheninnikov
University of California, Berkeley | 1 | Michael Janner
University of Toronto | 1 | Roger Grosse