AI Watch

Welcome! This is a website to track people and organizations in the AI safety/alignment/AI existential risk communities. A position or organization being on AI Watch does not indicate an assessment that that position or organization is actually making AI safer or that the position or organization is good for the world in any way. It is mostly a sociological indication that the position or organization is associated with these communities, as well as an indication that the position or organization claims to be working on AI safety or alignment. (There are some plans to eventually introduce such assessments on AI Watch, but for now there are none.) See the code repository for the source code and data of this website.

This website is developed by Issa Rice with data contributions from Sebastian Sanchez, Amana Rice, and Vipul Naik, and has been partially funded by Vipul Naik and Mati Roy (who in July 2023 paid for the time Issa had spent answering people’s questions about AI Watch up until that point).

Last updated on 2026-03-29; see here for a full list of recent changes.

Table of contents

Agendas

Agenda name Associated people Associated organizations
Iterated amplification Paul Christiano, Buck Shlegeris, Dario Amodei OpenAI
Embedded agency Eliezer Yudkowsky, Scott Garrabrant, Abram Demski Machine Intelligence Research Institute
Comprehensive AI services Eric Drexler Future of Humanity Institute
Ambitious value learning Stuart Armstrong Future of Humanity Institute
Factored cognition Andreas Stuhlmüller Ought
Recursive reward modeling Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg Google DeepMind
Debate Paul Christiano OpenAI
Interpretability Christopher Olah
Inverse reinforcement learning
Preference learning
Cooperative inverse reinforcement learning
Imitation learning
Alignment for advanced machine learning systems Jessica Taylor, Eliezer Yudkowsky, Patrick LaVictoire, Andrew Critch Machine Intelligence Research Institute
Learning-theoretic AI alignment Vanessa Kosoy
Counterfactual reasoning Jacob Steinhardt

Positions grouped by person

Showing 5 people with positions.

Name Number of organizations List of organizations
Paul Christiano 4 AI Impacts, Machine Intelligence Research Institute, OpenAI, Ought
Jaan Tallinn 3 Berkeley Existential Risk Initiative, Centre for the Study of Existential Risk, Machine Intelligence Research Institute
Oliver Habryka 3 Center for Applied Rationality, Lightcone Infrastructure, Machine Intelligence Research Institute
Ryan Carey 3 Centre for the Study of Existential Risk, Machine Intelligence Research Institute, Ought
Stuart Russell 3 Berkeley Existential Risk Initiative, Centre for the Study of Existential Risk, Machine Intelligence Research Institute

Positions grouped by organization

Showing 37 organizations.

Organization Number of people List of people
Machine Intelligence Research Institute 152 Alana Horowitz Friedman, Martin Lucas, David Abecassis, Aaron Scher, Brittany Ferrero, Joe Rogero, Mitchell Howe, Harlan Stewart, Jimmy Rintjema, Lisa Thiergart, Gretta Duleba, Peter Barnett, James Payor, Edward Kmett, Victoria Krakovna, Blake Borgeson, Christine Peterson, Carson Jones, Linda Linsefors, Evan Hubinger, David Simmons, Daniel Demski, Ben Weinstein-Raun, Alex Zhu, Alex Mennen, Alex Appel, Buck Shlegeris, Nick Tarleton, Kurt Brown, Jesse Liptrap, Sam Eisenstat, Benjamin Mann, Jeremy Schlatter, Jan Leike, Andrew Critch, Matthew Graves, Ryan Carey, Connor Flexman, Colm Ó Riain, Aaron Silverbook, Gary Drescher, Robin Hanson, Kaya Stechly, Andrew Lapinski-Barker, Anna Salamon, Stuart Russell, Ramana Kumar, Jack Gallagher, Jaan Tallinn, Bart Selman, Vanessa Kosoy, Stuart Armstrong, Nate Thomas, Abram Demski, Scott Garrabrant, Rob Bensinger, Jessica Taylor, Jed McCaleb, Jake Moskowitz, Jesse Galef, Tsvi Benson-Tilsen, Matthew Fallshaw, Eliezer Yudkowsky, Peter Thiel, Liron Shapira, Nicolas Gagné, Elizabeth Morningstar, Lila Rieber, Vipul Naik, Nate Soares, Daniel Lewis, Richard Neal, Robert Mushkatblat, Dávid Natingga, Steve Omohundro, Roman Yampolskiy, Nick Bostrom, Nathan Clark, Moshe Looks, Kaj Sotala, Sebastian Nickel, Evan Erickson, Oliver Habryka, Paul Christiano, Patrick LaVictoire, Nisan Stiennon, Mihaly Barasz, Jeremy Miller, Bill Hibbard, Benya Fallenstein, Alex Altair, Stephen Barnes, Louie Helm, Patrick Robotham, Daniel Roth, Pedro Chaves, Topher Brennan, Carl Shulman, Nickolai Leschov, Jonathan Wang, Cameron Taylor, Tomer Kagan, Malo Bourgon, Jake Miller, Gwern Branwen, Erica Edelman, Alex Vermeer, Luke Muehlhauser, Lincoln Quirk, Will Newsome, Nevin Freeman, Minda Myers, Keefe Roedersheimer, Diego Caleiro, Peter Scheyer, Peter de Blanc, Jasen Murray, Thomas Colthurst, Stanislas Sochacki, Luke Grecki, Daniel Dewey, Abraham Wolk, Janos Kramar, Dennis Fan, Ben Hoskin, Tim Czech, Jason Levin, Frank Adamek, Amy Willey, Kevin Fischer, Ray Kurzweil, Ben Goertzel, Steve Rayhawk, Michael Anissimov, Harrison Willey, Edwin Evans, Michael Blume, Kemal Eren, Andrew Rettek, Vincent Fagot, Thomas McCabe, Steven Kaas, Roko Mijic, Katja Grace, Justin Shovelain, Henrik Jonsson, Andriy Brodskyy, Andrew Hay, Bryan Bishop, Alyssa Vance, Marcello Herreshoff, Jeff Alexander
Centre for the Study of Existential Risk 100 Pablo Suarez, Reuben Makomere, Aarathi Krishnan, Thomas Moynihan, Elizabeth Cooper, Paul Ingram, Alexandra Klein, Laura Elmer, Shoshana Dahdi, Kennedy Mbeva, Julian Huppert, Taniel Yusef, Cecil Abungu, Madhulika Srikumar, Zoe Hemsley, Matthew Connelly, Constantin Arnscheidt, Coleman Snell, Clarissa Rios Rojas, Dennis Müller, Sarah Dryhurst, Maurice Chiodo, Sam Clarke, Fazl Barez, Ross Gruetzemacher, Abdullahi Alim, Nathaniel Cooke, José Hernández-Orallo, Jessica Bland, Catherine Rhodes, Matthijs M. Maas, Charlotte Christiane Hammer, Freya Jephcott, Tom Hobson, Lara Mani, Shin-Shin Hua, Shahar Avin, Rumtin Sepasspour, Seán Ó hÉigeartaigh, S. J. Beard, Clare Arnstein, Carla Zoe Cremer, Adrian Weller, Chris Lowe, Adrian Kent, Sean Holden, Stephen Hawking, Hermann Hauser, Tim Crane, David Cleevely, Jonathan Wiener, Max Tegmark, Peter Singer, Murray Shanahan, Dana Scott, Stuart Russell, Peter Piot, Tim Palmer, Elon Musk, Robert May, David Chalmers, Nick Bostrom, Margaret Boden, Martina Kunz, Beth Barnes, Yang Liu, Simon Goldhill, Jane Heal, Partha Dasgupta, Lalitha Sundaram, Haydn Belfield, Huw Price, Jaan Tallinn, Martin Rees, Alison Gopnik, Ryan Carey, William Sutherland, Susan Owens, Mami Mizutori, Piers Millett, Thomas Homer-Dixon, Robert Doubleday, Beatrice Crona, Belinda Cleeland, Des Browne, Yuval Noah Harari, Olaf Corry, Seth Baum, Caroline Baylon, Sophie Dannreuther, Charlotte Hammer, Jaime Sevilla, David Krueger, Elizabeth Seger, Di Cooke, Mike Cassidy, James Ginns, Andrew Tanentzap, Allan Dafoe, Rachel Burgess
FAR.AI 51 Jessica Lim, Edward Yee, Lilian Hughes, Taylor Boyle, Chris Cundy, Lindsay Murachver, Anastasiia Gaidashenko, Jeremy Rich, Saad Siddiqui, Oskar Hollinsworth, Philip Quirke, Brendan Murphy, Isabella Duan, Ian McKenzie, Dillon Bowen, Aaron Tucker, Michał Zając, Siao Si Looi, Chris MacLeod, Claudia Shi, Moritz von Knebel, Tony Wang, Fynn Heide, Kellin Pelrine, Ben Goldhaber, Lev McKinney, Adrià Garriga-Alonso, Adam Gleave, Pablo Moreno, Tomasz Korbak, Pedro Freire, Nora Belrose, Jérémy Scheurer, Juan Rocamonde, Alex Tamkin, Niki Howe, Ethan Perez, ChengCheng Tan, Karl Berzins, Nino Scherrer, Sawyer Bernath, Hannah Betts, Edmund Mills, Josh Jacobson, Alyse Spiehler, Tom Tseng, Mohammad Taufeeque, Joseph Miller, Euan McLean, Lawrence Chan, Scott Emmons
Leverhulme Centre for the Future of Intelligence 44 Raphael Hernandes, Demetrius A. Floudas, Harriet C., Asher Kessler, Diana Lengua, Farah Nanji, Minja Axelsson, Leah Madelaine Schmidt, Christoffer Koch Andersen, Connor Wright, Suren Pahlevan, Helen Leung, Viviana Fascianella, Lina Vyšniauskienė, Konstantinos V., Aanya Niaz, Sammy McKinney, Julien Porquet, Benjamin Henke, Seraphina Zhang, Kerry McInerney, Muhammed Alakitan, Wout Schellaert, Emily Elstub, Jedrzej Niklas, Milena Ivanova, Claire Benn, Flavia Saxler, Henry Shevlin, Aisha Sobey, Beryl Pong, Anna Odynets, Xiang Li, Matthijs M. Maas, Tomasz Hollanek, Harry Law, Cassie Robinson, Daniel White, Adrian Weller, Carla Zoe Cremer, Jonnie Penn, Nóra Ní Loideáin, José Hernández-Orallo, Huw Price
Berkeley Existential Risk Initiative 41 Michael Jemison, Sarah Otis, Cierra Johnson, Elisabeth Siegel, Karuna Nandkumar, Krystal Jackson, Deepika Raman, Andreas Pashos, Lara Lincoln, Jess Reidel, Gary Menezes, Scott Singer, Kayla Blomquist, Joseph Castellano, Nada Madkour, Elizabeth Cooper, Ian Baker, James Paul Gonzales, Jess Riedel, Stuart Russell, Sawyer Bernath, Sofia Davis-Fogel, Alex Flint, Josh Jacobson, Sam Bankman-Fried, Matt Fallshaw, Colleen Gleason, Jeremy Schlatter, Jaan Tallinn, Kyle Scott, Rebecca Raible, Kenzi Amodei, Jacob Tsimerman, Seán Ó hÉigeartaigh, Malo Bourgon, Qiaochu Yuan, Eric Rogstad, Andrew Snyder-Beattie, Michael Keenan, Gina Stuessy, Andrew Critch
GoodAI 30 Joseph Davidson, Šárka Krejčová, Shantesh Patil, Martin Poliak, Karolína H., Ryan Camilleri, Jose Solorzano, Alex Angelini, Dominik Čech, Reham Bukhari, Stephanie Wendler, Lucia Šicková, Šimon Šicko, Nicholas Guttenberg, Viktorie Knezkova, Steffen Eichler, Isabeau Premont-Schwarz, Filip Hauptfleisch, Jan Štafa, Michal Dvořák, Christine Lee, Will Millership, Petr Hlubuček, Marek Havrda, Jan Feyereisl, Olga Afanasjeva, Marek Rosa, Simon Andersson, Přemek Paška, Jaroslav Vitku
Ought 25 Lukas Finnveden, Owain Evans, Owen Cotton-Barratt, Luke Stebbing, Ian McKenzie, Justin Reppert, Eli Lifland, Amanda Ngo, Aparna Ashok, Jungwon Byun, Paul Christiano, Ozzie Gooen, Neal Jean, Milan Griffes, Girish Sastry, Chris Cundy, Ben Weinstein-Raun, Ben Goldhaber, Andrew Schreiber, Ben West, Ben Rachbach, Zachary Miller, Tom McGrath, Noah Goodman, Ryan Carey
AI Impacts 15 Jeffrey Heninger, Jimmy Rintjema, Richard Korzekwa, Katja Grace, Daniel Kokotajlo, Asya Bergal, Ronja Lutz, Tegan McCaslin, Paul Christiano, Ben Hoffman, Justis Mills, Connor Flexman, Finan Adamson, John Salvatier, Stephanie Zolayvar
Lightcone Infrastructure 11 Robert Mushkatblat, Rafe Kennedy, Oliver Habryka, Raymond Arnold, Ben Albert Pace, Ruben Bloom, Jacob Lagerros, Matthew Graves, Harmanas Chopra, Eric Rogstad, James Babcock
OpenAI 9 Mor Katz, Christopher Olah, Jeffrey Wu, Ethan Knight, Daniel Ziegler, Joshua Achiam, Geoffrey Irving, Paul Christiano, Dario Amodei
Road to AI Safety Excellence 8 Remmelt Ellen, Trent Fowler, Erik Istre, Rupert McCallum, Robert Miles, Johannes Heidecke, Veerle de Goederen, Toon Alfrink
AI Safety Camp 7 Remmelt Ellen, Jessica Cooper, Kristina Nemcova, Jirí Nadvorník, Anne Wissemann, Jan Kulveit, Johannes Heidecke
Center for Applied Rationality 5 Xavier Prospero, Logan Brienne Strohl, Luke Raskopf, Adom Hartell, Oliver Habryka
Foundational Research Institute 5 Brian Tomasik, Kaj Sotala, Caspar Oesterheld, Lukas Gloor, Tobias Baumann
Future of Humanity Institute 5 Sam Clarke, Tom McGrath, Sören Mindermann, Tamay Besiroglu, Anders Sandberg
EthicsNet 4 Aleksandra Orchowska, Remco Bloemen, Anish Mohammed, Nell Watson
Center for Human-Compatible AI 3 Christopher Cundy, Beth Barnes, Dmitrii Krasheninnikov
Google DeepMind 3 Vishal Maini, Pedro A. Ortega, Chris Maddison
AIDEUS 2 Sergey Rodionov, Alexey Potapov
Learning Intelligent Distribution Agent 2 Tamas Madl, Stan Franklin
University of Oxford 2 Ruth Fong, Chris Maddison
1 Angela P.
Australian National University 1 Jarryd Martin
Carnegie Mellon University 1 Noam Brown
Centre for Effective Altruism 1 Johannes Treutlein
ETH Zurich 1 Felix Berkenkamp
Institute of Ethics and Emerging Technologies 1 Steven Umbrello
Massachusetts Institute of Technology 1 Jon Gauthier
Oregon State University 1 Thomas Dietterich
Phenomenological AI Safety Research Institute 1 G Gordon Worley III
Sorbonne University 1 Michaël Trazzi
Stanford University 1 Aditi Raghunathan
The Australian National University 1 Michael Cohen
The Consortium on the Landscape of AI Safety 1 Alexis Carlier
University of Amsterdam 1 Dmitrii Krasheninnikov
University of California, Berkeley 1 Michael Janner
University of Toronto 1 Roger Grosse