AI Watch

Welcome! This is a website to track people and organizations in the AI safety/alignment/AI existential risk communities. A position or organization being on AI Watch does not indicate an assessment that that position or organization is actually making AI safer or that the position or organization is good for the world in any way. It is mostly a sociological indication that the position or organization is associated with these communities, as well as an indication that the position or organization claims to be working on AI safety or alignment. (There are some plans to eventually introduce such assessments on AI Watch, but for now there are none.) See the code repository for the source code and data of this website.

This website is developed by Issa Rice with data contributions from Sebastian Sanchez, Amana Rice, and Vipul Naik, and has been partially funded by Vipul Naik and Mati Roy (who in July 2023 paid for the time Issa had spent answering people’s questions about AI Watch up until that point).

Last updated on 2026-04-30; see here for a full list of recent changes.

Table of contents

Agendas

Agenda name Associated people Associated organizations
Iterated amplification Paul Christiano, Buck Shlegeris, Dario Amodei OpenAI
Embedded agency Eliezer Yudkowsky, Scott Garrabrant, Abram Demski Machine Intelligence Research Institute
Comprehensive AI services Eric Drexler Future of Humanity Institute
Ambitious value learning Stuart Armstrong Future of Humanity Institute
Factored cognition Andreas Stuhlmüller Ought
Recursive reward modeling Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg Google DeepMind
Debate Paul Christiano OpenAI
Interpretability Christopher Olah
Inverse reinforcement learning
Preference learning
Cooperative inverse reinforcement learning
Imitation learning
Alignment for advanced machine learning systems Jessica Taylor, Eliezer Yudkowsky, Patrick LaVictoire, Andrew Critch Machine Intelligence Research Institute
Learning-theoretic AI alignment Vanessa Kosoy
Counterfactual reasoning Jacob Steinhardt

Positions grouped by person

Showing 10 people with positions.

Name Number of organizations List of organizations
Paul Christiano 4 AI Impacts, Machine Intelligence Research Institute, OpenAI, Ought
Ben Goldhaber 3 FAR.AI, Fund for Alignment Research, Ought
Chris Cundy 3 FAR.AI, Fund for Alignment Research, Ought
Ian McKenzie 3 FAR.AI, Fund for Alignment Research, Ought
Jaan Tallinn 3 Berkeley Existential Risk Initiative, Centre for the Study of Existential Risk, Machine Intelligence Research Institute
Jimmy Rintjema 3 AI Impacts, Center for Applied Rationality, Machine Intelligence Research Institute
Oliver Habryka 3 Center for Applied Rationality, Lightcone Infrastructure, Machine Intelligence Research Institute
Ryan Carey 3 Centre for the Study of Existential Risk, Machine Intelligence Research Institute, Ought
Sawyer Bernath 3 Berkeley Existential Risk Initiative, FAR.AI, Fund for Alignment Research
Stuart Russell 3 Berkeley Existential Risk Initiative, Centre for the Study of Existential Risk, Machine Intelligence Research Institute

Positions grouped by organization

Showing 39 organizations.

Organization Number of people List of people
Machine Intelligence Research Institute 152 Alana Horowitz Friedman, Martin Lucas, David Abecassis, Aaron Scher, Brittany Ferrero, Joe Rogero, Mitchell Howe, Lisa Thiergart, Harlan Stewart, Jimmy Rintjema, Alex Vermeer, Gretta Duleba, Peter Barnett, James Payor, Edward Kmett, Victoria Krakovna, Blake Borgeson, Christine Peterson, Carson Jones, Ben Weinstein-Raun, Alex Zhu, Alex Mennen, Alex Appel, Linda Linsefors, Evan Hubinger, David Simmons, Daniel Demski, Buck Shlegeris, Nick Tarleton, Kurt Brown, Jesse Liptrap, Benjamin Mann, Sam Eisenstat, Jeremy Schlatter, Andrew Critch, Jan Leike, Matthew Graves, Ryan Carey, Aaron Silverbook, Connor Flexman, Colm Ó Riain, Gary Drescher, Andrew Lapinski-Barker, Robin Hanson, Kaya Stechly, Anna Salamon, Bart Selman, Stuart Russell, Ramana Kumar, Jack Gallagher, Jaan Tallinn, Vanessa Kosoy, Abram Demski, Stuart Armstrong, Nate Thomas, Scott Garrabrant, Rob Bensinger, Jessica Taylor, Jed McCaleb, Jake Moskowitz, Jesse Galef, Tsvi Benson-Tilsen, Matthew Fallshaw, Eliezer Yudkowsky, Benya Fallenstein, Peter Thiel, Liron Shapira, Nicolas Gagné, Elizabeth Morningstar, Lila Rieber, Vipul Naik, Nate Soares, Daniel Lewis, Richard Neal, Robert Mushkatblat, Dávid Natingga, Steve Omohundro, Roman Yampolskiy, Nick Bostrom, Nathan Clark, Moshe Looks, Kaj Sotala, Sebastian Nickel, Evan Erickson, Oliver Habryka, Bill Hibbard, Alex Altair, Paul Christiano, Patrick LaVictoire, Nisan Stiennon, Mihaly Barasz, Jeremy Miller, Stephen Barnes, Louie Helm, Patrick Robotham, Daniel Roth, Pedro Chaves, Topher Brennan, Carl Shulman, Nickolai Leschov, Jonathan Wang, Cameron Taylor, Tomer Kagan, Malo Bourgon, Jake Miller, Gwern Branwen, Erica Edelman, Luke Muehlhauser, Lincoln Quirk, Will Newsome, Nevin Freeman, Minda Myers, Keefe Roedersheimer, Diego Caleiro, Peter Scheyer, Peter de Blanc, Jasen Murray, Abraham Wolk, Thomas Colthurst, Stanislas Sochacki, Luke Grecki, Daniel Dewey, Janos Kramar, Dennis Fan, Ben Hoskin, Tim Czech, Jason Levin, Frank Adamek, Amy Willey, Kevin Fischer, Ray Kurzweil, Ben Goertzel, Edwin Evans, Steve Rayhawk, Michael Anissimov, Harrison Willey, Michael Blume, Kemal Eren, Andrew Rettek, Andriy Brodskyy, Andrew Hay, Vincent Fagot, Thomas McCabe, Steven Kaas, Roko Mijic, Katja Grace, Justin Shovelain, Henrik Jonsson, Bryan Bishop, Alyssa Vance, Marcello Herreshoff, Jeff Alexander
Centre for the Study of Existential Risk 100 Pablo Suarez, Reuben Makomere, Aarathi Krishnan, Thomas Moynihan, Elizabeth Cooper, Alexandra Klein, Shoshana Dahdi, Kennedy Mbeva, Julian Huppert, Taniel Yusef, Cecil Abungu, Madhulika Srikumar, Zoe Hemsley, Constantin Arnscheidt, Coleman Snell, Clarissa Rios Rojas, Dennis Müller, Sarah Dryhurst, Maurice Chiodo, Sam Clarke, Fazl Barez, Ross Gruetzemacher, Abdullahi Alim, Nathaniel Cooke, Paul Ingram, José Hernández-Orallo, Catherine Rhodes, Laura Elmer, Matthijs M. Maas, Freya Jephcott, Charlotte Christiane Hammer, Tom Hobson, Lara Mani, Shin-Shin Hua, Rumtin Sepasspour, S. J. Beard, Carla Zoe Cremer, Adrian Weller, Chris Lowe, Adrian Kent, Sean Holden, Stephen Hawking, Hermann Hauser, Tim Crane, David Cleevely, Jonathan Wiener, Max Tegmark, Peter Singer, Murray Shanahan, Dana Scott, Stuart Russell, Peter Piot, Tim Palmer, Elon Musk, Robert May, David Chalmers, Nick Bostrom, Margaret Boden, Martina Kunz, Beth Barnes, Yang Liu, Huw Price, Simon Goldhill, Jane Heal, Partha Dasgupta, Lalitha Sundaram, Haydn Belfield, Shahar Avin, Seán Ó hÉigeartaigh, Rachel Burgess, Alison Gopnik, Ryan Carey, Clare Arnstein, Jaan Tallinn, William Sutherland, Martin Rees, Susan Owens, Mami Mizutori, Piers Millett, Thomas Homer-Dixon, Robert Doubleday, Beatrice Crona, Matthew Connelly, Belinda Cleeland, Des Browne, Jessica Bland, Yuval Noah Harari, Olaf Corry, Seth Baum, Caroline Baylon, Sophie Dannreuther, Charlotte Hammer, Jaime Sevilla, David Krueger, Elizabeth Seger, Di Cooke, Mike Cassidy, James Ginns, Andrew Tanentzap, Allan Dafoe
Fund for Alignment Research 72 Tigist Diriba, Rick Korzekwa, Oscar Mata, Nick Louie, Matt Pallissard, Karolina Walęcik, Jasper Timm, Isadora de Andrade, Heather McIntyre, Hale Guyer, Frances Lorenz, Yordanos Asmare, Vits Voronkov, Thomas Costello, Stefan Heimersheim, Samuel Bauer, Sam Adam-Day, Roman Coussement, Matthew Kowal, Mark Nitzberg, Lukas Struppek, Liz Ibarra, Levon Avagyan, Lars Yencken, Kulraj Chavda, Kenya Scott, Jean-François Godbout, Helen Moser, Gordon Pennycook, David Rand, Antonio Arechar, Annie Lehman-Ludwig, Oskar Hollinsworth, Philip Quirke, Lindsay Murachver, Saad Siddiqui, Anastasiia Gaidashenko, Vael Gates, Siao Si Looi, Tony Wang, Taylor Boyle, Lilian Hughes, Jessica Lim, Isaac Levine, Edward Yee, Chris MacLeod, Chris Cundy, ChengCheng Tan, Ann-Kathrin Dombrowski, Aaron Tucker, Conor McGurk, Moritz von Knebel, Fynn Heide, Niki Howe, Ben Goldhaber, Adrià Garriga-Alonso, Adam Gleave, Sawyer Bernath, Lawrence Chan, Kellin Pelrine, Karl Berzins, Mohammad Taufeeque, Hannah Betts, Tom Tseng, Nora Belrose, Tomasz Korbak, Scott Emmons, Jun Shern Chan, Jérémy Scheurer, Ian McKenzie, Ethan Perez, Claudia Shi
FAR.AI 51 Jessica Lim, Edward Yee, Lilian Hughes, Chris Cundy, Taylor Boyle, Lindsay Murachver, Jeremy Rich, Saad Siddiqui, Oskar Hollinsworth, Anastasiia Gaidashenko, Philip Quirke, Brendan Murphy, Isabella Duan, Ian McKenzie, Dillon Bowen, Michał Zając, Siao Si Looi, Chris MacLeod, Aaron Tucker, Claudia Shi, Moritz von Knebel, Tony Wang, Fynn Heide, Kellin Pelrine, Ben Goldhaber, Lev McKinney, Adrià Garriga-Alonso, Pablo Moreno, Tomasz Korbak, Pedro Freire, Nora Belrose, Jérémy Scheurer, Juan Rocamonde, Alex Tamkin, Niki Howe, Ethan Perez, ChengCheng Tan, Adam Gleave, Nino Scherrer, Sawyer Bernath, Karl Berzins, Hannah Betts, Edmund Mills, Josh Jacobson, Alyse Spiehler, Tom Tseng, Mohammad Taufeeque, Joseph Miller, Euan McLean, Lawrence Chan, Scott Emmons
Leverhulme Centre for the Future of Intelligence 44 Raphael Hernandes, Demetrius A. Floudas, Tomasz Hollanek, Harriet C., Asher Kessler, José Hernández-Orallo, Diana Lengua, Farah Nanji, Kerry McInerney, Minja Axelsson, Leah Madelaine Schmidt, Christoffer Koch Andersen, Connor Wright, Suren Pahlevan, Helen Leung, Viviana Fascianella, Lina Vyšniauskienė, Konstantinos V., Aanya Niaz, Sammy McKinney, Julien Porquet, Benjamin Henke, Seraphina Zhang, Muhammed Alakitan, Wout Schellaert, Emily Elstub, Jedrzej Niklas, Milena Ivanova, Claire Benn, Flavia Saxler, Henry Shevlin, Aisha Sobey, Beryl Pong, Anna Odynets, Xiang Li, Adrian Weller, Matthijs M. Maas, Harry Law, Cassie Robinson, Daniel White, Carla Zoe Cremer, Jonnie Penn, Nóra Ní Loideáin, Huw Price
Berkeley Existential Risk Initiative 41 Michael Jemison, Sarah Otis, Cierra Johnson, Elisabeth Siegel, Karuna Nandkumar, Krystal Jackson, Deepika Raman, Andreas Pashos, Sawyer Bernath, Lara Lincoln, Jess Reidel, Elizabeth Cooper, Gary Menezes, Scott Singer, Kayla Blomquist, Joseph Castellano, Nada Madkour, Ian Baker, James Paul Gonzales, Jess Riedel, Stuart Russell, Sofia Davis-Fogel, Alex Flint, Josh Jacobson, Sam Bankman-Fried, Matt Fallshaw, Colleen Gleason, Jeremy Schlatter, Jaan Tallinn, Kyle Scott, Rebecca Raible, Kenzi Amodei, Jacob Tsimerman, Seán Ó hÉigeartaigh, Malo Bourgon, Qiaochu Yuan, Eric Rogstad, Andrew Snyder-Beattie, Michael Keenan, Gina Stuessy, Andrew Critch
GoodAI 30 Joseph Davidson, Šárka Krejčová, Shantesh Patil, Martin Poliak, Karolína H., Ryan Camilleri, Jose Solorzano, Alex Angelini, Dominik Čech, Reham Bukhari, Stephanie Wendler, Lucia Šicková, Šimon Šicko, Nicholas Guttenberg, Viktorie Knezkova, Steffen Eichler, Isabeau Premont-Schwarz, Filip Hauptfleisch, Jan Štafa, Michal Dvořák, Christine Lee, Will Millership, Petr Hlubuček, Marek Havrda, Jan Feyereisl, Olga Afanasjeva, Marek Rosa, Simon Andersson, Přemek Paška, Jaroslav Vitku
Ought 25 Lukas Finnveden, Owain Evans, Owen Cotton-Barratt, Luke Stebbing, Ian McKenzie, Justin Reppert, Eli Lifland, Amanda Ngo, Aparna Ashok, Jungwon Byun, Paul Christiano, Ozzie Gooen, Neal Jean, Milan Griffes, Girish Sastry, Chris Cundy, Ben Weinstein-Raun, Ben Goldhaber, Andrew Schreiber, Ben West, Ben Rachbach, Zachary Miller, Tom McGrath, Noah Goodman, Ryan Carey
AI Impacts 15 Jeffrey Heninger, Jimmy Rintjema, Katja Grace, Daniel Kokotajlo, Asya Bergal, Ronja Lutz, Richard Korzekwa, Tegan McCaslin, Paul Christiano, Ben Hoffman, Justis Mills, Connor Flexman, Finan Adamson, John Salvatier, Stephanie Zolayvar
Lightcone Infrastructure 11 Robert Mushkatblat, Rafe Kennedy, Oliver Habryka, Raymond Arnold, Ben Albert Pace, Ruben Bloom, Jacob Lagerros, Matthew Graves, Harmanas Chopra, Eric Rogstad, James Babcock
OpenAI 9 Mor Katz, Christopher Olah, Jeffrey Wu, Ethan Knight, Daniel Ziegler, Joshua Achiam, Geoffrey Irving, Paul Christiano, Dario Amodei
Road to AI Safety Excellence 8 Remmelt Ellen, Trent Fowler, Erik Istre, Rupert McCallum, Robert Miles, Johannes Heidecke, Veerle de Goederen, Toon Alfrink
AI Safety Camp 7 Remmelt Ellen, Jessica Cooper, Kristina Nemcova, Jirí Nadvorník, Anne Wissemann, Jan Kulveit, Johannes Heidecke
Center for Applied Rationality 5 Jimmy Rintjema, Logan Brienne Strohl, Luke Raskopf, Adom Hartell, Oliver Habryka
Foundational Research Institute 5 Brian Tomasik, Kaj Sotala, Caspar Oesterheld, Lukas Gloor, Tobias Baumann
Future of Humanity Institute 5 Sam Clarke, Tom McGrath, Sören Mindermann, Tamay Besiroglu, Anders Sandberg
EthicsNet 4 Aleksandra Orchowska, Remco Bloemen, Anish Mohammed, Nell Watson
Center for Human-Compatible AI 3 Christopher Cundy, Beth Barnes, Dmitrii Krasheninnikov
Google DeepMind 3 Vishal Maini, Pedro A. Ortega, Chris Maddison
AIDEUS 2 Sergey Rodionov, Alexey Potapov
Learning Intelligent Distribution Agent 2 Tamas Madl, Stan Franklin
University of Oxford 2 Ruth Fong, Chris Maddison
1 Angela P.
Australian National University 1 Jarryd Martin
Carnegie Mellon University 1 Noam Brown
Centre for Effective Altruism 1 Johannes Treutlein
ETH Zurich 1 Felix Berkenkamp
Humans in Control 1 Vael Gates
Institute of Ethics and Emerging Technologies 1 Steven Umbrello
Massachusetts Institute of Technology 1 Jon Gauthier
Oregon State University 1 Thomas Dietterich
Phenomenological AI Safety Research Institute 1 G Gordon Worley III
Sorbonne University 1 Michaël Trazzi
Stanford University 1 Aditi Raghunathan
The Australian National University 1 Michael Cohen
The Consortium on the Landscape of AI Safety 1 Alexis Carlier
University of Amsterdam 1 Dmitrii Krasheninnikov
University of California, Berkeley 1 Michael Janner
University of Toronto 1 Roger Grosse