AI Watch

Welcome! This is a website to track people and organizations working on AI safety. See the code repository for the source code and data of this website.

This website is developed by Issa Rice with data contributions from Sebastian Sanchez, Amana Rice, and Vipul Naik, and has been partially funded by Vipul Naik.

If you like (or want to like) this website and have money: the current funder mostly funds only data updates for existing organizations and the addition of data for some new effective altruist organizations. As a result, the site is not getting any new features or design improvements. If you want to bring this site to the next level, contact Issa at riceissa@gmail.com. What you get: site improvements, recognition in the site credits. What the site needs: money.

If you have time and want experience building websites: this website is looking for contributors. If you want to help out, contact Issa at riceissa@gmail.com. What you get: little or no pay (this could change if the site gets funding; see the previous paragraph), recognition in the site credits, the privilege of working with me, and knowledge of the basics of web development (MySQL, PHP, Git). What the site needs: data collection/entry and website code improvements.

Last updated on 2023-01-02; see here for a full list of recent changes.

Agendas

Agenda name | Associated people | Associated organizations
Iterated amplification | Paul Christiano, Buck Shlegeris, Dario Amodei | OpenAI
Embedded agency | Eliezer Yudkowsky, Scott Garrabrant, Abram Demski | Machine Intelligence Research Institute
Comprehensive AI services | Eric Drexler | Future of Humanity Institute
Ambitious value learning | Stuart Armstrong | Future of Humanity Institute
Factored cognition | Andreas Stuhlmüller | Ought
Recursive reward modeling | Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg | Google DeepMind
Debate | Paul Christiano | OpenAI
Interpretability | Christopher Olah
Inverse reinforcement learning
Preference learning
Cooperative inverse reinforcement learning
Imitation learning
Alignment for advanced machine learning systems | Jessica Taylor, Eliezer Yudkowsky, Patrick LaVictoire, Andrew Critch | Machine Intelligence Research Institute
Learning-theoretic AI alignment | Vanessa Kosoy
Counterfactual reasoning | Jacob Steinhardt

Positions grouped by person

Showing 24 people with positions.

Name | Number of organizations | List of organizations
Paul Christiano | 5 | AI Impacts, Machine Intelligence Research Institute, OpenAI, Ought, Theiss Research
Oliver Habryka | 3 | Center for Applied Rationality, LessWrong 2.0, Machine Intelligence Research Institute
Ryan Carey | 3 | Centre for the Study of Existential Risk, Machine Intelligence Research Institute, Ought
Andrew Critch | 2 | Berkeley Existential Risk Initiative, Machine Intelligence Research Institute
Ben Weinstein-Raun | 2 | Machine Intelligence Research Institute, Ought
Beth Barnes | 2 | Center for Human-Compatible AI, Centre for the Study of Existential Risk
Brian Tomasik | 2 | Center for Reducing Suffering, Foundational Research Institute
Chris Maddison | 2 | Google DeepMind, University of Oxford
Connor Flexman | 2 | AI Impacts, Machine Intelligence Research Institute
Dmitrii Krasheninnikov | 2 | Center for Human-Compatible AI, University of Amsterdam
Eric Rogstad | 2 | Berkeley Existential Risk Initiative, LessWrong 2.0
Jaan Tallinn | 2 | Centre for the Study of Existential Risk, Machine Intelligence Research Institute
Jeremy Schlatter | 2 | Berkeley Existential Risk Initiative, Machine Intelligence Research Institute
Jimmy Rintjema | 2 | AI Impacts, Machine Intelligence Research Institute
Johannes Heidecke | 2 | AI Safety Camp, Road to AI Safety Excellence
Kaj Sotala | 2 | Foundational Research Institute, Machine Intelligence Research Institute
Matthew Graves | 2 | LessWrong 2.0, Machine Intelligence Research Institute
Nick Bostrom | 2 | Centre for the Study of Existential Risk, Machine Intelligence Research Institute
Qiaochu Yuan | 2 | Berkeley Existential Risk Initiative, Machine Intelligence Research Institute
Remmelt Ellen | 2 | AI Safety Camp, Road to AI Safety Excellence
Sam Clarke | 2 | Future of Humanity Institute, The Future Society
Stuart Russell | 2 | Centre for the Study of Existential Risk, Machine Intelligence Research Institute
Tobias Baumann | 2 | Center for Reducing Suffering, Foundational Research Institute
Tom McGrath | 2 | Future of Humanity Institute, Ought

Positions grouped by organization

Showing 38 organizations.

Organization | Number of people | List of people
Machine Intelligence Research Institute | 143 | James Payor, Edward Kmett, Victoria Krakovna, Blake Borgeson, Carson Jones, Alex Zhu, Alex Mennen, Alex Appel, Linda Linsefors, Evan Hubinger, David Simmons, Daniel Demski, Ben Weinstein-Raun, Qiaochu Yuan, Buck Shlegeris, Nick Tarleton, Kurt Brown, Jesse Liptrap, Tsvi Benson-Tilsen, Sam Eisenstat, Benjamin Mann, Jeremy Schlatter, Andrew Critch, Jan Leike, Matthew Graves, Ryan Carey, Aaron Silverbook, Connor Flexman, Colm Ó Riain, Malo Bourgon, Gary Drescher, Andrew Lapinski-Barker, Robin Hanson, Kaya Stechly, Nate Soares, Anna Salamon, Vanessa Kosoy, Stuart Russell, Ramana Kumar, Jed McCaleb, Jack Gallagher, Jaan Tallinn, Bart Selman, Abram Demski, Stuart Armstrong, Nate Thomas, Scott Garrabrant, Rob Bensinger, Jessica Taylor, Jake Moskowitz, Jesse Galef, Matthew Fallshaw, Peter Thiel, Liron Shapira, Christine Peterson, Nicolas Gagné, Elizabeth Morningstar, Antonius Lourenço Kasbergen, Lila Rieber, Katja Grace, Vipul Naik, Jimmy Rintjema, Daniel Lewis, Richard Neal, Robert Mushkatblat, Dávid Natingga, Steve Omohundro, Roman Yampolskiy, Nick Bostrom, Nathan Clark, Moshe Looks, Kaj Sotala, Sebastian Nickel, Evan Erickson, Oliver Habryka, Alex Altair, Paul Christiano, Patrick LaVictoire, Nisan Stiennon, Mihaly Barasz, Jeremy Miller, Bill Hibbard, Benya Fallenstein, Stephen Barnes, Louie Helm, Patrick Robotham, Daniel Roth, Pedro Chaves, Topher Brennan, Carl Shulman, Nickolai Leschov, Jonathan Wang, Cameron Taylor, Alex Vermeer, Tomer Kagan, Jake Miller, Gwern Branwen, Erica Edelman, Luke Muehlhauser, Lincoln Quirk, Will Newsome, Nevin Freeman, Minda Myers, Keefe Roedersheimer, Diego Caleiro, Peter Scheyer, Peter de Blanc, Jasen Murray, Abraham Wolk, Thomas Colthurst, Stanislas Sochacki, Luke Grecki, Daniel Dewey, Janos Kramar, Dennis Fan, Ben Hoskin, Tim Czech, Jason Levin, Frank Adamek, Amy Willey, Kevin Fischer, Ray Kurzweil, Ben Goertzel, Steve Rayhawk, Michael Anissimov, Harrison Willey, Eliezer Yudkowsky, Michael Blume, Kemal Eren, Andrew Rettek, Andriy Brodskyy, Andrew Hay, Vincent Fagot, Thomas McCabe, Steven Kaas, Roko Mijic, Justin Shovelain, Henrik Jonsson, Bryan Bishop, Alyssa Vance, Marcello Herreshoff, Jeff Alexander, Mariah Wang
GoodAI | 34 | Karolína H., Ryan Camilleri, Jose Solorzano, Alex Angelini, Sarka Krejcova, Reham Bukhari, Stephanie Wendler, Petr Šimánek, Viktorie Knezkova, Steffen Eichler, Isabeau Premont-Schwarz, Petr Šrámek, Christine Lee, Michal Dvořák, Will Millership, Lucie Krestova, Marek Havrda, Jan Feyereisl, Olga Afanasjeva, Marek Rosa, Přemek Paška, Simon Andersson, Jaroslav Vitku, Wendelin Boehmer, Dominik Čech, Lucia Šicková, Šimon Šicko, Filip Hauptfleisch, Jan Štafa, Shantesh Patil, Joseph Davidson, Petr Hlubuček, Nicholas Guttenberg, Martin Poliak
Ought | 26 | Ian McKenzie, Luke Stebbing, Justin Reppert, Aisha Aishwarya, Eli Lifland, Amanda Ngo, Aparna Ashok, Jungwon Byun, Chris Cundy, Girish Sastry, Neal Jean, Ryan Carey, Zac Kenton, Tom McGrath, Ben Weinstein-Raun, Ozzie Gooen, Milan Griffes, Zachary Miller, Ben Goldhaber, Ben West, Ben Rachbach, Noah Goodman, Paul Christiano, Owain Evans, Andreas Stuhlmüller, Andrew Schreiber
Centre for the Study of Existential Risk | 24 | Adrian Weller, Sean Holden, Stephen Hawking, Tim Crane, Max Tegmark, Murray Shanahan, Dana Scott, Stuart Russell, Elon Musk, Alison Gopnik, David Chalmers, Nick Bostrom, Margaret Boden, Ryan Carey, Martina Kunz, Seth Baum, Beth Barnes, Yang Liu, Jaan Tallinn, Martin Rees, Huw Price, Haydn Belfield, Shahar Avin, Seán Ó hÉigeartaigh
AI Impacts | 13 | Daniel Kokotajlo, Asya Bergal, Rick Korzekwa, Ronja Lutz, Tegan McCaslin, Paul Christiano, Jimmy Rintjema, Justis Mills, Connor Flexman, Finan Adamson, John Salvatier, Stephanie Zolayvar, Ben Hoffman
OpenAI | 10 | Mor Katz, Sam McCandlish, Christopher Olah, Jeffrey Wu, Ethan Knight, Daniel Ziegler, Joshua Achiam, Geoffrey Irving, Paul Christiano, Dario Amodei
Berkeley Existential Risk Initiative | 9 | Sofia Davis-Fogel, Alex Flint, Josh Jacobson, Sam Bankman-Fried, Colleen Gleason, Jeremy Schlatter, Qiaochu Yuan, Eric Rogstad, Andrew Critch
LessWrong 2.0 | 8 | Ruben Bloom, Raymond Arnold, Eric Rogstad, Harmanas Chopra, Ben Albert Pace, Matthew Graves, Oliver Habryka, James Babcock
Road to AI Safety Excellence | 8 | Remmelt Ellen, Trent Fowler, Erik Istre, Rupert McCallum, Robert Miles, Johannes Heidecke, Veerle de Goederen, Toon Alfrink
AI Safety Camp | 7 | Remmelt Ellen, Jessica Cooper, Kristina Nemcova, Jirí Nadvorník, Anne Wissemann, Jan Kulveit, Johannes Heidecke
Center for Applied Rationality | 5 | Xavier Prospero, Brienne Strohl, Luke Raskopf, Adom Hartell, Oliver Habryka
Center for Reducing Suffering | 5 | Winston Oswald-Drummond, Magnus Vinding, Teo Ajantaival, Brian Tomasik, Tobias Baumann
Foundational Research Institute | 5 | Brian Tomasik, Kaj Sotala, Caspar Oesterheld, Lukas Gloor, Tobias Baumann
Future of Humanity Institute | 5 | Sam Clarke, Tom McGrath, Sören Mindermann, Tamay Besiroglu, Anders Sandberg
EthicsNet | 4 | Aleksandra Orchowska, Remco Bloemen, Anish Mohammed, Nell Watson
Center for Human-Compatible AI | 3 | Christopher Cundy, Beth Barnes, Dmitrii Krasheninnikov
Google DeepMind | 3 | Vishal Maini, Pedro A. Ortega, Chris Maddison
AIDEUS | 2 | Sergey Rodionov, Alexey Potapov
Learning Intelligent Distribution Agent | 2 | Tamas Madl, Stan Franklin
University of Oxford | 2 | Ruth Fong, Chris Maddison
(no organization listed) | 1 | Angela P.
Australian National University | 1 | Jarryd Martin
Carnegie Mellon University | 1 | Noam Brown
Centre for Effective Altruism | 1 | Johannes Treutlein
ETH Zurich | 1 | Felix Berkenkamp
Institute of Ethics and Emerging Technologies | 1 | Steven Umbrello
Massachusetts Institute of Technology | 1 | Jon Gauthier
Oregon State University | 1 | Thomas Dietterich
Phenomenological AI Safety Research Institute | 1 | G Gordon Worley III
Sorbonne University | 1 | Michaël Trazzi
Stanford University | 1 | Aditi Raghunathan
The Australian National University | 1 | Michael Cohen
The Consortium on the Landscape of AI Safety | 1 | Alexis Carlier
The Future Society | 1 | Sam Clarke
Theiss Research | 1 | Paul Christiano
University of Amsterdam | 1 | Dmitrii Krasheninnikov
University of California, Berkeley | 1 | Michael Janner
University of Toronto | 1 | Roger Grosse