AI Watch

Welcome! This is a website to track people and organizations working on AI safety. See the code repository for the source code and data of this website.

This website is developed by Issa Rice and has been partially funded by Vipul Naik.

If you like (or want to like) this website and have money: the current funder doesn't want to continue funding this project. As a result, it is currently mostly sitting around. If you want to bring this site to the next level, contact Issa at What you get: site improvements, recognition in the site credits. What the site needs: money.

If you have time and want experience building websites: this website is looking for contributors. If you want to help out, contact Issa at What you get: little or no pay (this could change if the site gets funding; see previous paragraph), recognition in the site credits, privilege of working with me, knowledge of the basics of web development (MySQL, PHP, Git). What the site needs: data collection/entry and website code improvements.

Last updated on 2019-11-27.

Table of contents


Agenda name Associated people Associated organizations
Iterated amplification Paul Christiano, Buck Shlegeris, Dario Amodei OpenAI
Embedded agency Eliezer Yudkowsky, Scott Garrabrant, Abram Demski Machine Intelligence Research Institute
Comprehensive AI services Eric Drexler Future of Humanity Institute
Ambitious value learning Stuart Armstrong Future of Humanity Institute
Factored cognition Andreas Stuhlmüller Ought
Recursive reward modeling Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg Google DeepMind
Debate Paul Christiano OpenAI
Interpretability Christopher Olah
Inverse reinforcement learning
Preference learning
Cooperative inverse reinforcement learning
Imitation learning
Alignment for advanced machine learning systems Jessica Taylor, Eliezer Yudkowsky, Patrick LaVictoire, Andrew Critch Machine Intelligence Research Institute
Learning-theoretic AI alignment Vanessa Kosoy
Counterfactual reasoning Jacob Steinhardt

Positions grouped by person

Showing 32 people with positions.

Name Number of organizations List of organizations
Chris Maddison 2 Google DeepMind, University of Oxford
Paul Christiano 2 OpenAI, Theiss Research
Aditi Raghunathan 1 Stanford University
Alex Appel 1 Machine Intelligence Research Institute
Alex Mennen 1 Machine Intelligence Research Institute
Alex Zhu 1 Machine Intelligence Research Institute
Alexey Potapov 1 AIDEUS
Beth Barnes 1 Center for Human-Compatible AI
Caspar Oesterheld 1 Foundational Research Institute
Christopher Cundy 1 Center for Human-Compatible AI
Christopher Olah 1 OpenAI
Daniel Demski 1 Machine Intelligence Research Institute
Dario Amodei 1 OpenAI
David Simmons 1 Machine Intelligence Research Institute
Dmitrii Krasheninnikov 1 Center for Human-Compatible AI
Eliezer Yudkowsky 1 Machine Intelligence Research Institute
Evan Hubinger 1 Machine Intelligence Research Institute
Felix Berkenkamp 1 ETH Zurich
Geoffrey Irving 1 OpenAI
Jarryd Martin 1 Australian National University
Jon Gauthier 1 Massachusetts Institute of Technology
Linda Linsefors 1 Machine Intelligence Research Institute
Michael Janner 1 University of California, Berkeley
Noam Brown 1 Carnegie Mellon University
Patrick LaVictoire 1 Machine Intelligence Research Institute
Pedro A. Ortega 1 Google DeepMind
Roger Grosse 1 University of Toronto
Ruth Fong 1 University of Oxford
Sergey Rodionov 1 AIDEUS
Sören Mindermann 1 Future of Humanity Institute
Stan Franklin 1 Learning Intelligent Distribution Agent
Tamas Madl 1 Learning Intelligent Distribution Agent

Positions grouped by organization

Showing 17 organizations.

Organization Number of people List of people
Machine Intelligence Research Institute 9 Alex Zhu, David Simmons, Alex Mennen, Linda Linsefors, Evan Hubinger, Daniel Demski, Alex Appel, Patrick LaVictoire, Eliezer Yudkowsky
OpenAI 4 Christopher Olah, Geoffrey Irving, Paul Christiano, Dario Amodei
Center for Human-Compatible AI 3 Beth Barnes, Dmitrii Krasheninnikov, Christopher Cundy
AIDEUS 2 Sergey Rodionov, Alexey Potapov
Google DeepMind 2 Pedro A. Ortega, Chris Maddison
Learning Intelligent Distribution Agent 2 Tamas Madl, Stan Franklin
University of Oxford 2 Ruth Fong, Chris Maddison
Australian National University 1 Jarryd Martin
Carnegie Mellon University 1 Noam Brown
ETH Zurich 1 Felix Berkenkamp
Foundational Research Institute 1 Caspar Oesterheld
Future of Humanity Institute 1 Sören Mindermann
Massachusetts Institute of Technology 1 Jon Gauthier
Stanford University 1 Aditi Raghunathan
Theiss Research 1 Paul Christiano
University of California, Berkeley 1 Michael Janner
University of Toronto 1 Roger Grosse