AI Watch

Welcome! This is a website to track people and organizations in the AI safety/alignment/AI existential risk communities. A position or organization being on AI Watch does not indicate an assessment that that position or organization is actually making AI safer or that the position or organization is good for the world in any way. It is mostly a sociological indication that the position or organization is associated with these communities, as well as an indication that the position or organization claims to be working on AI safety or alignment. (There are some plans to eventually introduce such assessments on AI Watch, but for now there are none.) See the code repository for the source code and data of this website.

This website is developed by Issa Rice with data contributions from Sebastian Sanchez, Amana Rice, and Vipul Naik, and has been partially funded by Vipul Naik and Mati Roy (who in July 2023 paid for the time Issa had spent answering people’s questions about AI Watch up until that point).

Last updated on 2026-03-29; see here for a full list of recent changes.

Table of contents

Agendas

Agenda name Associated people Associated organizations
Iterated amplification Paul Christiano, Buck Shlegeris, Dario Amodei OpenAI
Embedded agency Eliezer Yudkowsky, Scott Garrabrant, Abram Demski Machine Intelligence Research Institute
Comprehensive AI services Eric Drexler Future of Humanity Institute
Ambitious value learning Stuart Armstrong Future of Humanity Institute
Factored cognition Andreas Stuhlmüller Ought
Recursive reward modeling Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg Google DeepMind
Debate Paul Christiano OpenAI
Interpretability Christopher Olah
Inverse reinforcement learning
Preference learning
Cooperative inverse reinforcement learning
Imitation learning
Alignment for advanced machine learning systems Jessica Taylor, Eliezer Yudkowsky, Patrick LaVictoire, Andrew Critch Machine Intelligence Research Institute
Learning-theoretic AI alignment Vanessa Kosoy
Counterfactual reasoning Jacob Steinhardt

Positions grouped by person

Showing 0 people with positions.

Name Number of organizations List of organizations

Positions grouped by organization

Showing 5 organizations.

Organization Number of people List of people
OpenAI 330 Meghan Dorn, Jessica Shieh, Zico Kolter, Stefanie Biaggi, CJ Minott, Michael Chen, Rosie Campbell, Peter Hoeschele, Mike B., Christopher Berner, Yasuyoshi Sakamoto, Shuyuan Zhang, Mada Aflak, Laura W., Evan Weiss, Mati Roy, Weiyi Zheng, Uğurcan Türkdoğan, Stewart Hall, Siyuan Fu, Ollie Jaffe, Kleanthes K., Amber Yore, Austin Wiseman, Tiffany C., Thomas Dimson, Pedram Keyani, John Rizzo, Francis Z., Enoch Cheung, Wei An Lee, Ofir Nachum, Ilan Bigio, Allan J., Will Saborio, Erica T., Eric Rynerson, Noam Brown, Lucas Negritto, David Carr, Daniel Kappler, Anton Tananaev, Todor Markov, Srinivas Narayanan, Andrei Alexandru, Cory Decareaux, Brydon Eastman, Ali Kamali, Jacy Reese Anthis, Tarun Gogineni, David Medina, David Hengky, Tianhao Zheng, Michelle Pokrass, Adam Perelman, Bram Adams, Jan Hendrik Kirchner, Hossem Ben Ayed, Mira Murati, Vishal Kuo, Daniel Levy, Akila Welihinda, Yaniv Markovski, Steven Bills, Steven Adler, Chester Cho, Adam Nace, Eugene Wu, Davit Khachatryan, Oleg Mürk, Bogo Giertler , Daniel Kokotajlo, Tatiana Zolotova, Sully Chen, Ryan Peterson, Helen Toner, Juston Forte, Joanne Jang, Chaitra A., Arun Vijayvergiya, Angela Jiang, Preston Tuggle, Dave Willner, Atqiya Abida Anjum, Rob Mallery, Adam Goldberg, Rajeev Nayak, Lama Ahmad, Matthew Gentzel, Sarah Shoker, Carroll Wainwright, Anna Makanju, Richard Ngo, Vitchyr Pong, Victor Benito Garcia Rocha, Elie Georges, Angie Luo, Vlad Ursu, Lukasz Kaiser, Lisa Dethridge, Isabel Alves de Lima, Johannes H., Sarthak Agrawal, Radhika Mathur, Kyle Kosic, Jason Kwon, Emanuele Marchiori, Tabarak Khan, Natalie Summers, Nicolas Norberto Corizzo, Jesse Han, Ishant Singh, Hannah Wong, Bob Rotsted, Giambattista Parascandolo, Che Chang, Zack Kass, Evan Morikawa, Miles Brundage, Jonathan Gordon, Maddie Simens, Suchir Balaji, Peter Welinder, Tyna Eloundou, Philippe Tillet, Julián Santoro, Adam Rhodes, Phuong Vu, Theresa Lopez, Mo Bavarian, Fotios Chantzis, Dave Cummings, Joel Lehman, Denny Jin, Raul Puri, Joost Huizinga, Red A., Emy Parparita, Kelly Sims, Tim Yanchen Wang, Arvind Neelakantan, Rachel Lim, Jeff Clune, Shivon Zilis, Fraser Kelton, Jian O., Aris Konstantinidis, Roger Xu Jiang, Tao Xu, Nikolas Tezak, Stanislas Polu, Gretchen M. Krueger, Mario Saltarelli, Girish Sastry, Cullen O"Keefe, Luke Miller, Benjamin Mann, Long Ouyang, Ife Riamah, Frances Choi, Karson Elmgren, Ilge Akkaya, Jerry Tworek, Alex Paino, Szymon Sidor, Janet Brown, Elynn Chen, Danny Hernandez, Edgar Barraza, Jonathan Michaux, Maxim Sokolov, Fatma Tarlaci, Christina Hendrickson, Yuhao Wan, Nancy Otero, Katie Mayer, Bianca Martin, Ben Chess, Qiming Yuan, Mateusz Litwin, Tom Brown, Janine Korovesis, Clemens Winter, Amanda Askell, Mikhail Pavlov, Lei Zhang, Jacob Hilton, Justin Wang, Daniela Amodei, Ian Atha, Taehoon Kim, Maddie Hall, Jacob Jackson, Gillian Hadfield, Matt Mochary, Michał Staniszewski, Ingmar Kanitscheider, Brad Lightcap, Arthur Petron, Nadja Rhodes, Munashe Shumba, Sophia Arakelyan, Karl Cobbe, Joshua Meier, Xingyou (Richard) Song, Holly Grimm, Hannah Davis, Ifu Aniemeka, Yilun Du, Johannes Otterbach, Will Rice, Christine McLeavey Payne, Michael Petrov, Susan Zhang, Hanjun Dai, Aravind Srinivas, Sam McCandlish, Erin Grant, Sadhika Malladi, Will Grathwohl, Peter Zhokhov, Thomas Anthony, Henrique Ponde de Oliveira Pinto, Aleksandar Botev, Elena Chatziathanasiadou, Manuel Sherbakoff, Diane Yoon, Rewon Child, Julia Galef, Parnian Barekatain, Lilian Weng, Kevin Wong, Kaleo Hao, Glenn Powell, Ryan Carey, Naomi Bashkansky, Mathew Shrwed, David Farhi, Adam Smets, Christy Dennison, Ashley C. Pilipiszyn, Remco Zwetsloot, David Luan, Maciej Chociej, Jonathan Ward, Jonathan Raiman, Nikhil Mishra, Larissa Schiavo, Karthik Narasimhan, Bowen Baker, Alex Nichol, Phillip Isola, Joshua Achiam, Yuping Luo, Brooke Chan, Jiaming Song, AlShaun Baksh, Christos Louizos, Cathy Wu, Aditya Grover, Xue Bin Peng, Han Zhang, Dustin Tran, Jason Peng, Trapit Bansal, Yang Liu, Art Chaidarun, Matthias Plappert, David Lansky, Rein Houthooft, Jakub Pachocki, Aleks Kamko, Yaroslav Bulatov, Tim Shi, Danielle Buma, Jonathan Ho, Michael Page, Bob McGrew, Shariq Hashme, Erika Reinhardt, Richard Chen, Taco Cohen, Filip Wolski, Jeremy Schlatter, Louise Cabansay, Jonathan Hernandez, Jack Clark, Harri Edwards, Marie La, Desmond Henderson, Tambet Matiisen, Ludwig Pettersson, Marika Allely, Igor Mordatch, Yuri Burda, Catherine Olsson, Scott Gray, Craig Quiter, Zain Shah, Tyler Neylon, Linxi Fan, Kate Miltenberger, Jon Gauthier, Rafał Józefowicz, Pieter Abbeel, Paul Christiano, Jie Tang, Marcin Andrychowicz, Peter Chen, Eric Price, Tim Salimans, Jim Fan, Prafulla Dhariwal, Jeff Arnold, Jonas Schneider, Alec Radford, Ian Goodfellow, Chris Clark, Rocky Duan, Trevor Blackwell, Wojciech Zaremba, Ilya Sutskever, Andrej Karpathy, Vicki Cheung, Matt Krisiloff, John Schulman, Greg Brockman, Durk Kingma, Lucy Qin, Jonathan Gray
Flowers Laboratory 39 Alvaro Ovalle Castaneda, Anna-Lisa Vollmer, Stéphanie Noirpoudre, Loïc Dauphin, Florian Golemo, Sébastien Forestier, William Schueller, Cem Karaoguz, Nicolas Rabault, Matthieu Lapeyre, Pierre Rouanet, Nicolas Jahier, Didier Roy, Alexandre Gepperth, Adrien Matricon, Céline Craye, Alexandra Delmas, Gennaro Raiola, Baptiste Busch, Panagiotis Papadakis, Yoan Mollard, Thibaut Munzer, Freek Stulp, Thomas Degris, Jonathan Grizou, Guillaume Duceux, Louis-Charles Caron, Manuel Lopes, Olivier Mangin, Natalia Lyubova, Olivier Ly, Fabien Benureau, Paul Fudal, Thomas Cederborg, Adrien Baranes, Jérome Béchu, Pierre-Yves Oudeyer, Damien Caselli, Théo Segonds
Whole Brain Architecture Initiative 9 Koji Morikawa, Hideyuki Nakashima, Hiroyuki Morikawa, Masaru Tomita, Kitano Hiroaki, Kenji Doya, Koichi Takahashi, Yutaka Matsuo, Hiroshi Yamakawa
Google Brain 1 Jeremy Nixon
Google DeepMind 1 Miljan Martic