AI Watch

Welcome! This website tracks people and organizations in the AI safety, alignment, and AI existential risk communities. Inclusion of a position or organization on AI Watch does not indicate an assessment that the position or organization is actually making AI safer, or that it is good for the world in any way. Inclusion is mostly a sociological indication that the position or organization is associated with these communities, and that it claims to be working on AI safety or alignment. (There are some plans to eventually introduce such assessments on AI Watch, but for now there are none.) See the code repository for the source code and data of this website.

This website is developed by Issa Rice with data contributions from Sebastian Sanchez, Amana Rice, and Vipul Naik, and has been partially funded by Vipul Naik and Mati Roy (who in July 2023 paid for the time Issa had spent answering people’s questions about AI Watch up until that point).

Last updated on 2026-05-30; see here for a full list of recent changes.

Agendas

| Agenda name | Associated people | Associated organizations |
|---|---|---|
| Iterated amplification | Paul Christiano, Buck Shlegeris, Dario Amodei | OpenAI |
| Embedded agency | Eliezer Yudkowsky, Scott Garrabrant, Abram Demski | Machine Intelligence Research Institute |
| Comprehensive AI services | Eric Drexler | Future of Humanity Institute |
| Ambitious value learning | Stuart Armstrong | Future of Humanity Institute |
| Factored cognition | Andreas Stuhlmüller | Ought |
| Recursive reward modeling | Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg | Google DeepMind |
| Debate | Paul Christiano | OpenAI |
| Interpretability | Christopher Olah | |
| Inverse reinforcement learning | | |
| Preference learning | | |
| Cooperative inverse reinforcement learning | | |
| Imitation learning | | |
| Alignment for advanced machine learning systems | Jessica Taylor, Eliezer Yudkowsky, Patrick LaVictoire, Andrew Critch | Machine Intelligence Research Institute |
| Learning-theoretic AI alignment | Vanessa Kosoy | |
| Counterfactual reasoning | Jacob Steinhardt | |

Positions grouped by person

Showing 0 people with positions.

| Name | Number of organizations | List of organizations |
|---|---|---|

Positions grouped by organization

Showing 5 organizations.

| Organization | Number of people | List of people |
|---|---|---|
| OpenAI | 330 | Stefanie Biaggi, Meghan Dorn, Jessica Shieh, Zico Kolter, Hannah Wong, Michael Chen, Jakub Pachocki, Rosie Campbell, Bianca Martin, Peter Hoeschele, Natalie Summers, Mike B., Tatiana Zolotova, Ryan Peterson, Evan Weiss, Yasuyoshi Sakamoto, Shuyuan Zhang, Mada Aflak, Laura W., Amber Yore, Austin Wiseman, Mati Roy, Weiyi Zheng, Uğurcan Türkdoğan, Stewart Hall, Siyuan Fu, Ollie Jaffe, Kleanthes K., Francis Z., Enoch Cheung, Tiffany C., Thomas Dimson, Peter Welinder, Pedram Keyani, John Rizzo, Allan J., Wei An Lee, Dave Willner, CJ Minott, Ofir Nachum, Ilan Bigio, Erica T., Eric Rynerson, Noam Brown, Will Saborio, Anna Makanju, David Carr, Daniel Kappler, Anton Tananaev, Lucas Negritto, Srinivas Narayanan, Andrei Alexandru, Brydon Eastman, Ali Kamali, Jacy Reese Anthis, David Medina, Tarun Gogineni, David Hengky, Tianhao Zheng, Michelle Pokrass, Adam Perelman, Bram Adams, Jan Hendrik Kirchner, Hossem Ben Ayed, Cory Decareaux, Mira Murati, Daniel Levy, Akila Welihinda, Yaniv Markovski, Steven Bills, Steven Adler, Vishal Kuo, Jerry Tworek, Adam Nace, Eugene Wu, Chester Cho, Janine Korovesis, Bogo Giertler, Daniel Kokotajlo, Sully Chen, Trapit Bansal, Davit Khachatryan, Oleg Mürk, Chaitra A., Arun Vijayvergiya, Helen Toner, Juston Forte, Joanne Jang, Atqiya Abida Anjum, Rob Mallery, Adam Goldberg, Angela Jiang, Preston Tuggle, Johannes H., Rajeev Nayak, Luke Miller, Lama Ahmad, Matthew Gentzel, Carroll Wainwright, Richard Ngo, Sarah Shoker, Angie Luo, Vitchyr Pong, Elie Georges, Victor Benito Garcia Rocha, Isabel Alves de Lima, Vlad Ursu, Lukasz Kaiser, Lisa Dethridge, Sarthak Agrawal, Jason Kwon, Emanuele Marchiori, Radhika Mathur, Kyle Kosic, Tabarak Khan, Ishant Singh, Bob Rotsted, Nicolas Norberto Corizzo, Jesse Han, Giambattista Parascandolo, Che Chang, Zack Kass, Evan Morikawa, Diane Yoon, Jonathan Gordon, Maddie Simens, Julián Santoro, Adam Rhodes, Phuong Vu, Tyna Eloundou, Philippe Tillet, Fotios Chantzis, Dave Cummings, Theresa Lopez, Mo Bavarian, Joel Lehman, Denny Jin, Raul Puri, Joost Huizinga, Red A., Emy Parparita, Kelly Sims, Arvind Neelakantan, Tim Yanchen Wang, Jeff Clune, Shivon Zilis, Fraser Kelton, Rachel Lim, Jacob Jackson, Aris Konstantinidis, Roger Xu Jiang, Jian O., Tao Xu, Nikolas Tezak, Stanislas Polu, Gretchen M. Krueger, Girish Sastry, Cullen O'Keefe, Mario Saltarelli, Benjamin Mann, Frances Choi, Long Ouyang, Ife Riamah, Justin Wang, Alex Paino, Karson Elmgren, Ilge Akkaya, Janet Brown, Elynn Chen, Danny Hernandez, Edgar Barraza, Jonathan Michaux, Maxim Sokolov, Fatma Tarlaci, Christina Hendrickson, Yuhao Wan, Ben Chess, Nancy Otero, Katie Mayer, Tom Brown, Qiming Yuan, Mateusz Litwin, Clemens Winter, Amanda Askell, Jacob Hilton, Daniela Amodei, Ian Atha, Todor Markov, Mikhail Pavlov, Lei Zhang, Taehoon Kim, Maddie Hall, Brad Lightcap, Arthur Petron, Gillian Hadfield, Bob McGrew, Matt Mochary, Miles Brundage, Michał Staniszewski, Ingmar Kanitscheider, Xingyou (Richard) Song, Holly Grimm, Hannah Davis, Ifu Aniemeka, Yilun Du, Johannes Otterbach, Will Rice, Christine McLeavey Payne, Nadja Rhodes, Munashe Shumba, Sophia Arakelyan, Karl Cobbe, Joshua Meier, Hanjun Dai, Aravind Srinivas, Sam McCandlish, Erin Grant, Sadhika Malladi, Will Grathwohl, Michael Petrov, Susan Zhang, Suchir Balaji, Aleksandar Botev, Peter Zhokhov, Thomas Anthony, Henrique Ponde de Oliveira Pinto, Elena Chatziathanasiadou, Manuel Sherbakoff, Rewon Child, Julia Galef, Glenn Powell, Ryan Carey, Parnian Barekatain, Lilian Weng, Kevin Wong, Kaleo Hao, David Farhi, Adam Smets, Christy Dennison, Ashley C. Pilipiszyn, Remco Zwetsloot, Naomi Bashkansky, Mathew Shrwed, David Luan, Maciej Chociej, Jonathan Ward, Jonathan Raiman, Bowen Baker, Alex Nichol, Phillip Isola, Nikhil Mishra, Larissa Schiavo, Karthik Narasimhan, Yuping Luo, Joshua Achiam, AlShaun Baksh, Christos Louizos, Cathy Wu, Aditya Grover, Brooke Chan, Jiaming Song, Xue Bin Peng, Han Zhang, Dustin Tran, Jason Peng, Yang Liu, Art Chaidarun, David Lansky, Matthias Plappert, Rein Houthooft, Christopher Berner, Aleks Kamko, Prafulla Dhariwal, Yaroslav Bulatov, Danielle Buma, Michael Page, Shariq Hashme, Erika Reinhardt, Richard Chen, Jonathan Ho, Tim Shi, Filip Wolski, Jeremy Schlatter, Taco Cohen, Szymon Sidor, Jack Clark, Harri Edwards, Marie La, Desmond Henderson, Louise Cabansay, Jonathan Hernandez, Marika Allely, Tambet Matiisen, Ludwig Pettersson, Igor Mordatch, Yuri Burda, Catherine Olsson, Craig Quiter, Zain Shah, Scott Gray, Kate Miltenberger, Jon Gauthier, Rafał Józefowicz, Pieter Abbeel, Tyler Neylon, Linxi Fan, Marcin Andrychowicz, Paul Christiano, Jie Tang, Peter Chen, Eric Price, Tim Salimans, Jim Fan, Jeff Arnold, Jonas Schneider, Alec Radford, Ian Goodfellow, Chris Clark, Rocky Duan, Ilya Sutskever, Andrej Karpathy, Trevor Blackwell, Wojciech Zaremba, Durk Kingma, Vicki Cheung, Matt Krisiloff, John Schulman, Greg Brockman, Lucy Qin, Jonathan Gray |
| Flowers Laboratory | 39 | Alvaro Ovalle Castaneda, Anna-Lisa Vollmer, Stéphanie Noirpoudre, Loïc Dauphin, Florian Golemo, Sébastien Forestier, William Schueller, Cem Karaoguz, Nicolas Rabault, Matthieu Lapeyre, Pierre Rouanet, Nicolas Jahier, Didier Roy, Alexandre Gepperth, Adrien Matricon, Céline Craye, Alexandra Delmas, Gennaro Raiola, Baptiste Busch, Panagiotis Papadakis, Yoan Mollard, Thibaut Munzer, Freek Stulp, Thomas Degris, Jonathan Grizou, Guillaume Duceux, Louis-Charles Caron, Manuel Lopes, Olivier Mangin, Natalia Lyubova, Olivier Ly, Fabien Benureau, Paul Fudal, Thomas Cederborg, Adrien Baranes, Jérome Béchu, Pierre-Yves Oudeyer, Damien Caselli, Théo Segonds |
| Whole Brain Architecture Initiative | 9 | Koji Morikawa, Hideyuki Nakashima, Hiroyuki Morikawa, Masaru Tomita, Kitano Hiroaki, Kenji Doya, Koichi Takahashi, Yutaka Matsuo, Hiroshi Yamakawa |
| Google Brain | 1 | Jeremy Nixon |
| Google DeepMind | 1 | Miljan Martic |