AI Watch

Welcome! This is a website to track people and organizations in the AI safety/alignment/AI existential risk communities. A position or organization being on AI Watch does not indicate an assessment that that position or organization is actually making AI safer or that the position or organization is good for the world in any way. It is mostly a sociological indication that the position or organization is associated with these communities, as well as an indication that the position or organization claims to be working on AI safety or alignment. (There are some plans to eventually introduce such assessments on AI Watch, but for now there are none.) See the code repository for the source code and data of this website.

This website is developed by Issa Rice with data contributions from Sebastian Sanchez, Amana Rice, and Vipul Naik, and has been partially funded by Vipul Naik and Mati Roy (who in July 2023 paid for the time Issa had spent answering people’s questions about AI Watch up until that point).

Last updated on 2025-11-01; see here for a full list of recent changes.

Table of contents

Agendas

Agenda name Associated people Associated organizations
Iterated amplification Paul Christiano, Buck Shlegeris, Dario Amodei OpenAI
Embedded agency Eliezer Yudkowsky, Scott Garrabrant, Abram Demski Machine Intelligence Research Institute
Comprehensive AI services Eric Drexler Future of Humanity Institute
Ambitious value learning Stuart Armstrong Future of Humanity Institute
Factored cognition Andreas Stuhlmüller Ought
Recursive reward modeling Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg Google DeepMind
Debate Paul Christiano OpenAI
Interpretability Christopher Olah
Inverse reinforcement learning
Preference learning
Cooperative inverse reinforcement learning
Imitation learning
Alignment for advanced machine learning systems Jessica Taylor, Eliezer Yudkowsky, Patrick LaVictoire, Andrew Critch Machine Intelligence Research Institute
Learning-theoretic AI alignment Vanessa Kosoy
Counterfactual reasoning Jacob Steinhardt

Positions grouped by person

Showing 0 people with positions.

Name Number of organizations List of organizations

Positions grouped by organization

Showing 5 organizations.

Organization Number of people List of people
OpenAI 330 Meghan Dorn, Jessica Shieh, Zico Kolter, Stefanie Biaggi, Michael Chen, CJ Minott, Rosie Campbell, Peter Hoeschele, Natalie Summers, Mike B., Christopher Berner, Yasuyoshi Sakamoto, Shuyuan Zhang, Mada Aflak, Laura W., Evan Weiss, Mati Roy, Weiyi Zheng, Uğurcan Türkdoğan, Stewart Hall, Siyuan Fu, Ollie Jaffe, Kleanthes K., Amber Yore, Austin Wiseman, Tiffany C., Thomas Dimson, Peter Welinder, Pedram Keyani, John Rizzo, Francis Z., Enoch Cheung, Adam Goldberg, Wei An Lee, Ofir Nachum, Ilan Bigio, Allan J., Noam Brown, Will Saborio, Erica T., Eric Rynerson, Lucas Negritto, David Carr, Daniel Kappler, Anton Tananaev, Todor Markov, Srinivas Narayanan, Andrei Alexandru, Jacy Reese Anthis, Cory Decareaux, Brydon Eastman, Ali Kamali, Tarun Gogineni, David Medina, David Hengky, Tianhao Zheng, Michelle Pokrass, Adam Perelman, Bram Adams, Jan Hendrik Kirchner, Hossem Ben Ayed, Mira Murati, Vishal Kuo, Daniel Levy, Akila Welihinda, Yaniv Markovski, Steven Bills, Steven Adler, Chester Cho, Adam Nace, Eugene Wu, Davit Khachatryan, Oleg Mürk, Bogo Giertler , Daniel Kokotajlo, Tatiana Zolotova, Sully Chen, Ryan Peterson, Helen Toner, Juston Forte, Joanne Jang, Chaitra A., Arun Vijayvergiya, Angela Jiang, Preston Tuggle, Dave Willner, Atqiya Abida Anjum, Rob Mallery, Rajeev Nayak, Lama Ahmad, Matthew Gentzel, Sarah Shoker, Carroll Wainwright, Anna Makanju, Richard Ngo, Vitchyr Pong, Victor Benito Garcia Rocha, Elie Georges, Angie Luo, Vlad Ursu, Lukasz Kaiser, Lisa Dethridge, Isabel Alves de Lima, Johannes H., Sarthak Agrawal, Radhika Mathur, Kyle Kosic, Jason Kwon, Emanuele Marchiori, Tabarak Khan, Nicolas Norberto Corizzo, Jesse Han, Ishant Singh, Hannah Wong, Bob Rotsted, Giambattista Parascandolo, Che Chang, Zack Kass, Evan Morikawa, Jonathan Gordon, Maddie Simens, Suchir Balaji, Phuong Vu, Tyna Eloundou, Philippe Tillet, Julián Santoro, Adam Rhodes, Theresa Lopez, Mo Bavarian, Fotios Chantzis, Dave Cummings, Joel Lehman, Denny Jin, Raul Puri, Joost Huizinga, Red A., Emy Parparita, Kelly Sims, Tim Yanchen Wang, Arvind Neelakantan, Rachel Lim, Jeff Clune, Shivon Zilis, Fraser Kelton, Jian O., Aris Konstantinidis, Roger Xu Jiang, Tao Xu, Nikolas Tezak, Stanislas Polu, Gretchen M. Krueger, Mario Saltarelli, Girish Sastry, Cullen O"Keefe, Luke Miller, Benjamin Mann, Long Ouyang, Ife Riamah, Frances Choi, Karson Elmgren, Ilge Akkaya, Jerry Tworek, Alex Paino, Szymon Sidor, Yuhao Wan, Janet Brown, Elynn Chen, Danny Hernandez, Edgar Barraza, Jonathan Michaux, Maxim Sokolov, Fatma Tarlaci, Christina Hendrickson, Nancy Otero, Katie Mayer, Bianca Martin, Ben Chess, Qiming Yuan, Mateusz Litwin, Tom Brown, Janine Korovesis, Clemens Winter, Amanda Askell, Mikhail Pavlov, Lei Zhang, Jacob Hilton, Justin Wang, Daniela Amodei, Ian Atha, Taehoon Kim, Maddie Hall, Jacob Jackson, Gillian Hadfield, Matt Mochary, Miles Brundage, Michał Staniszewski, Ingmar Kanitscheider, Brad Lightcap, Arthur Petron, Nadja Rhodes, Munashe Shumba, Sophia Arakelyan, Karl Cobbe, Joshua Meier, Xingyou (Richard) Song, Holly Grimm, Hannah Davis, Ifu Aniemeka, Yilun Du, Johannes Otterbach, Will Rice, Christine McLeavey Payne, Will Grathwohl, Michael Petrov, Susan Zhang, Hanjun Dai, Aravind Srinivas, Sam McCandlish, Erin Grant, Sadhika Malladi, Peter Zhokhov, Thomas Anthony, Henrique Ponde de Oliveira Pinto, Aleksandar Botev, Elena Chatziathanasiadou, Manuel Sherbakoff, Diane Yoon, Rewon Child, Julia Galef, Parnian Barekatain, Lilian Weng, Kevin Wong, Kaleo Hao, Glenn Powell, Ryan Carey, Naomi Bashkansky, Mathew Shrwed, David Farhi, Adam Smets, Christy Dennison, Ashley C. Pilipiszyn, Remco Zwetsloot, David Luan, Maciej Chociej, Jonathan Ward, Jonathan Raiman, Phillip Isola, Nikhil Mishra, Larissa Schiavo, Karthik Narasimhan, Bowen Baker, Alex Nichol, Joshua Achiam, Yuping Luo, Brooke Chan, Jiaming Song, AlShaun Baksh, Christos Louizos, Cathy Wu, Aditya Grover, Yang Liu, Xue Bin Peng, Han Zhang, Dustin Tran, Jason Peng, Trapit Bansal, Art Chaidarun, Matthias Plappert, David Lansky, Rein Houthooft, Jakub Pachocki, Aleks Kamko, Yaroslav Bulatov, Tim Shi, Danielle Buma, Jonathan Ho, Michael Page, Bob McGrew, Shariq Hashme, Erika Reinhardt, Richard Chen, Taco Cohen, Filip Wolski, Jeremy Schlatter, Louise Cabansay, Jonathan Hernandez, Jack Clark, Harri Edwards, Marie La, Desmond Henderson, Tambet Matiisen, Ludwig Pettersson, Marika Allely, Igor Mordatch, Yuri Burda, Catherine Olsson, Zain Shah, Scott Gray, Craig Quiter, Pieter Abbeel, Tyler Neylon, Linxi Fan, Kate Miltenberger, Jon Gauthier, Rafał Józefowicz, Paul Christiano, Jie Tang, Marcin Andrychowicz, Peter Chen, Eric Price, Tim Salimans, Jim Fan, Prafulla Dhariwal, Jeff Arnold, Jonas Schneider, Alec Radford, Ian Goodfellow, Chris Clark, Rocky Duan, Trevor Blackwell, Wojciech Zaremba, Ilya Sutskever, Andrej Karpathy, Vicki Cheung, Matt Krisiloff, John Schulman, Greg Brockman, Durk Kingma, Lucy Qin, Jonathan Gray
Flowers Laboratory 39 Alvaro Ovalle Castaneda, Anna-Lisa Vollmer, Stéphanie Noirpoudre, Loïc Dauphin, Florian Golemo, Sébastien Forestier, William Schueller, Cem Karaoguz, Nicolas Rabault, Matthieu Lapeyre, Pierre Rouanet, Nicolas Jahier, Didier Roy, Alexandre Gepperth, Adrien Matricon, Céline Craye, Alexandra Delmas, Gennaro Raiola, Baptiste Busch, Panagiotis Papadakis, Yoan Mollard, Thibaut Munzer, Freek Stulp, Guillaume Duceux, Thomas Degris, Jonathan Grizou, Louis-Charles Caron, Manuel Lopes, Paul Fudal, Olivier Mangin, Natalia Lyubova, Olivier Ly, Fabien Benureau, Thomas Cederborg, Pierre-Yves Oudeyer, Adrien Baranes, Jérome Béchu, Damien Caselli, Théo Segonds
Whole Brain Architecture Initiative 9 Koji Morikawa, Hideyuki Nakashima, Hiroyuki Morikawa, Masaru Tomita, Kitano Hiroaki, Kenji Doya, Koichi Takahashi, Yutaka Matsuo, Hiroshi Yamakawa
Google Brain 1 Jeremy Nixon
Google DeepMind 1 Miljan Martic