AI Watch

Welcome! This is a website to track people and organizations in the AI safety/alignment/AI existential risk communities. A position or organization being on AI Watch does not indicate an assessment that that position or organization is actually making AI safer or that the position or organization is good for the world in any way. It is mostly a sociological indication that the position or organization is associated with these communities, as well as an indication that the position or organization claims to be working on AI safety or alignment. (There are some plans to eventually introduce such assessments on AI Watch, but for now there are none.) See the code repository for the source code and data of this website.

This website is developed by Issa Rice with data contributions from Sebastian Sanchez, Amana Rice, and Vipul Naik, and has been partially funded by Vipul Naik and Mati Roy (who in July 2023 paid for the time Issa had spent answering people’s questions about AI Watch up until that point).

If you like (or want to like) this website and have money: the current funder is mostly only funding data updates to existing organizations as well as adding data for some new effective altruist organizations. As a result, the site is not getting any new features or improvements in design. If you want to bring this site to the next level, contact Issa at riceissa@gmail.com. What you get: site improvements, recognition in the site credits. What the site needs: money.

If you have time and want experience building websites: this website is looking for contributors. If you want to help out, contact Issa at riceissa@gmail.com. What you get: little or no pay (this could change if the site gets funding; see previous paragraph), recognition in the site credits, privilege of working with me, knowledge of the basics of web development (MySQL, PHP, Git). What the site needs: data collection/entry and website code improvements.

Last updated on 2024-04-15; see here for a full list of recent changes.

Table of contents

Agendas

Agenda name Associated people Associated organizations
Iterated amplification Paul Christiano, Buck Shlegeris, Dario Amodei OpenAI
Embedded agency Eliezer Yudkowsky, Scott Garrabrant, Abram Demski Machine Intelligence Research Institute
Comprehensive AI services Eric Drexler Future of Humanity Institute
Ambitious value learning Stuart Armstrong Future of Humanity Institute
Factored cognition Andreas Stuhlmüller Ought
Recursive reward modeling Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg Google DeepMind
Debate Paul Christiano OpenAI
Interpretability Christopher Olah
Inverse reinforcement learning
Preference learning
Cooperative inverse reinforcement learning
Imitation learning
Alignment for advanced machine learning systems Jessica Taylor, Eliezer Yudkowsky, Patrick LaVictoire, Andrew Critch Machine Intelligence Research Institute
Learning-theoretic AI alignment Vanessa Kosoy
Counterfactual reasoning Jacob Steinhardt

Positions grouped by person

Showing 0 people with positions.

Name Number of organizations List of organizations

Positions grouped by organization

Showing 5 organizations.

Organization Number of people List of people
OpenAI 332 Mada Aflak, Laura W., Yasuyoshi Sakamoto, Evan Weiss, Shuyuan Zhang, Ollie Jaffe, Amber Yore, Kleanthes K., Weiyi Zheng, Uğurcan Türkdoğan, Stewart Hall, Siyuan Fu, Pedram Keyani, John Rizzo, Tiffany C., Thomas Dimson, Francis Z., Enoch Cheung, CJ Minott, Ofir Nachum, Allan J., Wei An Lee, Ilan Bigio, Will Saborio, Erica T., Eric Rynerson, David Carr, Daniel Kappler, Anton Tananaev, Andrei Alexandru, Srinivas Narayanan, Brydon Eastman, Ali Kamali, Tina Miranda, Hisham Elhaddad, Justin B., David Medina, David Hengky, Michelle Pokrass, Tianhao Zheng, Adam Perelman, Jan Hendrik Kirchner, Hossem Ben Ayed, Cory Decareaux, Mati Roy, Akila Welihinda, Yaniv Markovski, Vishal Kuo, Steven Bills, Chester Cho, Adam Nace, Jessica Shieh, Eugene Wu, Oleg Mürk, Bogo Giertler , Karl Whitford Pollard, Tatiana Zolotova, Sully Chen, Ryan Peterson, Chaitra A., Arun Vijayvergiya, Juston Forte, Joanne Jang, Zarina Stanik, Rob Mallery, Dave Willner, Preston Tuggle, Austin Wiseman, Atqiya Abida Anjum, Angela Jiang, Adam Goldberg, Davit Khachatryan, Rajeev Nayak, Matthew Gentzel, Lama Ahmad, Giambattista Parascandolo, Richard Ngo, Carroll Wainwright, Anna Makanju, Sarah Shoker, Angie Luo, Vitchyr Pong, Victor Benito Garcia Rocha, Elie Georges, Rosie Campbell, Lukasz Kaiser, Lisa Dethridge, Vlad Ursu, Isabel Alves de Lima, Stefanie Biaggi, Johannes H., Sarthak Agrawal, Radhika Mathur, Kyle Kosic, Jason Kwon, Emanuele Marchiori, Natalie Summers, Tabarak Khan, Nicolas Norberto Corizzo, Bob Rotsted, Jesse Han, Ishant Singh, Hannah Wong, Che Chang, Zack Kass, Evan Morikawa, Sinith T., Shawn Jain, Diane Yoon, Lucas Negritto, Jonathan Gordon, Steven Adler, Maddie Simens, Tarun Gogineni, Phuong Vu, Philippe Tillet, Bram Adams, Adam Rhodes, Julián Santoro, Tyna Eloundou, Dave Cummings, Mo Bavarian, Theresa Lopez, Fotios Chantzis, Denny Jin, Joel Lehman, Raul Puri, Joost Huizinga, Red A., Emy Parparita, Kelly Sims, Arvind Neelakantan, Tim Yanchen Wang, Rachel Lim, Jeff Clune, Fraser Kelton, Roger Xu Jiang, Aris Konstantinidis, Jian O., Jacquelyn Lau, Tao Xu, Gretchen M. Krueger, Girish Sastry, Stanislas Polu, Cullen O"Keefe, Mario Saltarelli, Benjamin Mann, Luke Miller, Long Ouyang, Ife Riamah, Frances Choi, Richard Dunn, Peter Hoeschele, Nikolas Tezak, Alex Paino, Karson Elmgren, Jerry Tworek, Ilge Akkaya, Danny Hernandez, Christina Hendrickson, Maxim Sokolov, Jonathan Michaux, Yuhao Wan, Janet Brown, Fatma Tarlaci, Elynn Chen, Edgar Barraza, Nancy Otero, Bianca Martin, Ben Chess, Katie Mayer, Qiming Yuan, Mateusz Litwin, Tom Brown, Clemens Winter, Amanda Askell, Janine Korovesis, Daniela Amodei, Mikhail Pavlov, Lei Zhang, Justin Wang, Jacob Hilton, Todor Markov, Ian Atha, Maddie Hall, Jacob Jackson, Taehoon Kim, Brad Lightcap, Miles Brundage, Michał Staniszewski, Arthur Petron, Matt Mochary, Ingmar Kanitscheider, Gillian Hadfield, Christine McLeavey Payne, Nadja Rhodes, Munashe Shumba, Mira Murati, Karl Cobbe, Joshua Meier, Johannes Otterbach, Yilun Du, Xingyou (Richard) Song, Ifu Aniemeka, Holly Grimm, Hannah Davis, Sophia Arakelyan, Michael Petrov, Aravind Srinivas, Louis Cheong, Will Grathwohl, Hanjun Dai, Susan Zhang, Suchir Balaji, Erin Grant, Sam McCandlish, Sadhika Malladi, Peter Zhokhov, Aleksandar Botev, Henrique Ponde de Oliveira Pinto, Thomas Anthony, Rewon Child, Manuel Sherbakoff, Eric Sigler, Elena Chatziathanasiadou, Ryan Carey, Parnian Barekatain, Lilian Weng, Kevin Wong, Kaleo Hao, Glenn Powell, David Farhi, Remco Zwetsloot, Christy Dennison, Ashley C. Pilipiszyn, Mathew Shrwed, Adam Smets, David Luan, Maciej Chociej, Jonathan Ward, Jonathan Raiman, Phillip Isola, Nikhil Mishra, Bowen Baker, Alex Nichol, Larissa Schiavo, Karthik Narasimhan, Joshua Achiam, Yuping Luo, Christos Louizos, Cathy Wu, Brooke Chan, AlShaun Baksh, Aditya Grover, Jiaming Song, Jason Peng, Yang Liu, Xue Bin Peng, Trapit Bansal, Han Zhang, Dustin Tran, David Lansky, Matthias Plappert, Art Chaidarun, Rein Houthooft, Christopher Berner, Aleks Kamko, Jakub Pachocki, Yaroslav Bulatov, Richard Chen, Danielle Buma, Peter Welinder, Bob McGrew, Michael Page, Jonathan Ho, Tim Shi, Erika Reinhardt, Shariq Hashme, Jeremy Schlatter, Taco Cohen, Szymon Sidor, Desmond Henderson, Rachel Fong, Marie La, Louise Cabansay, Jonathan Hernandez, Jack Clark, Harri Edwards, Marika Allely, Ludwig Pettersson, Tambet Matiisen, Igor Mordatch, Filip Wolski, Catherine Olsson, Craig Quiter, Zain Shah, Scott Gray, Rafał Józefowicz, Pieter Abbeel, Linxi Fan, Kate Miltenberger, Jon Gauthier, Tyler Neylon, Paul Christiano, Marcin Andrychowicz, Jie Tang, Peter Chen, Prafulla Dhariwal, Jim Fan, Tim Salimans, Shivon Zilis, Jonas Schneider, Jeff Arnold, Alec Radford, Yuri Burda, Eric Price, Chris Clark, Ian Goodfellow, Rocky Duan, Andrej Karpathy, Trevor Blackwell, Ilya Sutskever, Wojciech Zaremba, Matt Krisiloff, John Schulman, Vicki Cheung, Greg Brockman, Durk Kingma, Lucy Qin, Jonathan Gray, Javier Gai, Helen Toner
FLOWERS 31 Alvaro Ovalle Castaneda, Florian Golemo, Benjamin Clément, Sébastien Forestier, William Schueller, Cem Karaoguz, Adrien Matricon, Céline Craye, Alexandra Delmas, Gennaro Raiola, Baptiste Busch, Panagiotis Papadakis, Thibaut Munzer, Guillaume Duceux, Louis-Charles Caron, Damien Caselli, Théo Segonds, Stéphanie Noirpoudre, Loïc Dauphin, Matthieu Lapeyre, Nicolas Rabault, Yoan Mollard, Pierre Rouanet, Nicolas Jahier, Didier Roy, Anna-Lisa Vollmer, Freek Stulp, Alexandre Gepperth, David Filliat, Manuel Lopes, Pierre-Yves Oudeyer
Whole Brain Architecture Initiative 9 Koji Morikawa, Hideyuki Nakashima, Hiroyuki Morikawa, Masaru Tomita, Kitano Hiroaki, Kenji Doya, Koichi Takahashi, Yutaka Matsuo, Hiroshi Yamakawa
Google Brain 1 Jeremy Nixon
Google DeepMind 1 Miljan Martic