AI Watch

Welcome! This is a website to track people and organizations working on AI safety. See the code repository for the source code and data of this website.

This website is developed by Issa Rice with data contributions from Sebastian Sanchez, Amana Rice, and Vipul Naik, and has been partially funded by Vipul Naik and Mati Roy (who in July 2023 paid for the time Issa had spent answering people’s questions about AI Watch up until that point).

If you like (or want to like) this website and have money: the current funder is mostly funding only data updates for existing organizations and the addition of data for some new effective altruist organizations. As a result, the site is not getting any new features or design improvements. If you want to bring this site to the next level, contact Issa at riceissa@gmail.com. What you get: site improvements and recognition in the site credits. What the site needs: money.

If you have time and want experience building websites: this website is looking for contributors. If you want to help out, contact Issa at riceissa@gmail.com. What you get: little or no pay (this could change if the site gets funding; see the previous paragraph), recognition in the site credits, the privilege of working with Issa, and knowledge of the basics of web development (MySQL, PHP, Git). What the site needs: data collection/entry and website code improvements.

Last updated on 2023-09-07; see here for a full list of recent changes.

Agendas

Agenda name | Associated people | Associated organizations
Iterated amplification | Paul Christiano, Buck Shlegeris, Dario Amodei | OpenAI
Embedded agency | Eliezer Yudkowsky, Scott Garrabrant, Abram Demski | Machine Intelligence Research Institute
Comprehensive AI services | Eric Drexler | Future of Humanity Institute
Ambitious value learning | Stuart Armstrong | Future of Humanity Institute
Factored cognition | Andreas Stuhlmüller | Ought
Recursive reward modeling | Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg | Google DeepMind
Debate | Paul Christiano | OpenAI
Interpretability | Christopher Olah |
Inverse reinforcement learning | |
Preference learning | |
Cooperative inverse reinforcement learning | |
Imitation learning | |
Alignment for advanced machine learning systems | Jessica Taylor, Eliezer Yudkowsky, Patrick LaVictoire, Andrew Critch | Machine Intelligence Research Institute
Learning-theoretic AI alignment | Vanessa Kosoy |
Counterfactual reasoning | Jacob Steinhardt |

Positions grouped by person

Showing 0 people with positions.

Name | Number of organizations | List of organizations

Positions grouped by organization

Showing 5 organizations.

Organization | Number of people | List of people
OpenAI | 283 | Yaniv Markovski, Vishal Kuo, Steven Bills, Daniel Levy, Jessica Shieh, Eugene Wu, Dave Willner, Chester Cho, Adam Nace, Tatiana Zolotova, Sully Chen, Ryan Peterson, Adam Rhodes, Oleg Mürk, Juston Forte, Joanne Jang, Zarina Stanik, Tolly Powell, Chaitra A., Rob Mallery, Austin Wiseman, Preston Tuggle, Adam Goldberg, Matthew Gentzel, Lama Ahmad, Gretchen M. Krueger, Giambattista Parascandolo, Davit Khachatryan, Rajeev Nayak, Sarah Shoker, Carroll Wainwright, Richard Ngo, Anna Makanju, Vitchyr Pong, Victor Benito Garcia Rocha, Elie Georges, Angie Luo, Lukasz Kaiser, Lisa Dethridge, Isabel Alves de Lima, Vlad Ursu, Stefanie Biaggi, Rosie Campbell, Lucas Negritto, Johannes H., Sarthak Agrawal, Kyle Kosic, Karl Whitford Pollard, Jason Kwon, Gualberto Briceño, Emanuele Marchiori, Radhika Mathur, Frances Choi, Tabarak Khan, Natalie Summers, Jonathan Ward, Jesse Han, Jade Leung, Ishant Singh, Hannah Wong, Cullen O'Keefe, Bob Rotsted, Miles Brundage, Lilian Weng, Zack Kass, Steven Adler, Shawn Jain, Che Chang, Jonathan Gordon, Maddie Simens, Julián Santoro, Tyna Eloundou, Bram Adams, Phuong Vu, Philippe Tillet, Mo Bavarian, Fotios Chantzis, Theresa Lopez, Dave Cummings, Joel Lehman, Denny Jin, Joost Huizinga, Fraser Kelton, Raul Puri, Robert B. Brodsky, Red A., Emy Parparita, Kelly Sims, Tim Yanchen Wang, Rachelle F., Arvind Neelakantan, Jeff Clune, Peter Welinder, Christina Hendrickson, Roger Xu Jiang, Aris Konstantinidis, Girish Sastry, Tao Xu, Stanislas Polu, Nikolas Tezak, Mario Saltarelli, Luke Miller, Long Ouyang, Ife Riamah, Diane Yoon, Karson Elmgren, Jerry Tworek, Ilge Akkaya, Alex Paino, Maxim Sokolov, Jonathan Michaux, Janet Brown, Helen (Mengxin) Ji, Yuhao Wan, Fatma Tarlaci, Elynn Chen, Edgar Barraza, Danny Hernandez, Nancy Otero, Bianca Martin, Ben Chess, Mateusz Litwin, Johannes Otterbach, Qiming Yuan, Janine Korovesis, Clemens Winter, Amanda Askell, Mikhail Pavlov, Lei Zhang, Justin Wang, Jacob Hilton, Jack Clark, Ian Atha, Todor Markov, Maddie Hall, Jacob Jackson, Taehoon Kim, Michał Staniszewski, Matt Mochary, Ingmar Kanitscheider, Gillian Hadfield, Christine McLeavey Payne, Brad Lightcap, Arthur Petron, Mira Murati, Karl Cobbe, Josh Meier, Ifu Aniemeka, Holly Grimm, Hannah Davis, Yilun Du, Xingyou (Richard) Song, Will Rice, Dolapo Martins, Sophia Arakelyan, Nadja Rhodes, Munashe Shumba, Michael Petrov, Hanjun Dai, Will Grathwohl, Erin Grant, Susan Zhang, Suchir Balaji, Sam McCandlish, Sadhika Malladi, Aravind Srinivas, Thomas Anthony, Peter Zhokhov, Aleksandar Botev, Manuel Sherbakoff, Eric Sigler, Elena Chatziathanasiadou, Rewon Child, Kevin Wong, Kaleo Hao, Glenn Powell, Ryan Carey, Parnian Barekatain, Mathew Shrwed, David Farhi, Christy Dennison, Remco Zwetsloot, Ashley C. Pilipiszyn, Adam Smets, David Luan, Maciej Chociej, Jonathan Raiman, Larissa Schiavo, Karthik Narasimhan, Bowen Baker, Phillip Isola, Alex Nichol, Nikhil Mishra, Joshua Achiam, Yuping Luo, Jiaming Song, Christos Louizos, Cathy Wu, Brooke Chan, AlShaun Baksh, Aditya Grover, Jason Peng, Han Zhang, Yang Liu, Trapit Bansal, Dustin Tran, Matthias Plappert, David Lansky, Art Chaidarun, Christopher Berner, Rein Houthooft, Benjamin Mann, Aleks Kamko, Jonathan Gray, Jakub Pachocki, Yaroslav Bulatov, Ankur Handa, Michael Page, Erika Reinhardt, Tim Shi, Danielle Buma, Shariq Hashme, Bob McGrew, Jeremy Schlatter, Taco Cohen, Szymon Sidor, Meredith Blankenship, Marie La, Louise Cabansay, Jonathan Hernandez, Harri Edwards, Desmond Henderson, Rachel Fong, Marika Allely, Ludwig Pettersson, Tambet Matiisen, Craig Quiter, Alexander Skidanov, Alexander Ray, Igor Mordatch, Filip Wolski, Catherine Olsson, Zain Shah, Gavan Woolery, Scott Gray, Kate Miltenberger, Jon Gauthier, Tyler Neylon, Rafał Józefowicz, Marcin Andrychowicz, Jie Tang, Prafulla Dhariwal, Paul Christiano, Tim Salimans, Shivon Zilis, Jonas Schneider, Jeff Arnold, Linxi Fan, Jonathan Ho, Yura Burda, Eric Price, Peter Chen, Alec Radford, Ian Goodfellow, Chris Clark, Rocky Duan, Ilya Sutskever, Wojciech Zaremba, Trevor Blackwell, Tom Brown, Pieter Abbeel, Andrej Karpathy, Matt Krisiloff, John Schulman, Greg Brockman, Vicki Cheung, Durk Kingma, Pamela Vagata, Pavan Sharma, Henrique Pondé, Helen Toner, Daniela Amodei, Smitha Milli
FLOWERS | 31 | Alvaro Ovalle Castaneda, Florian Golemo, Benjamin Clément, Sébastien Forestier, William Schueller, Cem Karaoguz, Adrien Matricon, Céline Craye, Alexandra Delmas, Gennaro Raiola, Baptiste Busch, Panagiotis Papadakis, Thibaut Munzer, Guillaume Duceux, Louis-Charles Caron, Damien Caselli, Théo Segonds, Stéphanie Noirpoudre, Loïc Dauphin, Matthieu Lapeyre, Nicolas Rabault, Yoan Mollard, Pierre Rouanet, Nicolas Jahier, Didier Roy, Anna-Lisa Vollmer, Freek Stulp, Alexandre Gepperth, David Filliat, Manuel Lopes, Pierre-Yves Oudeyer
Whole Brain Architecture Initiative | 9 | Koji Morikawa, Hideyuki Nakashima, Hiroyuki Morikawa, Masaru Tomita, Kitano Hiroaki, Kenji Doya, Koichi Takahashi, Yutaka Matsuo, Hiroshi Yamakawa
Google Brain | 1 | Jeremy Nixon
Google DeepMind | 1 | Miljan Martic