AI Watch

Welcome! This is a website to track people and organizations in the AI safety/alignment/AI existential risk communities. A position or organization being on AI Watch does not indicate an assessment that that position or organization is actually making AI safer or that the position or organization is good for the world in any way. It is mostly a sociological indication that the position or organization is associated with these communities, as well as an indication that the position or organization claims to be working on AI safety or alignment. (There are some plans to eventually introduce such assessments on AI Watch, but for now there are none.) See the code repository for the source code and data of this website.

This website is developed by Issa Rice with data contributions from Sebastian Sanchez, Amana Rice, and Vipul Naik, and has been partially funded by Vipul Naik and Mati Roy (who in July 2023 paid for the time Issa had spent answering people’s questions about AI Watch up until that point).

Last updated on 2024-10-19; see here for a full list of recent changes.

Agendas

Agenda name Associated people Associated organizations
Iterated amplification Paul Christiano, Buck Shlegeris, Dario Amodei OpenAI
Embedded agency Eliezer Yudkowsky, Scott Garrabrant, Abram Demski Machine Intelligence Research Institute
Comprehensive AI services Eric Drexler Future of Humanity Institute
Ambitious value learning Stuart Armstrong Future of Humanity Institute
Factored cognition Andreas Stuhlmüller Ought
Recursive reward modeling Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg Google DeepMind
Debate Paul Christiano OpenAI
Interpretability Christopher Olah
Inverse reinforcement learning
Preference learning
Cooperative inverse reinforcement learning
Imitation learning
Alignment for advanced machine learning systems Jessica Taylor, Eliezer Yudkowsky, Patrick LaVictoire, Andrew Critch Machine Intelligence Research Institute
Learning-theoretic AI alignment Vanessa Kosoy
Counterfactual reasoning Jacob Steinhardt

AI safety relation by subject

Note: as shown by the large number of “unknown” values, most of the positions haven’t been categorized by relation/subject, so this table will only be useful in the future.

Subject Unknown AGI organization GCR organization position unrelated Total
Unknown 8756 469 64 432 1 9722
background 0 0 0 23 0 23
general 0 0 3 43 0 46
policy 0 0 0 1 0 1
popularization 0 0 0 2 0 2
software engineering 0 2 0 8 0 10
strategy 0 0 0 1 0 1
technical research 6 2 3 33 1 45
Total 8762 473 70 543 2 9850

Positions summary by year

Note: as shown by the large number of “unknown” values, most of the positions haven’t been categorized by start/end dates, so this table will only be useful in the future.

Year Start date End date
Unknown 1112 6382
1986 1 0
1993 1 0
1997 3 0
1999 2 1
2000 5 0
2001 5 0
2002 62 1
2003 15 1
2004 32 4
2005 60 5
2006 37 13
2007 48 4
2008 78 8
2009 131 16
2010 181 42
2011 218 55
2012 167 75
2013 194 92
2014 264 66
2015 376 154
2016 640 229
2017 741 298
2018 867 397
2019 917 331
2020 911 327
2021 1001 365
2022 870 440
2023 728 354
2024 183 190

Positions grouped by person

Showing 236 people with positions.

Name Number of organizations List of organizations
Paul Christiano 9 AI Impacts, Alignment Research Center, Future of Humanity Institute, Machine Intelligence Research Institute, Open Philanthropy, OpenAI, Ought, Redwood Research, University of California, Berkeley
Nick Bostrom 7 Centre for the Study of Existential Risk, Future of Humanity Institute, Future of Life Institute, Google DeepMind, Leverhulme Centre for the Future of Intelligence, Machine Intelligence Research Institute, University of Oxford
Stuart Russell 7 Berkeley Existential Risk Initiative, Center for Human-Compatible AI, Centre for the Study of Existential Risk, Future of Life Institute, Leverhulme Centre for the Future of Intelligence, Machine Intelligence Research Institute, University of California, Berkeley
Andrew Critch 6 Berkeley Existential Risk Initiative, Center for Applied Rationality, Center for Human-Compatible AI, Encultured AI, Machine Intelligence Research Institute, University of California, Berkeley
Kyle Scott 6 Alignment Research Center, Berkeley Existential Risk Initiative, Center for Applied Rationality, Future of Humanity Institute, Model Evaluation and Threat Research, Palisade Research
Allan Dafoe 5 Centre for the Study of Existential Risk, Cooperative AI Foundation, Future of Humanity Institute, University of Oxford, Yale University
Dario Amodei 5 Anthropic, Cooperative AI Foundation, Google Brain, Open Philanthropy, OpenAI
Jan Leike 5 Australian National University, Future of Humanity Institute, Google DeepMind, Machine Intelligence Research Institute, OpenAI
Ryan Carey 5 Centre for the Study of Existential Risk, Future of Humanity Institute, Machine Intelligence Research Institute, OpenAI, Ought
Seán Ó hÉigeartaigh 5 Berkeley Existential Risk Initiative, Centre for the Study of Existential Risk, Future of Humanity Institute, Global Catastrophic Risk Institute, Leverhulme Centre for the Future of Intelligence
Adam Gleave 4 Center for Human-Compatible AI, FAR.AI, Fund for Alignment Research, Model Evaluation and Threat Research
Bas R. Steunebrink 4 IDSIA, NNAISENSE, SUPSI, Università della Svizzera italiana
Ben Goldhaber 4 Center for Applied Rationality, FAR.AI, Fund for Alignment Research, Ought
Heather Roff 4 Arizona State University, Leverhulme Centre for the Future of Intelligence, New America Foundation, University of Oxford
Jaan Tallinn 4 Berkeley Existential Risk Initiative, Centre for the Study of Existential Risk, Future of Life Institute, Machine Intelligence Research Institute
Lawrence Chan 4 Alignment Research Center, Center for Human-Compatible AI, FAR.AI, Fund for Alignment Research
Matthijs Maas 4 Global Catastrophic Risk Institute, Global Politics of Artificial Intelligence Research Group at Yale University and University of Oxford, Hague Centre for Strategic Studies, University of Copenhagen
Miles Brundage 4 Arizona State University, Future of Humanity Institute, General AI Challenge, OpenAI
Roman Yampolskiy 4 General AI Challenge, Global Catastrophic Risk Institute, Machine Intelligence Research Institute, University of Louisville
Scott Emmons 4 Center for AI Safety, Center for Human-Compatible AI, FAR.AI, Fund for Alignment Research
Seth Baum 4 Centre for the Study of Existential Risk, Global Catastrophic Risk Institute, Machine Intelligence Research Institute, Social & Environmental Entrepreneurs
Adrian Weller 3 Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence, University of Cambridge
Alex Tamkin 3 Anthropic, FAR.AI, Stanford University
Alison Gopnik 3 Center for Human-Compatible AI, Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence
Andrew Snyder-Beattie 3 Berkeley Existential Risk Initiative, Future of Humanity Institute, Leverhulme Centre for the Future of Intelligence
Bart Selman 3 Center for Human-Compatible AI, Cornell University, Machine Intelligence Research Institute
Ben Weinstein-Raun 3 Machine Intelligence Research Institute, Ought, Redwood Research
Benjamin Mann 3 Anthropic, Machine Intelligence Research Institute, OpenAI
Buck Shlegeris 3 Alignment Research Center, Machine Intelligence Research Institute, Redwood Research
Carla Zoe Cremer 3 Centre for the Study of Existential Risk, Future of Humanity Institute, Leverhulme Centre for the Future of Intelligence
Daniel Dewey 3 Future of Humanity Institute, Future of Life Institute, Machine Intelligence Research Institute
Daniel Kokotajlo 3 AI Impacts, Effective Altruism Foundation, OpenAI
Daniela Amodei 3 Anthropic, Epoch, OpenAI
David Krueger 3 Center for Human-Compatible AI, Centre for the Study of Existential Risk, Future of Humanity Institute
Elon Musk 3 Centre for the Study of Existential Risk, Future of Life Institute, OpenAI
Eric Rogstad 3 Berkeley Existential Risk Initiative, Center for Applied Rationality, Lightcone Infrastructure
Ethan Perez 3 FAR.AI, Fund for Alignment Research, New York University
Francesca Rossi 3 Future of Life Institute, Leverhulme Centre for the Future of Intelligence, University of Padova
Gillian Hadfield 3 Center for Human-Compatible AI, Cooperative AI Foundation, OpenAI
Girish Sastry 3 Future of Humanity Institute, OpenAI, Ought
Helen Toner 3 Center for Security and Emerging Technology, Future of Humanity Institute, OpenAI
Holden Karnofsky 3 Model Evaluation and Threat Research, OpenAI, Redwood Research
Huw Price 3 Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence, University of Cambridge
Ian McKenzie 3 FAR.AI, Fund for Alignment Research, Ought
Jack Clark 3 Anthropic, Center for Security and Emerging Technology, OpenAI
Jacob Steinhardt 3 Center for Human-Compatible AI, Open Philanthropy, Stanford University
Janos Kramar 3 Future of Life Institute, Machine Intelligence Research Institute, University of Montreal
Jeremy Schlatter 3 Berkeley Existential Risk Initiative, Machine Intelligence Research Institute, OpenAI
Johannes Treutlein 3 Center for Human-Compatible AI, Centre for Effective Altruism, Effective Altruism Foundation
José Hernández-Orallo 3 Centre for the Study of Existential Risk, General AI Challenge, Leverhulme Centre for the Future of Intelligence
Josh Jacobson 3 Alignment Research Center, Berkeley Existential Risk Initiative, FAR.AI
Jürgen Schmidhuber 3 IDSIA, SUPSI, Università della Svizzera italiana
Kaj Sotala 3 Foundational Research Institute, Lightcone Infrastructure, Machine Intelligence Research Institute
Katja Grace 3 AI Impacts, Future of Humanity Institute, Machine Intelligence Research Institute
Laurent Orseau 3 AgroParisTech, Google DeepMind, INRA
Malo Bourgon 3 Berkeley Existential Risk Initiative, Machine Intelligence Research Institute, Redwood Research
Mark Ring 3 IDSIA, SUPSI, Università della Svizzera italiana
Martin Rees 3 Centre for the Study of Existential Risk, Future of Life Institute, Leverhulme Centre for the Future of Intelligence
Matthew Graves 3 Center for Applied Rationality, Lightcone Infrastructure, Machine Intelligence Research Institute
Max Tegmark 3 Centre for the Study of Existential Risk, Future of Life Institute, Machine Intelligence Research Institute
Michael Cohen 3 Center for Human-Compatible AI, Future of Humanity Institute, The Australian National University
Oliver Habryka 3 Center for Applied Rationality, Lightcone Infrastructure, Machine Intelligence Research Institute
Owain Evans 3 Future of Humanity Institute, Ought, University of Oxford
Patrick LaVictoire 3 Machine Intelligence Research Institute, Quixey, University of Wisconsin–Madison
Peter Barnett 3 Center for Human-Compatible AI, Machine Intelligence Research Institute, Nonlinear
Pieter Abbeel 3 Center for Human-Compatible AI, OpenAI, University of California, Berkeley
Qiaochu Yuan 3 Berkeley Existential Risk Initiative, Center for Applied Rationality, University of California, Berkeley
Ramana Kumar 3 Data61, Machine Intelligence Research Institute, University of Cambridge
Robin Hanson 3 Future of Humanity Institute, George Mason University, Machine Intelligence Research Institute
Sawyer Bernath 3 Berkeley Existential Risk Initiative, FAR.AI, Fund for Alignment Research
Tom Brown 3 Anthropic, Google Brain, OpenAI
Tom McGrath 3 AI Safety Camp, Future of Humanity Institute, Ought
Tomasz Korbak 3 Anthropic, FAR.AI, Fund for Alignment Research
Victoria Krakovna 3 Future of Life Institute, Google DeepMind, Machine Intelligence Research Institute
Yang Liu 3 Centre for the Study of Existential Risk, OpenAI, University of Cambridge
Adam Scholl 2 Center for Applied Rationality, Global Catastrophic Risk Institute
Adrià Garriga-Alonso 2 FAR.AI, Fund for Alignment Research
Ales Flidr 2 Centre for Effective Altruism, Future of Life Institute
Alex Zhu 2 Machine Intelligence Research Institute, Nonlinear
Alexey Potapov 2 AIDEUS, ITMO University
Amanda Askell 2 Anthropic, OpenAI
Amrit Sidhu-Brar 2 Cooperative AI Foundation, Effective Altruism Foundation
Anastasiia Gaidashenko 2 FAR.AI, Fund for Alignment Research
Anca Dragan 2 Center for Human-Compatible AI, University of California, Berkeley
Andreas Stuhlmüller 2 Ought, Stanford University
Anna Salamon 2 Center for Applied Rationality, Machine Intelligence Research Institute
Ben Goertzel 2 CogPrime, Machine Intelligence Research Institute
Ben Hoskin 2 Alignment Research Center, Machine Intelligence Research Institute
Ben West 2 Model Evaluation and Threat Research, Ought
Benya Fallenstein 2 Machine Intelligence Research Institute, University of Bristol
Beth Barnes 2 Center for Human-Compatible AI, Centre for the Study of Existential Risk
Blake Borgeson 2 Machine Intelligence Research Institute, Redwood Research
Brandon Perry 2 AI Safety Camp, Center for Human-Compatible AI
Brian Tomasik 2 Effective Altruism Foundation, Foundational Research Institute
Carl Shulman 2 Future of Humanity Institute, Machine Intelligence Research Institute
Carrick Flynn 2 Center for Security and Emerging Technology, Future of Humanity Institute
Catherine Olsson 2 Anthropic, OpenAI
Charlie Rogers-Smith 2 Palisade Research, University of Oxford
Chris Cundy 2 FAR.AI, Ought
Chris Maddison 2 Google DeepMind, University of Oxford
Christine Peterson 2 Foresight Institute, Machine Intelligence Research Institute
Christopher Cundy 2 Center for Human-Compatible AI, Future of Humanity Institute
Christopher Olah 2 Google Brain, OpenAI
Claudia Shi 2 FAR.AI, Fund for Alignment Research
Connor Flexman 2 AI Impacts, Machine Intelligence Research Institute
Dan Hendrycks 2 Center for AI Safety, University of California, Berkeley
Daniel Filan 2 Center for Human-Compatible AI, Future of Humanity Institute
Daniel Ziegler 2 OpenAI, Redwood Research
Danny Hernandez 2 Anthropic, OpenAI
David Abel 2 Brown University, Future of Humanity Institute
David Kristoffersson 2 AI Safety Camp, Future of Humanity Institute
David Lindner 2 AI Safety Camp, Center for Human-Compatible AI
David Manheim 2 Association for Long Term Existence and Resilience, Future of Humanity Institute
Demis Hassabis 2 Google DeepMind, Leverhulme Centre for the Future of Intelligence
Dmitrii Krasheninnikov 2 Center for Human-Compatible AI, University of Amsterdam
Dorsa Sadigh 2 Center for Human-Compatible AI, Stanford University
Durk Kingma 2 Google DeepMind, OpenAI
Dylan Hadfield-Menell 2 Center for Human-Compatible AI, University of California, Berkeley
Elizabeth Barnes 2 Center for Human-Compatible AI, Model Evaluation and Threat Research
Elizabeth Cooper 2 Berkeley Existential Risk Initiative, Centre for the Study of Existential Risk
Fazl Barez 2 Centre for the Study of Existential Risk, Future of Life Institute
Fynn Heide 2 FAR.AI, Fund for Alignment Research
Gina Stuessy 2 Berkeley Existential Risk Initiative, Center for Applied Rationality
Gwern Branwen 2 Center for Applied Rationality, Machine Intelligence Research Institute
Hannah Betts 2 FAR.AI, Fund for Alignment Research
Haydn Belfield 2 Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence
Ian Goodfellow 2 Google DeepMind, OpenAI
Isabella Duan 2 FAR.AI, Fund for Alignment Research
Jacob Hilton 2 Alignment Research Center, OpenAI
Jacob Lagerros 2 Future of Humanity Institute, Lightcone Infrastructure
Jaime Sevilla 2 Centre for the Study of Existential Risk, Epoch
Jakob Foerster 2 Center for Human-Compatible AI, OpenAI
James Miller 2 Machine Intelligence Research Institute, Smith College
James Paul Gonzales 2 Berkeley Existential Risk Initiative, Center for Human-Compatible AI
Jeffrey Ladish 2 Anthropic, Palisade Research
Jelena Luketina 2 Aalto University, Université de Montréal
Jérémy Scheurer 2 FAR.AI, Fund for Alignment Research
Jesse Clifton 2 Cooperative AI Foundation, Effective Altruism Foundation
Jesse Galef 2 Future of Life Institute, Machine Intelligence Research Institute
Jesse Liptrap 2 Center for Applied Rationality, Machine Intelligence Research Institute
Jia Yuan Loke 2 Anthropic, Effective Altruism Foundation
Jimmy Rintjema 2 AI Impacts, Machine Intelligence Research Institute
Joar Skalse 2 Future of Humanity Institute, Oxford University
Johannes Heidecke 2 AI Safety Camp, Road to AI Safety Excellence
John Salvatier 2 AI Impacts, Future of Humanity Institute
Jon Gauthier 2 Massachusetts Institute of Technology, OpenAI
Joseph Halpern 2 Center for Human-Compatible AI, Cornell University
Joshua Clymer 2 Center for AI Safety, Model Evaluation and Threat Research
Joshua Fox 2 Association for Long Term Existence and Resilience, Machine Intelligence Research Institute
Joshua Gans 2 National Bureau of Economic Research, University of Toronto
Julia Galef 2 Center for Applied Rationality, OpenAI
Jun Shern Chan 2 Center for AI Safety, Fund for Alignment Research
Justin Shovelain 2 Convergence Analysis, Machine Intelligence Research Institute
Karl Berzins 2 FAR.AI, Fund for Alignment Research
Kellin Pelrine 2 FAR.AI, Fund for Alignment Research
Kenzi Amodei 2 Berkeley Existential Risk Initiative, Center for Applied Rationality
Kris Chari 2 Alignment Research Center, Model Evaluation and Threat Research
Kristinn R. Thórisson 2 Center for Analysis & Design of Intelligent Agents, Icelandic Institute for Intelligent Machines
Lewis Hammond 2 Cooperative AI Foundation, Future of Humanity Institute
Linda Linsefors 2 AI Safety Camp, Machine Intelligence Research Institute
Lindsay Murachver 2 FAR.AI, Fund for Alignment Research
Lukas Gloor 2 Effective Altruism Foundation, Foundational Research Institute
Marcello Herreshoff 2 Google, Machine Intelligence Research Institute
Marek Havrda 2 General AI Challenge, GoodAI
Marek Rosa 2 General AI Challenge, GoodAI
Margaret Boden 2 Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence
Matthijs M. Maas 2 Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence
Max Daniel 2 Effective Altruism Foundation, Foundational Research Institute
Megan Kinniment 2 Alignment Research Center, Model Evaluation and Threat Research
Melody Guan 2 Future of Life Institute, Google Brain
Michael Blume 2 Center for Applied Rationality, Machine Intelligence Research Institute
Michael Chen 2 Center for AI Safety, Model Evaluation and Threat Research
Michael Keenan 2 Berkeley Existential Risk Initiative, Center for Applied Rationality
Michael Page 2 Center for Security and Emerging Technology, OpenAI
Michael Wellman 2 Center for Human-Compatible AI, University of Michigan
Mihaly Barasz 2 Machine Intelligence Research Institute, Nilcons
Mohammad Taufeeque 2 FAR.AI, Fund for Alignment Research
Moritz von Knebel 2 FAR.AI, Fund for Alignment Research
Mrinank Sharma 2 Future of Humanity Institute, University of Oxford
Murray Shanahan 2 Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence
Natalia Díaz Rodríguez 2 ContinualAI, Flowers Laboratory
Neal Jean 2 Future of Humanity Institute, Ought
Neel Nanda 2 Anthropic, Center for Human-Compatible AI
Niki Howe 2 FAR.AI, Fund for Alignment Research
Nisan Stiennon 2 Center for Human-Compatible AI, Machine Intelligence Research Institute
Nora Belrose 2 FAR.AI, Fund for Alignment Research
Olga Afanasjeva 2 General AI Challenge, GoodAI
Oskar Hollinsworth 2 FAR.AI, Fund for Alignment Research
Owen Cotton-Barratt 2 Centre for Effective Altruism, Redwood Research
Ozzie Gooen 2 Convergence Analysis, Ought
Pedro Freire 2 Center for Human-Compatible AI, FAR.AI
Philip Quirke 2 FAR.AI, Fund for Alignment Research
Piers Millett 2 Centre for the Study of Existential Risk, Future of Humanity Institute
Rae She 2 Alignment Research Center, Model Evaluation and Threat Research
Randall C. O’Reilly 2 eCortex, University of Colorado Boulder
Rebecca Baron 2 Alignment Research Center, Model Evaluation and Threat Research
Rebecca Raible 2 Anthropic, Berkeley Existential Risk Initiative
Remco Zwetsloot 2 Center for Security and Emerging Technology, OpenAI
Remmelt Ellen 2 AI Safety Camp, Road to AI Safety Excellence
Reuben Stern 2 Ludwig Maximilian University of Munich, University of Wisconsin–Madison
Richard Ngo 2 OpenAI, University of Cambridge
Robert Miles 2 Nonlinear, Road to AI Safety Excellence
Robert Mushkatblat 2 Lightcone Infrastructure, Machine Intelligence Research Institute
Roger Grosse 2 Future of Humanity Institute, University of Toronto
Rosie Campbell 2 Center for Human-Compatible AI, OpenAI
Roxanne Heston 2 Center for Security and Emerging Technology, Future of Humanity Institute
Saad Siddiqui 2 FAR.AI, Fund for Alignment Research
Sam Clarke 2 Centre for the Study of Existential Risk, Future of Humanity Institute
Sam McCandlish 2 Anthropic, OpenAI
Sergey Rodionov 2 AIDEUS, Aix-Marseille University
Siao Si Looi 2 FAR.AI, Fund for Alignment Research
Smitha Milli 2 Center for Human-Compatible AI, University of California, Berkeley
Sören Mindermann 2 Center for Human-Compatible AI, Future of Humanity Institute
Stanislav Fort 2 Google DeepMind, Stanford University
Stephanie Zolayvar 2 AI Impacts, Center for Applied Rationality
Stephen Hawking 2 Centre for the Study of Existential Risk, Future of Life Institute
Steve Omohundro 2 Machine Intelligence Research Institute, Self-Aware Systems
Steven Umbrello 2 Global Catastrophic Risk Institute, Institute of Ethics and Emerging Technologies
Stuart Armstrong 2 Future of Humanity Institute, Machine Intelligence Research Institute
Tamay Besiroglu 2 Epoch, Future of Humanity Institute
Tao Lin 2 Model Evaluation and Threat Research, Redwood Research
Thomas Woodside 2 Center for AI Safety, Center for Security and Emerging Technology
Timothée Lesort 2 ContinualAI, Flowers Laboratory
Timothy Telleen-Lawton 2 Anthropic, Center for Applied Rationality
Tobias Baumann 2 Foundational Research Institute, University College London
Tom Everitt 2 Australian National University, Google DeepMind
Tom Tseng 2 FAR.AI, Fund for Alignment Research
Tsvi Benson-Tilsen 2 Center for Applied Rationality, Machine Intelligence Research Institute
Vael Gates 2 Center for Human-Compatible AI, Fund for Alignment Research
Vanessa Kosoy 2 Association for Long Term Existence and Resilience, Machine Intelligence Research Institute
Vincent Conitzer 2 Cooperative AI Foundation, Duke University
Will Grathwohl 2 Google DeepMind, OpenAI
Will Millership 2 General AI Challenge, GoodAI
Will Sawin 2 Institute for Theoretical Studies at ETH Zurich, Princeton University
Yoshua Bengio 2 Model Evaluation and Threat Research, Montreal Institute for Learning Algorithms
Zac Kenton 2 Montreal Institute for Learning Algorithms, Ought

Positions grouped by organization

Showing 158 organizations.

Organization Number of people List of people
OpenAI 366 Zico Kolter, Laura W., Yasuyoshi Sakamoto, Evan Weiss, Shuyuan Zhang, Mada Aflak, Kleanthes K., Weiyi Zheng, Uğurcan Türkdoğan, Stewart Hall, Siyuan Fu, Ollie Jaffe, Amber Yore, John Rizzo, Tiffany C., Thomas Dimson, Francis Z., Enoch Cheung, Pedram Keyani, Wei An Lee, Ilan Bigio, CJ Minott, Ofir Nachum, Allan J., Will Saborio, Erica T., Eric Rynerson, David Carr, Daniel Kappler, Anton Tananaev, Srinivas Narayanan, Andrei Alexandru, Tina Miranda, Hisham Elhaddad, Brydon Eastman, Ali Kamali, Justin B., David Medina, David Hengky, Tianhao Zheng, Michelle Pokrass, Adam Perelman, Jan Hendrik Kirchner, Hossem Ben Ayed, Cory Decareaux, Mati Roy, Akila Welihinda, Yaniv Markovski, Vishal Kuo, Steven Bills, Adam Nace, Jessica Shieh, Eugene Wu, Chester Cho, Karl Whitford Pollard, Tatiana Zolotova, Sully Chen, Ryan Peterson, Daniel Kokotajlo, Oleg Mürk, Bogo Giertler, Juston Forte, Joanne Jang, Zarina Stanik, Chaitra A., Arun Vijayvergiya, Adam Goldberg, Rob Mallery, Dave Willner, Preston Tuggle, Austin Wiseman, Atqiya Abida Anjum, Angela Jiang, Lama Ahmad, Giambattista Parascandolo, Davit Khachatryan, Rajeev Nayak, Matthew Gentzel, Sarah Shoker, Richard Ngo, Carroll Wainwright, Anna Makanju, Vitchyr Pong, Victor Benito Garcia Rocha, Elie Georges, Angie Luo, Lisa Dethridge, Vlad Ursu, Isabel Alves de Lima, Stefanie Biaggi, Rosie Campbell, Lukasz Kaiser, Johannes H., Sarthak Agrawal, Kyle Kosic, Jason Kwon, Emanuele Marchiori, Radhika Mathur, Tabarak Khan, Natalie Summers, Jesse Han, Ishant Singh, Hannah Wong, Nicolas Norberto Corizzo, Bob Rotsted, Zack Kass, Jan Leike, Evan Morikawa, Sinith T., Shawn Jain, Che Chang, Lucas Negritto, Jonathan Gordon, Steven Adler, Tarun Gogineni, Maddie Simens, Adam Rhodes, Julián Santoro, Tyna Eloundou, Phuong Vu, Philippe Tillet, Bram Adams, Theresa Lopez, Fotios Chantzis, Dave Cummings, Mo Bavarian, Joel Lehman, Denny Jin, Joost Huizinga, Raul Puri, Red A., Emy Parparita, Alex Nichol, Kelly Sims, Tim Yanchen Wang, Arvind 
Neelakantan, Jeff Clune, Tom Rubin, Fraser Kelton, Rachel Lim, Jian O., Jacquelyn Lau, Roger Xu Jiang, Aris Konstantinidis, Tao Xu, Gretchen M. Krueger, Girish Sastry, Stanislas Polu, Cullen O’Keefe, Mario Saltarelli, Benjamin Mann, Luke Miller, Long Ouyang, Ife Riamah, Frances Choi, Richard Dunn, Peter Hoeschele, Karson Elmgren, Jerry Tworek, Yi Wu, Ilge Akkaya, Nikolas Tezak, Mor Katz, Alex Paino, Jonathan Michaux, Yuhao Wan, Janet Brown, Fatma Tarlaci, Elynn Chen, Edgar Barraza, Danny Hernandez, Christina Hendrickson, Maxim Sokolov, Katie Mayer, Nancy Otero, Bianca Martin, Ben Chess, Tom Brown, Qiming Yuan, Mateusz Litwin, Janine Korovesis, Clemens Winter, Amanda Askell, Lei Zhang, Justin Wang, Jacob Hilton, Todor Markov, Ian Atha, Daniela Amodei, Mikhail Pavlov, Jacob Jackson, Taehoon Kim, Sue Yoon, Christopher Olah, Maddie Hall, Jeffrey Wu, Ingmar Kanitscheider, Gillian Hadfield, Brad Lightcap, Miles Brundage, Michał Staniszewski, Matt Mochary, Arthur Petron, Karl Cobbe, Joshua Meier, Johannes Otterbach, Yilun Du, Xingyou (Richard) Song, Ifu Aniemeka, Holly Grimm, Hannah Davis, Ethan Knight, Sophia Arakelyan, Christine McLeavey Payne, Nadja Rhodes, Munashe Shumba, Mira Murati, Louis Cheong, Will Grathwohl, Hanjun Dai, Susan Zhang, Suchir Balaji, Erin Grant, Sam McCandlish, Sadhika Malladi, Daniel Ziegler, Michael Petrov, Aravind Srinivas, Adam D’Angelo, Aleksandar Botev, Henrique Ponde de Oliveira Pinto, Thomas Anthony, Peter Zhokhov, Eric Sigler, Elena Chatziathanasiadou, Diane Yoon, Rewon Child, Manuel Sherbakoff, Julia Galef, Maran Nelson, Lilian Weng, Kevin Wong, Kaleo Hao, Glenn Powell, Ryan Carey, Parnian Barekatain, Adam Smets, Tasha McCauley, Remco Zwetsloot, David Farhi, Christy Dennison, Ashley C. 
Pilipiszyn, Mathew Shrwed, David Luan, Maciej Chociej, Jonathan Ward, Jonathan Raiman, Larissa Schiavo, Karthik Narasimhan, Phillip Isola, Nikhil Mishra, Bowen Baker, Joshua Achiam, Yuping Luo, Geoffrey Irving, Aditya Grover, Jiaming Song, Yuhuai Wu, Jakob Foerster, Christos Louizos, Oleg Klimov, Cathy Wu, Brooke Chan, AlShaun Baksh, Lerrel Pinto, Kevin Frans, Yang Liu, Jason Peng, Xue Bin Peng, Trapit Bansal, Han Zhang, Dustin Tran, Maruan Al-Shedivat, David Lansky, Quirin Fischer, Christopher Hesse, Matthias Plappert, Art Chaidarun, Jean Harb, Rein Houthooft, Christopher Berner, Aleks Kamko, Jakub Pachocki, Ryan Lowe, Yaroslav Bulatov, Jonathan Ho, Tim Shi, Erika Reinhardt, Shariq Hashme, Richard Chen, Danielle Buma, Peter Welinder, Bob McGrew, Michael Page, Jeremy Schlatter, Taco Cohen, Szymon Sidor, Louise Cabansay, Josh Tobin, Jonathan Hernandez, Jack Clark, Harri Edwards, Desmond Henderson, Rachel Fong, Marie La, Ludwig Pettersson, Tambet Matiisen, Marika Allely, Igor Mordatch, Filip Wolski, Catherine Olsson, Zain Shah, Scott Gray, Dario Amodei, Craig Quiter, Linxi Fan, Kate Miltenberger, Jon Gauthier, Tyler Neylon, Rafał Józefowicz, Pieter Abbeel, Jie Tang, Paul Christiano, Marcin Andrychowicz, Peter Chen, Jim Fan, Tim Salimans, Prafulla Dhariwal, Shivon Zilis, Jonas Schneider, Jeff Arnold, Alec Radford, Yuri Burda, Eric Price, Ian Goodfellow, Chris Clark, Rocky Duan, Trevor Blackwell, Ilya Sutskever, Bradly Stadie, Andrej Karpathy, John Schulman, Wojciech Zaremba, Elon Musk, Sam Altman, Reid Hoffman, Vicki Cheung, Greg Brockman, Durk Kingma, Matt Krisiloff, Lucy Qin, Jonathan Gray, Javier Gai, Holden Karnofsky, Helen Toner, Anish Athalye
Machine Intelligence Research Institute 181 Jimmy Rintjema, Protyay Shyam Chowdhury, Lisa Thiergart, Gretta Duleba, Jeremy Gillen, Peter Barnett, James Payor, Edward Kmett, Victoria Krakovna, Carson Jones, Evan Hubinger, David Simmons, Daniel Demski, Ben Weinstein-Raun, Alex Zhu, Alex Mennen, Alex Appel, Linda Linsefors, Andrew Critch, Buck Shlegeris, Blake Borgeson, Nick Tarleton, Kurt Brown, Jesse Liptrap, Benjamin Mann, Sam Eisenstat, Jeremy Schlatter, Jan Leike, Matthew Graves, Ryan Carey, Connor Flexman, Colm Ó Riain, Aaron Silverbook, Gary Drescher, Andrew Lapinski-Barker, Robin Hanson, Kaya Stechly, Jack Gallagher, Jaan Tallinn, Bart Selman, Stuart Russell, Ramana Kumar, Vanessa Kosoy, Abram Demski, Stuart Armstrong, Nate Thomas, Jed McCaleb, Jake Moskowitz, Scott Garrabrant, Luke Muehlhauser, Jessica Taylor, Jesse Galef, Tsvi Benson-Tilsen, Matthew Fallshaw, Elizabeth Morningstar, Nicolas Gagné, Lila Rieber, Vipul Naik, Nate Soares, Daniel Lewis, Rob Bensinger, Richard Neal, Robert Mushkatblat, Dávid Natingga, James Miller, Seth Baum, Roman Yampolskiy, Randal Koene, Nathan Clark, Moshe Looks, Kaj Sotala, Evan Erickson, Sebastian Nickel, Oliver Habryka, Bill Hibbard, Benya Fallenstein, Anja Heinisch, Alex Altair, Vladimir Nesov, Steve Rayhawk, Paul Christiano, Patrick LaVictoire, Nisan Stiennon, Mihaly Barasz, Joshua Fox, Jeremy Miller, Stephen Barnes, Ioven Fables, Louie Helm, Daniel Roth, Patrick Robotham, Pedro Chaves, Topher Brennan, Liron Shapira, Carl Shulman, Cameron Taylor, Nickolai Leschov, Jonathan Wang, Jake Miller, Gwern Branwen, Erica Edelman, Alex Vermeer, Tomer Kagan, Pejman Makhfi, Malo Bourgon, Kevin Fischer, Robert V. 
Brazell, Lincoln Quirk, Diego Caleiro, Will Newsome, Nevin Freeman, Minda Myers, Keefe Roedersheimer, Peter Scheyer, Jasen Murray, Peter de Blanc, Daniel Dewey, Abraham Wolk, Thomas Colthurst, Stanislas Sochacki, Luke Grecki, Janos Kramar, Dennis Fan, Ben Hoskin, Ben Goertzel, Jason Levin, Tim Czech, Robert Zahra, Frank Adamek, Aruna Vassar, Amy Willey, Henrik Jonsson, Harrison Willey, Edwin Evans, Anna Salamon, Max Tegmark, Zack M. Davis, Michael Blume, Kemal Eren, Andrew Rettek, Michael Vassar, Andriy Brodskyy, Andrew Hay, Vincent Fagot, Thomas McCabe, Steven Kaas, Roko Mijic, Katja Grace, Justin Shovelain, Bryan Bishop, Alyssa Vance, Peter Cheeseman, David Hart, Susan Fonseca-Klein, C. Colby Thomson, Jonas Lamis, Steve Omohundro, Bruce Klein, Allison Taguchi, Brian Atkins, Barney Pell, Tyler Emerson, Neil Jacobstein, Carolyn L. Burke, Marcello Herreshoff, Peter Thiel, Rick Schwall, Emil Gilliam, Christine Peterson, Aubrey de Grey, Ray Kurzweil, Nick Bostrom, Jeff Medina, Michael Roy Ames, Jeff Alexander, Michael Wilson, Christian Rovner, Michael Anissimov, Michael Raimondi, Eliezer Yudkowsky, Sabine Atkins
Anthropic 170 Joel Lewenstein, Eilona Maitski, Diego Iaconelli, Coyote Codornices Marin, Chris O'Connell, Chinsin Sim, Adam Pearce, Adam Dix, Nina Rimsky, Meg Tong, Mark S., Laila Rafi, Julian Williams, Jonathan Marcus, Joel Pobar, Graham Jackson, Elaine C., Connor Holloway, Christopher Chalek, Ashley Zlatinov, Akila S., Sally Aldous, Rob Greenlee, Ranell Nakayama, Rae Phillips, Patrick Ekeruo, Nicola Lau, Nicholas Marwell, Kyle Turman, Kei Nishimura, JB Boin, Jamie Neuwirth, Isabel Larrow, Isaac Dunn, Hunar Batra, Dana Malman Warren, Carrie Bentley, Brian Delahunty, Alfred Mountfield, Vu Bui, Vinay Rao, Tomasz Korbak, Rishi Gupta, Kate Jensen, Daniel Rosenthal, Brett Andrus, Brendan Collins, Amir Kashanchi, Stephen Jung, Sasha de Marigny, Elena L., Dianne Na Penn, Anton Paquin, Zack Witten, Natalie Esperance, Marisa Gobby, Gautham Raj, Everett Katigbak, Alex Tamkin, Shawn Owen, Nicholas Turner, Laura Colley, Julia Schmaltz, Josiah Burke, Jihong Kim, Jennifer Pisansky, Evan Frondorf, Emmanuel Ameisen, Dan Dascalescu, Aaron Begg, Zubair Jandali, Tony H., Tanya Singh, Samantha Wong, Rachit Agarwal, Jason Clinton, Cassandra Evraets, Benoit Steiner, Ryan Seunghwan Kim, Ruhua Jiang, Pujaa Rajan, Paul-Frederik Schubert, Avital Balwit, Nathan Bailey, Joshua Batson, Jenan Wise, Ansh Radhakrishnan, Angie Lal, Robert Baden, Keri Warr, Julieann Choi, Janel Thamkul, Frances Pye, Esin Durmus, Elizabeth Edwards-Appell, Diana Jung, David Hwang, Ben Kuhn, Alex S., Adam Jermyn, Yifan Wu, Sandy Banerjee, Ryan Soklaski, Nikhil Bhargava, Marina Favaro, Linh-Chi T., Justin Spahr-Summers, Gyula Lakatos, Ethan Forrest, Devi Borg, Brayden McLean, Amanda (Lipson) Kelley, Alex Silverstein, Vlad G., Thompson Paine, Ethan Langevin, Autumn Russell, James Sully, Peter Lofgren, Mike Lambert, Matt Bell, Karina Nguyen, Hongbin Chen, Brian Israel, Oliver Rausch, Neerav Kingsland, Landon Goldberg, Deep Ganguli, Miranda Zhang, Da Yan, Noemí Mercado, Sam Bowman, Guro Khundadze, Nicholas Schiefer, 
Scott Johnston, Dustin Li, Bryan Seethor, Thomas Liao, Shauna Kravec, Saurav Kadavath, Rebecca Raible, Neel Nanda, Jackson Kernion, Tom Conerly, Jia Yuan Loke, Andy Jones, Liane Lovitt, Jeffrey Ladish, Timothy Telleen-Lawton, Dawn Drain, Anna Chen, Yuntao Bai, Nelson Elhage, Kamal Ndousse, Catherine Olsson, Amanda Askell, Dario Amodei, Danny Hernandez, Benjamin Mann, Jared Kaplan, Jack Clark, Tom Brown, Sam McCandlish, Nicholas Joseph, Daniela Amodei, Moumita Das, Chris Olah, Zac Hatfield-Dodds, Tom Henighan, Nova DasSarma
Center for Security and Emerging Technology 170 Matthew Burtell, Thomas Woodside, Brendan Oliss, Lauren Kahn, Lawrence Hailes, Jenny Jun, John VerWey, Sam Bresnick, Mia Hoffmann, Cole McFaul, Brian Love, Carolina Pachón, Andrea Guerrero, Josh Goldstein, Neha Singh, Hanna Dohmen, Katherine Quinn, Steph Batalis, Vikram Venkatram, Kathleen Curlee, Christian Schoeberl, Donna Artusy, Robert Cardillo, Remco Zwetsloot, Rafay Ur Rehman Khan, Olivia Albrighton-Vanway, Michael Sulmeyer, Michael Page, Lorand Laskai, John Bansemer, Jeff Ding, Jacob Strieb, Eri Phinisee, Emelia Probasco, Emefa Addo Agawu, Elsa Kania, Darrin Gladman, Daniel Cebul, Dalila Scott, Dakota Foster, Dakota Cary, Collins Nji, Claire Perkins, Cindy Martinez, Christopher Back, Christine McNeill, Carrick Flynn, Beba Cibralic, Avonelle Davis, Aurora Johnson, Ashwin Acharya, Anna Puglisi, Amy Chao, Alan Loera, Aditi Joshi, Tina Huang, Thuy Nguyen, Tarun Chhabra, Tantum Collins, Sue Gordon, Stephanie O'Sullivan, Schuyler Moore, Santiago Mutis, Roxanne Heston, Nii Simmonds, Ronnie Kinoshita, Kevin Wolf, Heather Frase, Mina Narayanan, Walter Haydock, Jessica Ji, Shuvo Bardhan, Owen Daniels, Laissa A., Ella Kay, Caroline Schuerger, Lisa Oguike, Sara Abdulla, Luke Koslosky, Jack Corrigan, Channing Lee, Kyle Miller, Heeu Millie Kim, Kayla Goode, Eish Sumra, Adrienne Thompson, Maya Gros, Ingrid Dickinson, Ali Crawford, Abelardo Cruz Osorio, Shelton Fitch, Melissa Deng, Mary Hill Brooks, Lizbeth Lucero, J.
Guillermo Mendoza Bazán, Filippo Fagnoni, Alex Friedland, Alan Omar Loera Martinez, Piyush Mishra, Oneeb Ul Haq Khan, Gustavo Mauricio Bastien Olvera, Will Hunt, George Klein, Diana Gehlhaus Carew, Darius Diamond, Sean Kucer, Raveena Kshatriya, Max Langenkamp, Jasmine Ding, Christina Ismailos, Chris Rohlf, Bryce Farabaugh, Ashton Garriott, Andrew Lohn, Andreas Greiler-Basaldúa, Alex Barker, Simon Godfrey Rodriguez, Katerina Sedova, Farid Nemri, Zuleirys Santana-Rodriguez, Rebecca Gelles, Jacob Feldgoise, Wyatt Hoffman, Micah Musser, Emily Weinstein, Autumn Toney, Ngor Luong, Matthew Daniels, Yiming Y., Nicolina Demakos, Emily Xue, Reginald Brothers, Charlie Wang, Alexandra Vreeman, Wenchuan Dong, Jack Clark, Melissa Flagg, Jack Lucas, Daniel Hague, Margarita Konaev, Igor Mikolic-Torreira, Catherine Aiken, Jonathan Murdick, Jennifer Melot, Huey-Meei Chang, Alexander M., Ilya Rahkovsky, Husanjot Chahal, Dahlia Peterson, Ben Murphy, Andrew Imbrie, Tim G. J. Rudner, Saif M. Khan, Saif Khan, Ryan Fedasiuk, Rebecca Kagan, Lynne Weil, Jamie Baker, Daniel Chou, Ben Buchanan, Benjamin Chang, William Hannas, James Dunham, Zachary Arnold, Peggy Evans, Jason Matheny, Helen Toner, Tim Hwang, Tessa Baker, Dewey Murdick
Center for Human-Compatible AI 122 Brandie Nonnecke, Henry Papadatos, Khanh Nguyen, Dale Reed, Ben Plaut, Bhaskar Mishra, Cameron Allen, Alexandra Souly, Tu (Alina) Trinh, Sana Pandey, Michael Cohen, Tiffany Wang, Jacy Reese Anthis, Brian Judge, Olivia Watkins, Niklas Lauffer, David Krueger, Leonie Richter, George Matheos, Shreyas Kapur, Nisan Stiennon, George Obaido, Erdem Biyik, Alexander Turner, Peter Barnett, Anand Siththaranjan, Yuxi Liu, Scott Emmons, Justin Svegliato, Ruairidh McLennan Battleday, Kimin Lee, James Paul Gonzales, Arnaud Fickinger, Wesley Holliday, Neel Nanda, Toni Lorente, Julia Kerley, Paria Rashidinejad, Jonathan Stray, Rafael Albert, Tom Lenaerts, Jakob Foerster, Cassidy Laidlaw, Alyssa Li Dayan, Micah Carroll, Johannes Treutlein, Jessy Lin, Harry Giles, Eric Michaud, Cynthia Chen, Charlotte Roman, Alex Gunning, Stephen Casper, Sören Mindermann, Sergei Volodin, Pedro Freire, Noor Brody, Neel Alex, Meir Friedenberg, Matthew Rahtz, Christopher Cundy, Pulkit Verma, Moritz Hardt, Brian Christian, Lawrence Chan, Rediet Abebe, Nika Haghtalab, David Lindner, Rachel Freedman, Jacob Steinhardt, Jess Riedel, Shlomi Hod, Sam Toyer, Martin Fukui, Caroline Jeanmaire, Vincent Corruble, Rohin Shah, Niko Kolodny, Brandon Perry, Michael Littman, Juliana Schroeder, Gillian Hadfield, Smitha Milli, Lara Buchak, Ken Goldberg, John Zysman, Dylan Hadfield-Menell, Demian Pouzo, Dawn Song, Daniel Filan, Charis Thompson, Alison Gopnik, Monica Gates, Marion Fourcade, Dan Hendrycks, Cody Wild, Steven Wang, Rosie Campbell, Mariano Florentino Cuéllar, Tania Lombrozo, Siddharth Srivastava, Adam Gleave, Beth Barnes, Elizabeth Barnes, Dmitrii Krasheninnikov, Andrew Critch, Mark Nitzberg, Karthika Mohan, Joseph Halpern, Bart Selman, Anca Dragan, Tom Griffiths, Stuart Russell, Satinder Singh Baveja, Pieter Abbeel, Michael Wellman, Vael Gates, Thanard Kurutach, Michael Dennis, Thomas Krendl Gilbert, Jaime Fernandez Fisac, Dorsa Sadigh
Centre for the Study of Existential Risk 101 Pablo Suarez, Reuben Makomere, Aarathi Krishnan, Thomas Moynihan, Elizabeth Cooper, Alexandra Klein, Laura Elmer, Shoshana Dahdi, Clare Arnstein, Kennedy Mbeva, Julian Huppert, Taniel Yusef, Cecil Abungu, Madhulika Srikumar, Zoe Hemsley, Matthew Connelly, Constantin Arnscheidt, Coleman Snell, Clarissa Rios Rojas, Dennis Müller, Sarah Dryhurst, Maurice Chiodo, Sam Clarke, Fazl Barez, Ross Gruetzemacher, Abdullahi Alim, Nathaniel Cooke, Paul Ingram, José Hernández-Orallo, Freya Jephcott, Jessica Bland, Catherine Rhodes, Matthijs M. Maas, Charlotte Christiane Hammer, Tom Hobson, Shin-Shin Hua, Lara Mani, Shahar Avin, Rumtin Sepasspour, S. J. Beard, Carla Zoe Cremer, Adrian Weller, Chris Lowe, Adrian Kent, Sean Holden, Stephen Hawking, Hermann Hauser, Tim Crane, David Cleevely, Jonathan Wiener, Max Tegmark, Peter Singer, Murray Shanahan, Dana Scott, Stuart Russell, Peter Piot, Tim Palmer, Elon Musk, Robert May, David Chalmers, Nick Bostrom, Margaret Boden, Martina Kunz, Beth Barnes, Yang Liu, Simon Goldhill, Jane Heal, Partha Dasgupta, Lalitha Sundaram, Haydn Belfield, Seán Ó hÉigeartaigh, Huw Price, Simon Beard, Alison Gopnik, Ryan Carey, Jaan Tallinn, William Sutherland, Martin Rees, Susan Owens, Mami Mizutori, Piers Millett, Thomas Homer-Dixon, Robert Doubleday, Beatrice Crona, Belinda Cleeland, Des Browne, Yuval Noah Harari, Olaf Corry, Seth Baum, Caroline Baylon, Sophie Dannreuther, Charlotte Hammer, Jaime Sevilla, David Krueger, Elizabeth Seger, Di Cooke, Mike Cassidy, James Ginns, Andrew Tanentzap, Allan Dafoe, Rachel Burgess
Google DeepMind 91 Ruiqi Gao, Stanislav Fort, Aditya Srikanth Veerubhotla, Shixiang Shane Gu, Yonghui Wu, Gargi Balasubramaniam, Nithya Attaluri, Rohan Anil, Rishabh Joshi, Pierre Sermanet, Isabel Leal, Thibault Sellam, Anushka Nijhawan, Abhishek Rao, Sergio Guadarrama, Roopali (Paali) V., Raphael Hoffmann, Been Kim, Azade Nova, Piyush Patil, Nidhi Vyas, Wilfried L. Bounsi, Sridhar Thiagarajan, Sebastian Riedel, João Gabriel Lopes, Dmitry Nikulin, Sholto Douglas, Grace Lam, Kavya Kopparapu, Arthur Douillard, Shreya Pathak, Mehdi Jafarnia, Paige Bailey, Krishna Haridasan, Ian Goodfellow, Blanca Huergo, Yousuf Khan, Pratik Joshi, Daniel Sohn, Ruizhe Zhao, Will Grathwohl, Shubham Agrawal, Yayi Zou, Hamze M., David Stutz, Sho Arora, Yuzhu Dong, Keerthana Gopalakrishnan, Sylvestre Rebuffi, Jennifer She, Ira Ktena, Praneet Dutta, Pauline (Luc) Luc, Behnam Neyshabur, Paul Muller, Shantanu Thakoor, Petar Veličković, Yasaman Bahri, Durk Kingma, Lila Ibrahim, Gheorghe Comanici, Vishal Maini, Hanie Sedghi, Sean Legassick, Verity Harding, John Jumper, Laurel Wagstaff, Gagan Bansal, Daniel J. Mankowitz, Zachary Gleicher, Pushmeet Kohli, Miljan Martic, Vandana Bachani, Andrew Lefrancq, Pedro A. Ortega, Koray Kavukcuoglu, Shane Legg, Mustafa Suleyman, Demis Hassabis, Nick Bostrom, Tom Everitt, Jeffrey D. Sachs, Jan Leike, James Manyika, Edward W. Felten, Diane Coyle, Christiana Figueres, Laurent Orseau, Chris Maddison, Thore Graepel, Victoria Krakovna
Future of Humanity Institute 71 Michael Cohen, Patrick Butlin, Elise Bohan, Matthew van der Merwe, Janvi Ahuja, Peter Wills, Isaac Friend, Maria Violaris, Joar Skalse, Hannah Klim, Tushant Jha, Sam Clarke, Mrinank Sharma, Karolina Milewicz, Duncan Snidal, Michael Osborne, Jacob Lagerros, Jan Brauner, Roger Grosse, Lewis Hammond, Thomas Orton, Carla Zoe Cremer, Michael Montague, David Manheim, Ben Garfinkel, Gregory Lewis, Ondrej Bajgar, Baobao Zhang, Ryan Carey, Michael Bonsall, Helen Toner, Tom McGrath, Robin Hanson, Paul Christiano, Jan Leike, Sören Mindermann, Christopher Cundy, William Saunders, Neal Jean, Girish Sastry, Tamay Besiroglu, Clare Lyle, David Kristoffersson, Piers Millett, John Salvatier, David Krueger, David Abel, Allan Dafoe, Carrick Flynn, Niel Bowerman, Kyle Scott, Andrew Snyder-Beattie, Daniel Dewey, Toby Ord, Anders Sandberg, Vincent C. Müller, Stuart Armstrong, Sebastian Farquhar, Seán Ó hÉigeartaigh, Roxanne Heston, Owain Evans, Miles Brundage, Katja Grace, Jeffrey Ding, Jade Leung, Eric Drexler, Daniel Filan, Chelsea Guo, Cecilia Tilli, Carl Shulman, Nick Bostrom
Future of Life Institute 64 Isabella Hampton, Tim Schreier, Alexandra Tsalidis, Hamza Tariq Chaudhry, Maggie Munro, Ben Eisenpress, Landon Klein, Fazl Barez, Anna Hehir, Claudia Prettner, Akhil Deo, Taylor Jones, Risto Uuk, Andrea Berman, Mark Brakel, Carlos Ignacio Gutierrez, Anna Yelizarova, David E. Nicholson, Emilia Javorsky, Alan Yan, Tucker Davey, Na Li, Jacob Beebe, Yishuai Du, Lucas Perry, Melody Guan, David Stanley, Maxim Kesin, Daniel Dewey, Ariel Conn, Meia Chita-Tegmark, Jaan Tallinn, Anthony Aguirre, Richard Mallah, Ales Flidr, Victoria Krakovna, Jesse Galef, Zara Yaqoob, William Jones, Vera Koroleva, Stuart Russell, Stephen Hawking, Saul Perlmutter, Sandra Faber, Rafael Martinez-Galarza, Peter Haas, Nick Bostrom, Morgan Freeman, Max Tegmark, Martin Rees, Kazue Evans, Janos Kramar, George Church, Frank Wilczek, Francesca Rossi, Eric Gastfriend, Erik Brynjolfsson, Elon Musk, Daniel R. Miller, Christof Koch, Chase Moores, Blake Pierson, Alan Guth, Alan Alda
Flowers Laboratory 63 Guillermo Valle, Masataka Sawayama, Eleni Nisioti, Cécile Mazon, Maxime Adolphe, Julius Taylor, Clément Moulin-Frier, Tristan Karch, Laetitia Teodorescu, Mayalen Etcheverry, Benjamin Clément, Grgur Kovac, Hélène Sauzéon, Nathalie Robin, Catherine Cattaert-Megrat, Timothée Lesort, Hugo Caselles-Dupré, Alexander Ten, Remy Portelas, Cédric Colas, Alvaro Ovalle Castaneda, Anna-Lisa Vollmer, Stéphanie Noirpoudre, Loïc Dauphin, Florian Golemo, Sébastien Forestier, William Schueller, Cem Karaoguz, Clément Masson, Adrien Matricon, Nicolas Rabault, Matthieu Lapeyre, Pierre Rouanet, Nicolas Jahier, Alexandre Gepperth, Céline Craye, Alexandra Delmas, Gennaro Raiola, Baptiste Busch, Panagiotis Papadakis, Yoan Mollard, Thibaut Munzer, Didier Roy, Freek Stulp, Thomas Degris, Jonathan Grizou, Guillaume Duceux, Louis-Charles Caron, Manuel Lopes, Olivier Mangin, Natalia Lyubova, Olivier Ly, Fabien Benureau, Paul Fudal, Thomas Cederborg, David Filliat, Adrien Baranes, Jérome Béchu, Pierre-Yves Oudeyer, Damien Caselli, Théo Segonds, Natalia Diaz Rodriguez (this list is partial)
AI for Good Foundation 52 Volodymyr Goshylyk, Elizabeth Taylor, Lacey Hunter, Mamdouh Alqudsi, Marcos Tidball, Bessie O'dell, Sajeda Amro, Raeda Zleik, Catherine Li, Ralf Bremer, Christine Cepelak, Chi Yun Chen, Pedro Gonçalves, Mariana Rufin, XiLin Choi, Kat Weideman, Randon Taylor, Angela Higley, Mark Minevich, Trisha Nicole Bautista, Sileshi Bedasie Hirko, Mario De Jesus, Tenzin Migmar, Eliot Frazier, Lindsey Asis, Blaz Novak, Anastassia Fedyk, Achim Rettinger, Adam Mincks, Ifejesu Ogunleye, Courtney Perales Reyes, Ruchir Sachdev, Tia Christopher, Pedro Siena Neto, Mitja Jermol, Matthew Grotenstein, Amir Banifatemi, Abe Hsuan, Andy Spezzatti, Johannes Erett, Charlotte Stanton, Vanessa S. Bradesko, Rayid Ghani, Marko Grobelnik, Estevam Rafael Hruschka Junior, Zaruhi Mkrtumyan, Stefano Pacifico, Gary Marcus, Damian Borth, Claudia Perlich, James Hodson, Michael Witbrock
FAR.AI 51 Jessica Lim, Edward Yee, Lilian Hughes, Chris Cundy, Taylor Boyle, Lindsay Murachver, Jeremy Rich, Saad Siddiqui, Oskar Hollinsworth, Anastasiia Gaidashenko, Philip Quirke, Brendan Murphy, Isabella Duan, Ian McKenzie, Dillon Bowen, Michał Zając, Siao Si Looi, Chris MacLeod, Aaron Tucker, Claudia Shi, Moritz von Knebel, Tony Wang, Fynn Heide, Kellin Pelrine, Ben Goldhaber, Lev McKinney, Adrià Garriga-Alonso, Pablo Moreno, Tomasz Korbak, Pedro Freire, Nora Belrose, Jérémy Scheurer, Juan Rocamonde, Alex Tamkin, Niki Howe, Ethan Perez, ChengCheng Tan, Adam Gleave, Nino Scherrer, Sawyer Bernath, Karl Berzins, Hannah Betts, Edmund Mills, Josh Jacobson, Alyse Spiehler, Tom Tseng, Mohammad Taufeeque, Joseph Miller, Euan McLean, Lawrence Chan, Scott Emmons
Leverhulme Centre for the Future of Intelligence 46 Kofi Yeboah, Irene Pellegero Querol, Henry Shevlin, Flavia Saxler, Malak Sadek, Niall Donnelly, Toshie Takahashi, Matthijs M. Maas, Haydn Belfield, Rafael A. Calvo, Carla Zoe Cremer, Zoubin Ghahramani, Thomas D. Grant, Tameem Adel, Susan Gowans, Stuart Russell, Stephen John, Stephen Cave, Seán Ó hÉigeartaigh, Sarah Dillon, Rune Nyrup, Philip Pettit, Nick Bostrom, Neil Lawrence, Murray Shanahan, Michael A. Osborne, Martin Rees, Marta Halina, Margaret Boden, Manuela M. Veloso, Lucy Cheke, Kay Firth-Butterfield, Karina Vold, Kanta Dihal, José Hernández-Orallo, Huw Price, Heather Roff, Francesca Rossi, Demis Hassabis, David Runciman, Beth Singler, Anna Alexandrova, Andrew Snyder-Beattie, Alison Gopnik, Alan Winfield, Adrian Weller
Center for Applied Rationality 43 Tara Mac Aulay, Maria Eduarda Rodrigues Sampaio, Arsalaan Alam, Kyle Scott, Logan Brienne Strohl, Kathryn Schmiedicke, Xavier Prospero, Brienne Strohl, Dan Keys, Luke Raskopf, Duncan Sabien, Adom Hartell, Tsvi Benson-Tilsen, Timothy Telleen-Lawton, Stephanie Zolayvar, Qiaochu Yuan, Matthew Graves, Jordan Tyrrell, Eric Rogstad, Elizabeth Garrett, Ben Goldhaber, Adam Scholl, Jack Carroll, Eli Tyre, Michael Keenan, Michael Blume, Lauren Lee, Jesse Liptrap, Cat Lavigne, Ben Sancetta, Gina Stuessy, Pete Michaud, Eric Chisholm, Daniel Colson, Davis Kingsley, Michael Smith, Oliver Habryka, Andrew Critch, Leah Libresco, Kenzi Amodei, Julia Galef, Gwern Branwen, Anna Salamon
Effective Altruism Foundation 40 Jia Yuan Loke, Daniel Kokotajlo, Paul Knott, Mojmír Stehlík, Michael Aird, Julian Stastny, Jesse Clifton, Hadrien Pouget, Eric Chen, Emery Cooper, Anthony DiGiovanni, Ali Merali, Maxime Riché, Alexander Lyzhov, Amrit Sidhu-Brar, Linh Chi Nguyen, Ulla Wessels, Olle Häggström, Ole Martin Moen, Lucius Caviola, David Pearce, Daniel Rüthemann, Adrian Hutter, Stefan Torges, Stefan Klein, Sascha Fink, Sarah Dörpinghaus, Rajshri Jayaraman, Melinda Lohmann, Lukas Gloor, Klaus Wälde, Dina Pomeranz, Brian Tomasik, Anni Leskelä, Thomas Metzinger, Persis Eskander, Ozy Brennan, Johannes Treutlein, David Althaus, Max Daniel
Global Catastrophic Risk Institute 37 Anthony M. Barrett, Dakota Norris, Allan Suresh, Uliana Certan, Andrea Owe, Kyle L. Evanoff, McKenna Fitzgerald, Oliver Couttolenc, Jared Brown, Robert de Neufville, John Garrick, Seán Ó hÉigeartaigh, Adam Scholl, Marilyn Cotrich, Lena Wang, Jenny Mith, Matthijs Maas, Jessica Cianci, Trevor White, Roman Yampolskiy, Gary Ackerman, Caroline Zaw-Mon, Dave Denkenberger, David Denkenberger, Arden Rowell, Jianhua Xu, U. Tuncay Alparslan, Steven Umbrello, Jacob Haqq-Misra, Mark Fusco, Kaitlin Butler, Grant Wilson, Tim Maher, Matt Moretto, Kelly Hostetler, Tony Barrett, Seth Baum
Model Evaluation and Threat Research 36 Neev Parikh, David Rein, Amritanshu Prasad, Michael Chen, Nikola Jurkovic, Joshua Clymer, Ben West, Emma Abele, Maksym Taran, Kit Harris, Ryan Bloom, Francisco Carvalho, Arjun Khandelwal, Martin Milbradt, Sudarsh K, Thomas Broadley, Chris Painter, Aric Floyd, Amy Ngo, Max Hasin, Amanda (Rae) She, Kyle Scott, Lucas Sato, Brian Goodrich, Hjalmar Wijk, Elizabeth Barnes, Yoshua Bengio, Holden Karnofsky, Adam Gleave, Rebecca Baron, Rae She, Kris Chari, Tao Lin, Sami Jawhar, Megan Kinniment, Kathy Garcia
GoodAI 34 Karolína H., Ryan Camilleri, Jose Solorzano, Alex Angelini, Dominik Čech, Sarka Krejcova, David Castillo, Stephanie Wendler, Reham Bukhari, Šimon Šicko, Lucia Šicková, Nicholas Guttenberg, Viktorie Knezkova, Steffen Eichler, Isabeau Premont-Schwarz, Filip Hauptfleisch, Petr Sramek, Jan Štafa, Michal Dvořák, Christine Lee, Will Millership, Lucie Krestova, Marek Havrda, Jan Feyereisl, Olga Afanasjeva, Martin Poliak, Marek Rosa, Simon Andersson, Přemek Paška, Jaroslav Vitku, Shantesh Patil, Petr Hlubuček, Joseph Davidson, Ege Atici
Center for AI Safety 32 Rebecca Rothwell, Ayush Panda, Isabelle Barrass, Zifan Wang, Matthias Hein, David Bau, Long Phan, Ayham Al-Saffar, Xuwang Yin, Aidan O'Gara, Suryansh Mehta, Corin Katzke, Marc Carauleanu, Max Kaufmann, David Lambert, Sidney Hough, Scott Emmons, Michael Chen, Mantas Mazeika, Kevin Liu, Jun Shern Chan, Dan Hendrycks, Andy Zou, Nathaniel Li, Joshua Clymer, Anders Edson, Alex Pan, Madhav Malhotra, Steven Basart, Rune Kvist, Oliver Zhang, Thomas Woodside
Fund for Alignment Research 30 Oskar Hollinsworth, Philip Quirke, Lindsay Murachver, Saad Siddiqui, Anastasiia Gaidashenko, Vael Gates, Siao Si Looi, Isabella Duan, Conor McGurk, Moritz von Knebel, Fynn Heide, Ben Goldhaber, Adrià Garriga-Alonso, Niki Howe, Adam Gleave, Sawyer Bernath, Lawrence Chan, Kellin Pelrine, Karl Berzins, Mohammad Taufeeque, Hannah Betts, Tom Tseng, Nora Belrose, Tomasz Korbak, Scott Emmons, Jun Shern Chan, Jérémy Scheurer, Ian McKenzie, Ethan Perez, Claudia Shi
Ought 29 Sarah Park, Adrian Smith, Luke Stebbing, Charlie George, James Brady, Ian McKenzie, Justin Reppert, Eli Lifland, Amanda Ngo, Aparna Ashok, Jungwon Byun, Paul Christiano, Ozzie Gooen, Owain Evans, Neal Jean, Milan Griffes, Girish Sastry, Chris Cundy, Ben Weinstein-Raun, Ben Goldhaber, Andrew Schreiber, Ben Rachbach, Zachary Miller, Zac Kenton, Tom McGrath, Noah Goodman, Ben West, Ryan Carey, Andreas Stuhlmüller
AI Safety Camp 27 Kristi Uustalu, Sebastian Kosch, Nix Goldowsky-Dill, JJ Hepburn, Cynthia Yoon, Colin Bested, Andrew Player, Tomáš Gavenčiak, Sabrina Kavanagh, Fabian Steuer, David Lindner, Brandon Perry, Tom McGrath, Nandi Schoots, Markus Salmela, Maia Pasek, Linda Linsefors, Karol Kubicki, David Kristoffersson, Remmelt Ellen, Jessica Cooper, Kristina Nemcova, Jirí Nadvorník, Anne Wissemann, Jan Kulveit, Johannes Heidecke, Sai Joseph
ContinualAI 27 James Smith, Andrea Cossu, Martin Mundt, Tyler Hayes, Akshita Gupta, Alec Diallo, Ayşin Sancı, Ghada Sokar, Bing Liu, Joost van de Weijer, Christopher Kanan, Tinne Tuytelaars, Irina Rish, Itamar Arel, Subutai Ahmad, Massimiliano Versace, Razvan Pascanu, David Lopez Paz, Eugenio Culurciello, Marc Pickett, Xu Ji, Timothée Lesort, Natalia Díaz Rodríguez, Davide Maltoni, German I. Parisi, Keiland Cooper, Vincenzo Lomonaco
Berkeley Existential Risk Initiative 25 Elizabeth Cooper, Kyle Scott, Sofia Davis-Fogel, James Paul Gonzales, Jess Riedel, Andrew Critch, Sawyer Bernath, Stuart Russell, Alex Flint, Josh Jacobson, Sam Bankman-Fried, Matt Fallshaw, Colleen Gleason, Jeremy Schlatter, Jaan Tallinn, Rebecca Raible, Kenzi Amodei, Jacob Tsimerman, Qiaochu Yuan, Eric Rogstad, Andrew Snyder-Beattie, Michael Keenan, Gina Stuessy, Seán Ó hÉigeartaigh, Malo Bourgon
Redwood Research 25 Guilhermo Cutrim Costa, Tyler Storlie, Luke Sallmen, Cienna Rominger, Ryan Greenblatt, Noa Nabeshima, Tao Lin, Peter Schmidt-Nielsen, Daniel Ziegler, Aqeel Ali, Ben Weinstein-Raun, Paul Christiano, Owen Cotton-Barratt, Malo Bourgon, Holden Karnofsky, James Bregan, Claire Zabel, Blake Borgeson, Bill Zito, Ajeya Cotra, Adam Scherlis, Seraphina Nix, Royston Noronha, Buck Shlegeris, Nathaniel Thomas
General AI Challenge 24 Tomas Mikolov, Tak Lo, Ryota Kanai, Roman Yampolskiy, Rodolfo Rosini, Pavel Kordik, Ling Ge, Julian Togelius, José Hernández-Orallo, Jan Romportl, Ivan Zelinka, Ayako Fukui, Alison Lowndes, Will Millership, Olga Afanasjeva, Virginia Dignum, Miles Brundage, Marek Rosa, Marek Havrda, Jan Sekerka, Jan Pospíšil, Irakli Beridze, Frank Dignum, Danit Gal
Conjecture 23 Mihir Rege, Jan Michelfeit, Maris Sala, Beren Millidge, Daniel Braun, Adam Shimi, Myriame Honnay, Lee Sharkey, Katrina Joslin, Jonathan Low, Janko Prester, Carlos Guevara, Caelum Forder, Rachel Stockton, Andrea Miotti, Gabriel Alfour, Sid Black, Laria Reynolds, Kip Parker, Jacob Merizian, Chris Scammell, Kyle McDonell, Connor Leahy
AI Impacts 20 Aysja Johnson, Jeffrey Heninger, Zach Stein-Perlman, Harlan Stewart, Jimmy Rintjema, Richard Korzekwa, Ronny Fernandez, Katja Grace, Daniel Kokotajlo, Asya Bergal, Ronja Lutz, Tegan McCaslin, Paul Christiano, Ben Hoffman, Justis Mills, Connor Flexman, Finan Adamson, Michael Wulfsohn, John Salvatier, Stephanie Zolayvar
Alignment Research Center 20 Rae She, Rebecca Baron, Quentin Feuillade-Montixi, Luke Miles, Megan Kinniment, Aryan Bhatt, Lawrence Chan, Timothy Kokotajlo, Josh Jacobson, Ben Hoskin, Buck Shlegeris, Paul Christiano, Kris Chari, Kyle Scott, George Robinson, Victor Lecomte, Dávid Matolcsi, Eric Neyman, Jacob Hilton, Mark Xu
Epoch 19 Virginia Blanton, Josh You, Daniela Amodei, Ben Cottier, Keith Wynroe, Jenny Xiao, David Atkinson, Maria da Lama, Matthew Barnett, Ege Erdil, Anson Ho, Tom Davidson, Tamay Besiroglu, Pablo Villalobos, Neil Thompson, Marius Hobbhahn, Lennart Heim, Jaime Sevilla, Eduardo Infante-Roldán
Stanford University 17 Siddharth Karamcheti, Peter Henderson, Pratyusha Kalluri, Dorsa Sadigh, Stanislav Fort, Cody Coleman, Stefano Ermon, Michael Webb, Percy Liang, Alex Aiken, Jacob Steinhardt, Noah D. Goodman, Andreas Stuhlmüller, Aditi Raghunathan, Alex Tamkin, Thomas Icard, Ray Briggs
Cooperative AI Foundation 15 Natasha Jaques, Rebecca Eddington, David Norman, Cecilia Elena Tilli, Michelle Virgo, Akbir Khan, Vincent Conitzer, Lewis Hammond, Jesse Clifton, Ruairi Donnelly, Gillian Hadfield, Eric Horvitz, Dario Amodei, Allan Dafoe, Amrit Sidhu-Brar
Nonlinear 15 Luca De Leo, Matt Putz, Deena Englander, Aaron Bergman, Emerson Spartz, Tristan Cook, Peter Barnett, Daniel del Castillo, Chris Leong, Corey Wood, Kat Woods, Spencer Greenberg, Robert Miles, David Moss, Alex Zhu
University of California, Berkeley 15 Dan Hendrycks, Max Simchowitz, Andrew Critch, Tom Kalil, Qiaochu Yuan, Stuart Russell, Pieter Abbeel, Smitha Milli, Dylan Hadfield-Menell, Anca Dragan, Sergey Levine, Paul Christiano, Michael Janner, Frances Ding, Lydia T. Liu
Lightcone Infrastructure 13 Robert Mushkatblat, Rafe Kennedy, Jacob Lagerros, Ruben Bloom, Raymond Arnold, Kaj Sotala, Matthew Graves, Harmanas Chopra, Eric Rogstad, Ben Albert Pace, Oliver Habryka, James Babcock, Elizabeth Van Nostrand
University of Oxford 10 Charlie Rogers-Smith, Mrinank Sharma, Allan Dafoe, Ruth Fong, Chris Maddison, Owain Evans, Michael Wooldridge, Aidan Gomez, Nick Bostrom, Heather Roff
Whole Brain Architecture Initiative 9 Koji Morikawa, Hideyuki Nakashima, Hiroyuki Morikawa, Masaru Tomita, Kitano Hiroaki, Kenji Doya, Koichi Takahashi, Yutaka Matsuo, Hiroshi Yamakawa
Road to AI Safety Excellence 8 Remmelt Ellen, Trent Fowler, Erik Istre, Rupert McCallum, Robert Miles, Johannes Heidecke, Veerle de Goederen, Toon Alfrink
Theiss Research 8 Karina Torres Castro, Rebecca Bone, Rich Lew, Soraya Bernal, Sebastian Engmann, Brian Nablo, Rodrigo Duran, Jack Glover
7 Joe Collman, Angela P., Anand Srinivasan, Orpheus Lummis, Stag Lynn, Alexander Gietelink Oldenziel, Tegan McCaslin
Australian National University 7 Elliot Catt, Tom Everitt, Jan Leike, Marcus Hutter, Jarryd Martin, Gary Lea, Alan Hájek
Palisade Research 7 Timothee Chauvin, Simon Lermen, Pranav Gade, Kyle Scott, Karina Belokapov, Jeffrey Ladish, Charlie Rogers-Smith
Foundational Research Institute 6 Brian Tomasik, Max Daniel, Kaj Sotala, Caspar Oesterheld, Lukas Gloor, Tobias Baumann
Google Brain 6 Jeremy Nixon, Melody Guan, Tom Brown, Dan Mané, Dario Amodei, Christopher Olah
University of Cambridge 6 Richard Ngo, Adrian Weller, Ramana Kumar, Arif Ahmed, Huw Price, Yang Liu
Association for Long Term Existence and Resilience 5 Vanessa Kosoy, Joshua Fox, Gidon Kadosh, Edo Arad, David Manheim
Carnegie Mellon University 5 Leqi Liu, Noam Brown, Manuela Veloso, Andre Platzer, David Danks
Convergence Analysis 5 Ozzie Gooen, Claire Abu-Assal, Kristian Rönn, Andrew X Stewart, Justin Shovelain
EthicsNet 4 Aleksandra Orchowska, Remco Bloemen, Anish Mohammed, Nell Watson
Massachusetts Institute of Technology 4 Joshua Brett Tenenbaum, Jon Gauthier, Julius Adebayo, Andrew Ilyas
Montreal Institute for Learning Algorithms 4 Zac Kenton, Doina Precup, Joelle Pineau, Yoshua Bengio
Center for AI Policy 3 Jason Green-Lowe, Thomas Larsen, Jakub Kraus
Centre for Effective Altruism 3 Johannes Treutlein, Ales Flidr, Owen Cotton-Barratt
Cornell University 3 Bart Selman, Jim Babcock, Joseph Halpern
eCortex 3 Randall C. O’Reilly, Seth J. Herd, David J. Jilk
Foresight Institute 3 Christine Peterson, Mark S. Miller, Allison Duettmann
IDSIA 3 Bas R. Steunebrink, Jürgen Schmidhuber, Mark Ring
Open Philanthropy 3 Paul Christiano, Jacob Steinhardt, Dario Amodei
SUPSI 3 Jürgen Schmidhuber, Bas R. Steunebrink, Mark Ring
Università della Svizzera italiana 3 Jürgen Schmidhuber, Bas R. Steunebrink, Mark Ring
University of Toronto 3 Dami Choi, Roger Grosse, Joshua Gans
Yale University 3 Wendell Wallach, Allan Dafoe, Daniel Eth
AIDEUS 2 Sergey Rodionov, Alexey Potapov
Arizona State University 2 Heather Roff, Miles Brundage
Center for a New American Security 2 Gregory C. Allen, Paul Scharre
Center for Human Success 2 Wyatt Tessari, David Yu
Encultured AI 2 Andrew Critch, Nick Hay
Endgame 2 Hyrum Anderson, Bobby Filar
Google 2 Marcello Herreshoff, Vladimir Slepnev
Learning Intelligent Distribution Agent 2 Tamas Madl, Stan Franklin
Linköping University 2 Mikael Böörs, Tobias Wängberg
London School of Economics 2 Katie Steele, Wlodek Rabinowicz
Ludwig Maximilian University of Munich 2 Stephan Hartmann, Reuben Stern
Oregon State University 2 Alex Turner, Thomas Dietterich
UCLA School of Law 2 Richard Re, Edward Parson
University College London 2 John Shawe-Taylor, Tobias Baumann
University of Michigan 2 Michael Wellman, James M. Joyce
University of Wisconsin–Madison 2 Reuben Stern, Patrick LaVictoire
Aalto University 1 Jelena Luketina
AgroParisTech 1 Laurent Orseau
Aix-Marseille University 1 Sergey Rodionov
American University 1 Thomas Zeitzoff
Bar-Ilan University 1 Ram Rachum
Birkbeck, University of London 1 Ulrike Hahn
Broad Institute of MIT and Harvard 1 Gopal Sarma
Brown University 1 David Abel
California Institute of Technology 1 Frederick Eberhardt
Carleton University 1 Andrew MacFie
Center for Analysis & Design of Intelligent Agents 1 Kristinn R. Thórisson
CogPrime 1 Ben Goertzel
Czech Technical University 1 Vojtěch Kovařík
Data61 1 Ramana Kumar
Duke University 1 Vincent Conitzer
Electronic Frontier Foundation 1 Peter Eckersley
ETH Zurich 1 Felix Berkenkamp
George Mason University 1 Robin Hanson
Georgia Institute of Technology 1 Fuxin Li
Global Politics of Artificial Intelligence Research Group at Yale University and University of Oxford 1 Matthijs Maas
Hague Centre for Strategic Studies 1 Matthijs Maas
Harvard University 1 David Parkes
Icelandic Institute for Intelligent Machines 1 Kristinn R. Thórisson
Information Society Project 1 Rebecca Crootof
INRA 1 Laurent Orseau
Institute for Future Studies 1 H. Orri Stefánsson
Institute for Theoretical Studies at ETH Zurich 1 Will Sawin
Institute of Ethics and Emerging Technologies 1 Steven Umbrello
Internet Archive 1 Brewster Kahle
ITMO University 1 Alexey Potapov
Legal Priorities Project 1 Nick Hollman
Lingnan University 1 Jiji Zhang
Moscow Institute of Physics and Technology 1 Vladimir Shakirov
Munich Center for Mathematical Philosophy 1 Catrin Campbell-Moore
Nanyang Technological University 1 Preston Greene
NARS 1 Pei Wang
National Bureau of Economic Research 1 Joshua Gans
New America Foundation 1 Heather Roff
New York University 1 Ethan Perez
Nilcons 1 Mihaly Barasz
NNAISENSE 1 Bas R. Steunebrink
organization 1 Robert Sandler
Oxford University 1 Joar Skalse
Phenomenological AI Safety Research Institute 1 G Gordon Worley III
Princeton University 1 Will Sawin
Quebec Artificial Intelligence Institute 1 Vincent Luczkow
Quixey 1 Patrick LaVictoire
Real AI 1 Jonathan Yan
Rice University 1 Moshe Vardi
Self-Aware Systems 1 Steve Omohundro
Smith College 1 James Miller
Social & Environmental Entrepreneurs 1 Seth Baum
Sorbonne University 1 Michaël Trazzi
Susaro 1 Richard Loosemore
Teesside University 1 The Anh Han
Texas A&M University 1 Kenny Easwaran
The Australian National University 1 Michael Cohen
The Consortium on the Landscape of AI Safety 1 Alexis Carlier
The New School 1 Peter Asaro
Ulm University 1 Daniel Alexander Braun
Université de Montréal 1 Jelena Luketina
University of Alberta 1 Tor Lattimore
University of Amsterdam 1 Dmitrii Krasheninnikov
University of Arizona 1 Jenann Ismael
University of Bath 1 Joanna Bryson
University of Bristol 1 Benya Fallenstein
University of Colorado 1 Seth Herd
University of Colorado Boulder 1 Randall C. O’Reilly
University of Copenhagen 1 Matthijs Maas
University of Edinburgh 1 Angelo Frank De Bellis
University of Illinois at Chicago 1 Brian Ziebart
University of Louisville 1 Roman Yampolskiy
University of Melbourne 1 Benjamin Rubinstein
University of Montreal 1 Janos Kramar
University of New Hampshire 1 Andrew Ware
University of Padova 1 Francesca Rossi
University of Southern California 1 Stephen J. Read
University of Texas 1 Peter Stone
University of Washington 1 Daniel Weld
Washington University in St. Louis 1 Julia Haas

Individuals not affiliated with any organization

Showing 17 people.

Name Website Source
Wei Dai http://www.weidai.com/ [1], [2]
Iceman [3], [4], [5], [6], [7]
Max Harms http://raelifin.com/ [8], [7]
Jeff Kaufman https://www.jefftk.com [9], [7]
Federico Pistono http://federicopistono.org/ [10]
Chris Pasek [11], [12]
Sune Kristian Jakobsen
Hilary Greaves [13]
Sophie-Charlotte Fischer [14]
Alexey Turchin https://avturchin.livejournal.com/ [15], [16], [17], [18]
Dustin Juliano http://dustinjuliano.com/ [19], [20]
Matteo Turchetta [21]
Angela P. Schoellig [21]
Andreas Krause [21]
Jim O’Neill [22]
Gordon Irlam http://www.gordoni.com/ [23]
John Maxwell [24]

Products

This section lists AI safety-related “products”: interactive tools, websites, flowcharts, datasets, etc. Unlike documents, products tend to be interactive, are updated continually, or require inputs from the consumer.

Showing 33 products.

Name Type Creator Creation date Description
Clarifying some key hypotheses in AI alignment diagram Ben Cottier, Rohin Shah 2019-08-15 A diagram collecting several hypotheses in AI alignment and their relationships to existing research agendas.
AI Alignment Forum blog LessWrong 2.0 2018-07-10 A group blog for discussion of technical aspects of AI alignment. The forum is built using the same software as LessWrong 2.0, and is integrated with LessWrong 2.0. For creation date, see [25].
AI Safety Research Camp workshop Tom McGrath, Remmelt Ellen, Linda Linsefors, Nandi Schoots, David Kristoffersson, Chris Pasek 2018-02-01 A research camp to take place in Gran Canaria in April 2018 and in the United Kingdom in July–August 2018. Facebook group at [26]. The creation date is the date of announcement on LessWrong 2.0.
“Levels of defense” in AI safety flowchart Alexey Turchin 2017-12-12 A flowchart applying multilevel defense to AI safety. There is an accompanying post on LessWrong at [27].
AI Alignment Prize contest Zvi Mowshowitz, Vladimir Slepnev, Paul Christiano 2017-11-03 A prize for work that advances understanding in alignment of smarter-than-human artificial intelligence. Winners for the first round, as well as announcement of the second round, can be found at [24]. Winners for the second round, as well as announcement of the third round, can be found at [28].
AI Watch interactive application Issa Rice 2017-10-23 A website to track people and organizations working on AI safety.
AI Safety Open Discussion discussion group Mati Roy 2017-10-23 A Facebook discussion group about AI safety. This is an open group.
AI safety resources list Victoria Krakovna 2017-10-01 A list of resources for long-term AI safety. Seems to have been first announced at [29].
Map of the AI Safety Community graphic Søren Elverlin 2017-09-26 A pictorial map that lists organizations and individuals in the AI safety community.
Open Philanthropy Project AI Fellows Program fellowship Open Philanthropy 2017-09-12 A fellowship to support PhD students in AI and machine learning. For the creation date, see [30].
LessWrong 2.0 blog LessWrong 2.0 2017-06-18 A community blog about rationality, decision theory, AI, the rationality community, and other topics relevant to AI safety. This is a re-launch/modernization of the original LessWrong. For the launch date, the date of the welcome post [31] is used.
Road to AI Safety Excellence course Toon Alfrink 2017-06-15 A proposed course that is designed to produce AI safety researchers. It used to be called “Accelerating AI Safety Adoption in Academia” and was announced on LessWrong at [32]. The Facebook group was created on 2017-06-30 [33].
Annotated bibliography of recommended materials list Center for Human-Compatible AI 2016-12-01 An annotated and interactive bibliography of AI safety-related course materials, textbooks, videos, papers, etc.
Extinction Risk from Artificial Intelligence blog Michael Cohen 2016-06-01 A series of pages exploring arguments for and against working on AI safety. The creation date is inferred from the URLs of images (example: [34]).
AI Alignment blog Paul Christiano 2016-05-28 Paul Christiano’s blog about AI alignment.
AISafety.com Reading Group discussion group Søren Elverlin, Erik B. Jacobsen, Volkan Erdogan 2016-05-24 A weekly reading group covering topics in AI safety.
Cause prioritization app interactive application Michael Dickens, Buck Shlegeris 2016-05-18 An interactive app for quantitative cause prioritization. The app includes a section [35] on AI safety intervention. The creation date is the date of the first commit in the Git repository [36].
Arbital AI alignment domain wiki Arbital, Eliezer Yudkowsky 2016-03-04 A collection of wiki-like pages on topics in AI alignment. The creation date is the date of the launch announcement for Arbital [37]; it’s unclear when the AI alignment domain itself was created.
Introductory resources on AI safety research list Victoria Krakovna 2016-02-28 A list of readings on long-term AI safety. Mirrored at [38]. There is an updated list at [39].
AI Safety Discussion discussion group Victoria Krakovna 2016-02-21 A Facebook discussion group about AI safety. This is a closed group, so one needs to request access to see posts.
Reinforce.js implementation of Stuart Armstrong’s toy control problem interactive application Gwern Branwen, FeepingCreature 2016-02-03 A live demo of Stuart Armstrong’s toy control problem [40]. Gwern introduced the demo in a LessWrong comment [41].
AI Policies Wiki wiki Gordon Irlam 2015-12-14 A wiki on AI policy. The wiki creation date can be seen in the revision history of the main page [42].
The Control Problem discussion group CyberPersona 2015-08-29 A subreddit about AI safety and control. For the subreddit creation date, see [43].
AGI Failures Modes and Levels map flowchart Alexey Turchin 2015-01-01 A flowchart about failure modes of artificial general intelligence, grouped by the stage of development. There is an accompanying post on LessWrong at [18].
AGI Safety Solutions Map flowchart Alexey Turchin 2015-01-01 A flowchart on potential solutions to AI safety. There is an accompanying post on LessWrong at [44].
Intelligent Agent Foundations Forum discussion group Machine Intelligence Research Institute 2014-11-04 A forum for technical AI safety research. The source code is hosted on GitHub [45]. The timestamp on the introductory post [46] gives the launch date.
A flowchart of AI safety considerations flowchart Eliezer Yudkowsky 2014-11-02 The flowchart was posted to Eliezer Yudkowsky’s Essays (a Facebook group) and has no title.
Effective Altruism Forum blog Centre for Effective Altruism, Rethink Charity, Ryan Carey 2014-09-10 A community blog about effective altruism that often has posts about AI safety. The forum was announced on LessWrong by Ryan Carey [47].
How to study superintelligence strategy list Luke Muehlhauser 2014-07-03 A list of project ideas in superintelligence strategy.
Ordinary Ideas blog Paul Christiano 2011-12-21 Paul Christiano’s blog about “weird AI stuff” [48].
The Uncertain Future interactive application Machine Intelligence Research Institute 2009-10-01 A tool to model future technology and its effect on civilization. For more about the history of the site, see [49].
LessWrong Wiki wiki Machine Intelligence Research Institute 2009-03-12 A companion wiki to the community blog LessWrong. The wiki has pages about AI safety.
LessWrong blog Machine Intelligence Research Institute 2009-02-01 A community blog about rationality, decision theory, AI, and updates to MIRI, among other topics.