AI Watch

Welcome! This is a website to track people and organizations working on AI safety. See the code repository for the source code and data of this website.

This website is developed by Issa Rice and is partially funded by Vipul Naik.

This site is still under active development.

Last updated on 2018-04-08.

Table of contents

AI safety relation by subject

Note: as shown by the large number of “unknown” values, most of the positions haven’t been categorized by relation/subject so this table will only be useful in the future.

Subject UnknownAGI organizationGCR organizationposition Total
Unknown 385 107 18 126 636
background 0 0 0 12 12
general 0 0 1 2 3
grant investigation 0 0 0 3 3
policy 0 0 0 1 1
popularization 0 0 0 1 1
scientific advising 0 0 0 4 4
software engineering 0 0 0 3 3
strategy 0 0 0 1 1
technical research 0 0 0 14 14
Total 385 107 19 167 678

Positions summary by year

Note: as shown by the large number of “unknown” values, most of the positions haven’t been categorized by start/end dates so this table will only be useful in the future.

Year Start date End date Start date lower guess Start date upper guess End date lower guess End date upper guess
Unknown 539 659 672 540 541 676
2008 1 0 0 0 0 0
2010 3 0 0 3 2 0
2011 4 0 0 8 5 0
2012 3 0 0 2 6 0
2013 3 1 0 2 0 0
2014 9 1 0 0 0 0
2015 22 3 1 28 27 0
2016 46 1 2 4 2 0
2017 43 11 3 91 94 2
2018 5 2 0 0 1 0

Positions grouped by person

Showing 493 people with positions.

Name Number of organizations List of organizations
Paul Christiano 7 AI Impacts, Future of Humanity Institute, Machine Intelligence Research Institute, Open Philanthropy Project, OpenAI, Theiss Research, University of California, Berkeley
Stuart Russell 7 Berkeley Existential Risk Initiative, Center for Human-Compatible AI, Centre for the Study of Existential Risk, Future of Life Institute, Leverhulme Centre for the Future of Intelligence, Machine Intelligence Research Institute, University of California, Berkeley
Nick Bostrom 6 Centre for the Study of Existential Risk, Future of Humanity Institute, Future of Life Institute, Google DeepMind, Leverhulme Centre for the Future of Intelligence, Machine Intelligence Research Institute
Heather Roff 5 Arizona State University, Leverhulme Centre for the Future of Intelligence, New America Foundation, University of Denver, University of Oxford
Andrew Critch 4 Berkeley Existential Risk Initiative, Center for Human-Compatible AI, Machine Intelligence Research Institute, University of California, Berkeley
Bas R. Steunebrink 4 IDSIA, NNAISENSE, SUPSI, Università della Svizzera italiana
Daniel Dewey 4 Future of Humanity Institute, Future of Life Institute, Machine Intelligence Research Institute, Open Philanthropy Project
Jaan Tallinn 4 Berkeley Existential Risk Initiative, Centre for the Study of Existential Risk, Future of Life Institute, Machine Intelligence Research Institute
Jan Leike 4 Australian National University, Future of Humanity Institute, Google DeepMind, Machine Intelligence Research Institute
Matthijs Maas 4 Global Catastrophic Risk Institute, Global Politics of Artificial Intelligence Research Group at Yale University and University of Oxford, Hague Centre for Strategic Studies, University of Copenhagen
Seán Ó hÉigeartaigh 4 Berkeley Existential Risk Initiative, Centre for the Study of Existential Risk, Future of Humanity Institute, Leverhulme Centre for the Future of Intelligence
Adrian Weller 3 Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence, University of Cambridge
Andrew Snyder-Beattie 3 Berkeley Existential Risk Initiative, Future of Humanity Institute, Leverhulme Centre for the Future of Intelligence
Bart Selman 3 Center for Human-Compatible AI, Cornell University, Machine Intelligence Research Institute
Dario Amodei 3 Google Brain, Open Philanthropy Project, OpenAI
Francesca Rossi 3 Future of Life Institute, Leverhulme Centre for the Future of Intelligence, University of Padova
Huw Price 3 Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence, University of Cambridge
Jürgen Schmidhuber 3 IDSIA, SUPSI, Università della Svizzera italiana
Kaj Sotala 3 Foundational Research Institute, Machine Intelligence Research Institute, Theiss Research
Katja Grace 3 AI Impacts, Future of Humanity Institute, Machine Intelligence Research Institute
Laurent Orseau 3 AgroParisTech, Google DeepMind, INRA
Mark Ring 3 IDSIA, SUPSI, Università della Svizzera italiana
Martin Rees 3 Centre for the Study of Existential Risk, Future of Life Institute, Leverhulme Centre for the Future of Intelligence
Max Tegmark 3 Centre for the Study of Existential Risk, Future of Life Institute, Machine Intelligence Research Institute
Patrick LaVictoire 3 Machine Intelligence Research Institute, Quixey, University of Wisconsin–Madison
Pieter Abbeel 3 Center for Human-Compatible AI, OpenAI, University of California, Berkeley
Ramana Kumar 3 Data61, Machine Intelligence Research Institute, University of Cambridge
Robin Hanson 3 Future of Humanity Institute, George Mason University, Machine Intelligence Research Institute
Roman Yampolskiy 3 Global Catastrophic Risk Institute, Machine Intelligence Research Institute, University of Louisville
Ryan Carey 3 Centre for the Study of Existential Risk, Future of Humanity Institute, Machine Intelligence Research Institute
Seth Baum 3 Centre for the Study of Existential Risk, Global Catastrophic Risk Institute, Social & Environmental Entrepreneurs
Alexey Potapov 2 AIDEUS, ITMO University
Alison Gopnik 2 Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence
Allan Dafoe 2 Future of Humanity Institute, Yale University
Anca Dragan 2 Center for Human-Compatible AI, University of California, Berkeley
Anna Salamon 2 Center for Applied Rationality, Machine Intelligence Research Institute
Ben Goertzel 2 CogPrime, Machine Intelligence Research Institute
Benya Fallenstein 2 Machine Intelligence Research Institute, University of Bristol
Beth Barnes 2 Center for Human-Compatible AI, Centre for the Study of Existential Risk
Carl Shulman 2 Future of Humanity Institute, Machine Intelligence Research Institute
Christine Peterson 2 Foresight Institute, Machine Intelligence Research Institute
Christopher Cundy 2 Center for Human-Compatible AI, Future of Humanity Institute
Christopher Olah 2 Google Brain, Open Philanthropy Project
David Abel 2 Brown University, Future of Humanity Institute
Demis Hassabis 2 Google DeepMind, Leverhulme Centre for the Future of Intelligence
Elon Musk 2 Centre for the Study of Existential Risk, Future of Life Institute
Holden Karnofsky 2 Open Philanthropy Project, OpenAI
Jacob Steinhardt 2 Open Philanthropy Project, Stanford University
Janos Kramar 2 Future of Life Institute, University of Montreal
Jelena Luketina 2 Aalto University, Université de Montréal
John Salvatier 2 AI Impacts, Future of Humanity Institute
Joseph Halpern 2 Center for Human-Compatible AI, Cornell University
Joshua Gans 2 National Bureau of Economic Research, University of Toronto
Kristinn R. Thórisson 2 Center for Analysis & Design of Intelligent Agents, Icelandic Institute for Intelligent Machines
Kyle Scott 2 Berkeley Existential Risk Initiative, Future of Humanity Institute
Malo Bourgon 2 Berkeley Existential Risk Initiative, Machine Intelligence Research Institute
Marcello Herreshoff 2 Google, Machine Intelligence Research Institute
Margaret Boden 2 Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence
Martina Kunz 2 Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence
Matthew Graves 2 Center for Applied Rationality, Machine Intelligence Research Institute
Max Daniel 2 Effective Altruism Foundation, Foundational Research Institute
Melody Guan 2 Future of Life Institute, Google Brain
Michael Keenan 2 Berkeley Existential Risk Initiative, Center for Applied Rationality
Michael Wellman 2 Center for Human-Compatible AI, University of Michigan
Mihaly Barasz 2 Machine Intelligence Research Institute, Nilcons
Miles Brundage 2 Arizona State University, Future of Humanity Institute
Murray Shanahan 2 Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence
Owain Evans 2 Future of Humanity Institute, University of Oxford
Qiaochu Yuan 2 Center for Applied Rationality, University of California, Berkeley
Randall C. O’Reilly 2 eCortex, University of Colorado Boulder
Reuben Stern 2 Ludwig Maximilian University of Munich, University of Wisconsin–Madison
Sergey Rodionov 2 AIDEUS, Aix-Marseille University
Smitha Milli 2 OpenAI, University of California, Berkeley
Stephanie Zolayvar 2 AI Impacts, Center for Applied Rationality
Stephen Hawking 2 Centre for the Study of Existential Risk, Future of Life Institute
Steve Omohundro 2 Machine Intelligence Research Institute, Self-Aware Systems
Stuart Armstrong 2 Future of Humanity Institute, Machine Intelligence Research Institute
Tom Brown 2 Google Brain, OpenAI
Tom Everitt 2 Australian National University, Google DeepMind
Tsvi Benson-Tilsen 2 Center for Applied Rationality, Machine Intelligence Research Institute
Victoria Krakovna 2 Future of Life Institute, Google DeepMind
Will Sawin 2 Institute for Theoretical Studies at ETH Zurich, Princeton University
Yang Liu 2 Centre for the Study of Existential Risk, University of Cambridge
Aaron Silverbook 1 Machine Intelligence Research Institute
Abram Demski 1 Machine Intelligence Research Institute
Adam Scholl 1 Center for Applied Rationality
Adom Hartell 1 Center for Applied Rationality
Adrien Matricon 1 FLOWERS
Alan Alda 1 Future of Life Institute
Alan Guth 1 Future of Life Institute
Alan Hájek 1 Australian National University
Alan Winfield 1 Leverhulme Centre for the Future of Intelligence
Alan Yan 1 Future of Life Institute
Alec Radford 1 OpenAI
Aleks Kamko 1 OpenAI
Aleksandra Orchowska 1 EthicsNet
Ales Flidr 1 Future of Life Institute
Alex Aiken 1 Stanford University
Alex Altair 1 Machine Intelligence Research Institute
Alex Nichol 1 OpenAI
Alex Ray 1 OpenAI
Alex Vermeer 1 Machine Intelligence Research Institute
Alexandra Delmas 1 FLOWERS
Alexandre Gepperth 1 FLOWERS
Allison Duettmann 1 Foresight Institute
Alvaro Ovalle Castaneda 1 FLOWERS
Amy Willey 1 Machine Intelligence Research Institute
Anders Sandberg 1 Future of Humanity Institute
Andre Platzer 1 Carnegie Mellon University
Andreas Stuhlmüller 1 Stanford University
Andrej Karpathy 1 OpenAI
Andrew Lapinski-Barker 1 Machine Intelligence Research Institute
Andrew Lefrancq 1 Google DeepMind
Andrew MacFie 1 Carleton University
Andrew Ware 1 University of New Hampshire
Andrew X Stewart 1 Convergence Analysis
Angelo Frank De Bellis 1 University of Edinburgh
Anish Athalye 1 OpenAI
Anish Mohammed 1 EthicsNet
Ankur Handa 1 OpenAI
Anna Alexandrova 1 Leverhulme Centre for the Future of Intelligence
Anna-Lisa Vollmer 1 FLOWERS
Anthony Aguirre 1 Future of Life Institute
Ariel Conn 1 Future of Life Institute
Arif Ahmed 1 University of Cambridge
Aviv Tamar 1 OpenAI
Baobao Zhang 1 Future of Humanity Institute
Baptiste Busch 1 FLOWERS
Ben Garfinkel 1 Future of Humanity Institute
Ben Goldhaber 1 Center for Applied Rationality
Ben Hoffman 1 AI Impacts
Ben Sancetta 1 Center for Applied Rationality
Benjamin Clément 1 FLOWERS
Benjamin Rubinstein 1 University of Melbourne
Beth Singler 1 Leverhulme Centre for the Future of Intelligence
Bill Hibbard 1 Machine Intelligence Research Institute
Blake Borgeson 1 Machine Intelligence Research Institute
Bob McGrew 1 OpenAI
Bobby Filar 1 Endgame
Bradly Stadie 1 OpenAI
Brewster Kahle 1 Internet Archive
Brian Tomasik 1 Foundational Research Institute
Brian Ziebart 1 University of Illinois at Chicago
Carrick Flynn 1 Future of Humanity Institute
Caspar Oesterheld 1 Foundational Research Institute
Catherine Olsson 1 OpenAI
Catrin Campbell-Moore 1 Munich Center for Mathematical Philosophy
Cecilia Tilli 1 Future of Humanity Institute
Céline Craye 1 FLOWERS
Cem Karaoguz 1 FLOWERS
Chase Moores 1 Future of Life Institute
Chelsea Guo 1 Future of Humanity Institute
Christiana Figueres 1 Google DeepMind
Christof Koch 1 Future of Life Institute
Christopher Berner 1 OpenAI
Christopher Hesse 1 OpenAI
Claire Abu-Assal 1 Convergence Analysis
Clare Lyle 1 Future of Humanity Institute
Colm Ó Riain 1 Machine Intelligence Research Institute
Connor Flexman 1 AI Impacts
Damien Caselli 1 FLOWERS
Dan Keys 1 Center for Applied Rationality
Dan Mané 1 Google Brain
Dana Scott 1 Centre for the Study of Existential Risk
Daniel Eth 1 Yale University
Daniel Filan 1 Future of Humanity Institute
Daniel R. Miller 1 Future of Life Institute
Daniel Weld 1 University of Washington
David Chalmers 1 Centre for the Study of Existential Risk
David Danks 1 Carnegie Mellon University
David Filliat 1 FLOWERS
David Hart 1 Machine Intelligence Research Institute
David J. Jilk 1 eCortex
David Kristoffersson 1 Future of Humanity Institute
David Krueger 1 Future of Humanity Institute
David Parkes 1 Harvard University
David Runciman 1 Leverhulme Centre for the Future of Intelligence
David Stanley 1 Future of Life Institute
David Yu 1 Center for Human Success
Diane Coyle 1 Google DeepMind
Didier Roy 1 FLOWERS
Dmitrii Krasheninnikov 1 Center for Human-Compatible AI
Doina Precup 1 Montreal Institute for Learning Algorithms
Duncan Sabien 1 Center for Applied Rationality
Durk Kingma 1 OpenAI
Dylan Hadfield-Menell 1 University of California, Berkeley
Edward Parson 1 UCLA School of Law
Edward W. Felten 1 Google DeepMind
Edwin Evans 1 Machine Intelligence Research Institute
Eli Tyre 1 Center for Applied Rationality
Eliezer Yudkowsky 1 Machine Intelligence Research Institute
Elizabeth Garrett 1 Center for Applied Rationality
Elliot Catt 1 Australian National University
Elman Mansimov 1 OpenAI
Eric Drexler 1 Future of Humanity Institute
Eric Gastfriend 1 Future of Life Institute
Eric Price 1 OpenAI
Eric Rogstad 1 Center for Applied Rationality
Erik Brynjolfsson 1 Future of Life Institute
Erika Reinhardt 1 OpenAI
Felix Berkenkamp 1 ETH Zurich
Filip Wolski 1 OpenAI
Finan Adamson 1 AI Impacts
Florian Golemo 1 FLOWERS
Frank Wilczek 1 Future of Life Institute
Frederick Eberhardt 1 California Institute of Technology
Freek Stulp 1 FLOWERS
Fuxin Li 1 Georgia Institute of Technology
Gary Drescher 1 Machine Intelligence Research Institute
Gennaro Raiola 1 FLOWERS
Geoffrey Irving 1 OpenAI
George Church 1 Future of Life Institute
Gillian Hadfield 1 Center for Human-Compatible AI
Gina Stuessy 1 Berkeley Existential Risk Initiative
Girish Sastry 1 Future of Humanity Institute
Grant Wilson 1 Global Catastrophic Risk Institute
Greg Brockman 1 OpenAI
Gregory C. Allen 1 Center for a New American Security
Grzegorz Orwiński 1 Future of Life Institute
Guillaume Duceux 1 FLOWERS
H. Orri Stefánsson 1 Institute for Future Studies
Harri Edwards 1 OpenAI
Haydn Belfield 1 Centre for the Study of Existential Risk
Helen Toner 1 Open Philanthropy Project
Henry Shevlin 1 Leverhulme Centre for the Future of Intelligence
Hideyuki Nakashima 1 Whole Brain Architecture Initiative
Hiroshi Yamakawa 1 Whole Brain Architecture Initiative
Hiroyuki Morikawa 1 Whole Brain Architecture Initiative
Hyrum Anderson 1 Endgame
Ian Goodfellow 1 OpenAI
Igor Mordatch 1 OpenAI
Ilya Sutskever 1 OpenAI
Jack Carroll 1 Center for Applied Rationality
Jack Clark 1 OpenAI
Jack Gallagher 1 Machine Intelligence Research Institute
Jacob Beebe 1 Future of Life Institute
Jacob Trefethen 1 Future of Life Institute
Jacob Tsimerman 1 Berkeley Existential Risk Initiative
Jade Leung 1 Future of Humanity Institute
Jakob Foerster 1 OpenAI
Jakub Pachocki 1 OpenAI
James M. Joyce 1 University of Michigan
James Manyika 1 Google DeepMind
Jan Feyereisl 1 GoodAI
Jarryd Martin 1 Australian National University
Jasen Murray 1 Machine Intelligence Research Institute
Jean Harb 1 OpenAI
Jed McCaleb 1 Machine Intelligence Research Institute
Jeffrey D. Sachs 1 Google DeepMind
Jeffrey Ding 1 Future of Humanity Institute
Jenann Ismael 1 University of Arizona
Jeremy Schlatter 1 OpenAI
Jesse Galef 1 Future of Life Institute
Jesse Liptrap 1 Machine Intelligence Research Institute
Jessica Taylor 1 Machine Intelligence Research Institute
Jie Tang 1 OpenAI
Jiji Zhang 1 Lingnan University
Jim Babcock 1 Cornell University
Jimmy Rintjema 1 AI Impacts
Joanna Bryson 1 University of Bath
Joelle Pineau 1 Montreal Institute for Learning Algorithms
John Schulman 1 OpenAI
Jon Gauthier 1 OpenAI
Jonas Schneider 1 OpenAI
Jonathan Gray 1 OpenAI
Jonathan Ho 1 OpenAI
Jonathan Raiman 1 OpenAI
Jonathan Yan 1 Real AI
Jordan Tirrell 1 Center for Applied Rationality
José Hernández-Orallo 1 Leverhulme Centre for the Future of Intelligence
Josh Tobin 1 OpenAI
Julia Galef 1 OpenAI
Julia Haas 1 Washington University in St. Louis
Justin Shovelain 1 Convergence Analysis
Justis Mills 1 AI Impacts
Kanta Dihal 1 Leverhulme Centre for the Future of Intelligence
Karina Vold 1 Leverhulme Centre for the Future of Intelligence
Kate Miltenberger 1 OpenAI
Katie Steele 1 London School of Economics
Kay Firth-Butterfield 1 Leverhulme Centre for the Future of Intelligence
Kaya Stechly 1 Machine Intelligence Research Institute
Kazue Evans 1 Future of Life Institute
Kenji Doya 1 Whole Brain Architecture Initiative
Kenny Easwaran 1 Texas A&M University
Kenzi Amodei 1 Berkeley Existential Risk Initiative
Kevin Fischer 1 Machine Intelligence Research Institute
Kevin Frans 1 OpenAI
Kitano Hiroaki 1 Whole Brain Architecture Initiative
Koichi Takahashi 1 Whole Brain Architecture Initiative
Koji Morikawa 1 Whole Brain Architecture Initiative
Kristian Rönn 1 Convergence Analysis
Lauren Lee 1 Center for Applied Rationality
Lerrel Pinto 1 OpenAI
Linxi Fan 1 OpenAI
Liron Shapira 1 Machine Intelligence Research Institute
Loïc Dauphin 1 FLOWERS
Long Ouyang 1 Theiss Research
Louie Helm 1 Machine Intelligence Research Institute
Louis-Charles Caron 1 FLOWERS
Lucas Perry 1 Future of Life Institute
Lucy Cheke 1 Leverhulme Centre for the Future of Intelligence
Ludwig Pettersson 1 OpenAI
Lukas Gloor 1 Foundational Research Institute
Luke Muehlhauser 1 Machine Intelligence Research Institute
Manuel Lopes 1 FLOWERS
Manuela M. Veloso 1 Leverhulme Centre for the Future of Intelligence
Manuela Veloso 1 Carnegie Mellon University
Maran Nelson 1 OpenAI
Marcin Andrychowicz 1 OpenAI
Marcus Hutter 1 Australian National University
Marek Havrda 1 GoodAI
Marek Rosa 1 GoodAI
Mark Fusco 1 Global Catastrophic Risk Institute
Mark Nitzberg 1 Center for Human-Compatible AI
Mark S. Miller 1 Foresight Institute
Marta Halina 1 Leverhulme Centre for the Future of Intelligence
Maruan Al-Shedivat 1 OpenAI
Masaru Tomita 1 Whole Brain Architecture Initiative
Matthew Fallshaw 1 Machine Intelligence Research Institute
Matthias Plappert 1 OpenAI
Matthieu Lapeyre 1 FLOWERS
Max Kesin 1 Future of Life Institute
Meia Chita-Tegmark 1 Future of Life Institute
Michael A. Osborne 1 Leverhulme Centre for the Future of Intelligence
Michael Anissimov 1 Machine Intelligence Research Institute
Michael Page 1 OpenAI
Michael Smith 1 Center for Applied Rationality
Michael Vassar 1 Machine Intelligence Research Institute
Michael Webb 1 Stanford University
Michael Wooldridge 1 University of Oxford
Michael Wulfsohn 1 AI Impacts
Mikael Böörs 1 Linköping University
Miljan Martic 1 Google DeepMind
Morgan Freeman 1 Future of Life Institute
Moshe Looks 1 Machine Intelligence Research Institute
Moshe Vardi 1 Rice University
Mustafa Suleyman 1 Google DeepMind
Na Li 1 Future of Life Institute
Nate Soares 1 Machine Intelligence Research Institute
Nate Thomas 1 Machine Intelligence Research Institute
Neal Jean 1 Future of Humanity Institute
Neil Lawrence 1 Leverhulme Centre for the Future of Intelligence
Nell Watson 1 EthicsNet
Nick Tarleton 1 Machine Intelligence Research Institute
Nicolas Jahier 1 FLOWERS
Nicolas Papernot 1 OpenAI
Nicolas Rabault 1 FLOWERS
Niel Bowerman 1 Future of Humanity Institute
Nisan Stiennon 1 Machine Intelligence Research Institute
Noah D. Goodman 1 Stanford University
Oleg Klimov 1 OpenAI
Olga Afanasjeva 1 GoodAI
Oliver Habryka 1 Center for Applied Rationality
Owen Cotton-Barratt 1 Centre for Effective Altruism
Ozzie Gooen 1 Convergence Analysis
Pamela Vagata 1 OpenAI
Panagiotis Papadakis 1 FLOWERS
Parnian Barekatain 1 OpenAI
Paul Scharre 1 Center for a New American Security
Pedro A. Ortega 1 Google DeepMind
Pei Wang 1 NARS
Pejman Makhfi 1 Machine Intelligence Research Institute
Percy Liang 1 Stanford University
Pete Michaud 1 Center for Applied Rationality
Peter Asaro 1 The New School
Peter Chen 1 OpenAI
Peter de Blanc 1 Machine Intelligence Research Institute
Peter Eckersley 1 Electronic Frontier Foundation
Peter Haas 1 Future of Life Institute
Peter Thiel 1 Machine Intelligence Research Institute
Peter Welinder 1 OpenAI
Philip Pettit 1 Leverhulme Centre for the Future of Intelligence
Pierre Rouanet 1 FLOWERS
Pierre-Yves Oudeyer 1 FLOWERS
Prafulla Dhariwal 1 OpenAI
Preston Greene 1 Nanyang Technological University
Przemyslaw Debiak 1 OpenAI
Quirin Fischer 1 OpenAI
Rachel Fong 1 OpenAI
Rafael Martinez-Galarza 1 Future of Life Institute
Rafał Józefowicz 1 OpenAI
Ray Briggs 1 Stanford University
Ray Kurzweil 1 Machine Intelligence Research Institute
Rebecca Crootof 1 Information Society Project
Rebecca Raible 1 Berkeley Existential Risk Initiative
Rein Houthooft 1 OpenAI
Remco Bloemen 1 EthicsNet
Richard Chen 1 OpenAI
Richard Loosemore 1 Susaro
Richard Mallah 1 Future of Life Institute
Richard Re 1 UCLA School of Law
Rob Bensinger 1 Machine Intelligence Research Institute
Robert de Neufville 1 Global Catastrophic Risk Institute
Rocky Duan 1 OpenAI
Rohin Shah 1 Center for Human-Compatible AI
Roxanne Heston 1 Future of Humanity Institute
Rune Nyrup 1 Leverhulme Centre for the Future of Intelligence
Ryan Lowe 1 OpenAI
Sam Altman 1 OpenAI
Sam Eisenstat 1 Machine Intelligence Research Institute
Sandy Huang 1 OpenAI
Sarah Dillon 1 Leverhulme Centre for the Future of Intelligence
Satinder Singh Baveja 1 Center for Human-Compatible AI
Saul Perlmutter 1 Future of Life Institute
Scott Garrabrant 1 Machine Intelligence Research Institute
Scott Gray 1 OpenAI
Sean Holden 1 Centre for the Study of Existential Risk
Sean Legassick 1 Google DeepMind
Sebastian Farquhar 1 Future of Humanity Institute
Sébastien Forestier 1 FLOWERS
Sergey Levine 1 University of California, Berkeley
Seth Herd 1 University of Colorado
Seth J. Herd 1 eCortex
Shahar Avin 1 Centre for the Study of Existential Risk
Shane Legg 1 Google DeepMind
Shariq Hashme 1 OpenAI
Shimon Whiteson 1 OpenAI
Shivon Zilis 1 OpenAI
Shun Liao 1 OpenAI
Siddharth Srivastava 1 Center for Human-Compatible AI
Simon Beard 1 Centre for the Study of Existential Risk
Stan Franklin 1 Learning Intelligent Distribution Agent
Stefano Ermon 1 Stanford University
Stephan Hartmann 1 Ludwig Maximilian University of Munich
Stéphanie Noirpoudre 1 FLOWERS
Stephen Cave 1 Leverhulme Centre for the Future of Intelligence
Stephen J. Read 1 University of Southern California
Stephen John 1 Leverhulme Centre for the Future of Intelligence
Steve Rayhawk 1 Machine Intelligence Research Institute
Steven Umbrello 1 Institute of Ethics and Emerging Technologies
Susan Gowans 1 Leverhulme Centre for the Future of Intelligence
Szymon Sidor 1 OpenAI
Taco Cohen 1 OpenAI
Tamas Madl 1 Learning Intelligent Distribution Agent
Tamay Beriroglu 1 Future of Humanity Institute
Tambet Matiisen 1 OpenAI
Tameem Adel 1 Leverhulme Centre for the Future of Intelligence
Tamim Asfour 1 OpenAI
Tania Lombrozo 1 Center for Human-Compatible AI
Théo Segonds 1 FLOWERS
Thibaut Munzer 1 FLOWERS
Thomas D. Grant 1 Leverhulme Centre for the Future of Intelligence
Thomas Dietterich 1 Oregon State University
Thomas Icard 1 Stanford University
Thomas Zeitzoff 1 American University
Tim Crane 1 Centre for the Study of Existential Risk
Tim Salimans 1 OpenAI
Timothy Telleen-Lawton 1 Center for Applied Rationality
Tobias Baumann 1 Foundational Research Institute
Tobias Wängberg 1 Linköping University
Tom Griffiths 1 Center for Human-Compatible AI
Tom Kalil 1 University of California, Berkeley
Tom McGrath 1 Future of Humanity Institute
Tomer Kagan 1 Machine Intelligence Research Institute
Tony Barrett 1 Global Catastrophic Risk Institute
Tor Lattimore 1 University of Alberta
Trapit Bansal 1 OpenAI
Trevor Blackwell 1 OpenAI
Trevor White 1 Global Catastrophic Risk Institute
Tucker Davey 1 Future of Life Institute
Ulrike Hahn 1 Birkbeck, University of London
Vadim Kosoy 1 Machine Intelligence Research Institute
Vera Koroleva 1 Future of Life Institute
Verity Harding 1 Google DeepMind
Vicki Cheung 1 OpenAI
Vikash Kumar 1 OpenAI
Vincent C. Müller 1 Future of Humanity Institute
Vincent Conitzer 1 Duke University
Vipul Naik 1 Machine Intelligence Research Institute
Vladimir Shakirov 1 Moscow Institute of Physics and Technology
Vladimir Slepnev 1 Google
Wendell Wallach 1 Yale University
William Saunders 1 Future of Humanity Institute
William Schueller 1 FLOWERS
Wlodek Rabinowicz 1 London School of Economics
Wojciech Zaremba 1 OpenAI
Wyatt Tessari 1 Center for Human Success
Xi Chen 1 OpenAI
Xin Wen 1 Future of Life Institute
Xue Bin Peng 1 OpenAI
Yan Duan 1 OpenAI
Yaroslav Bulatov 1 OpenAI
Yi Wu 1 OpenAI
Yishuai Du 1 Future of Life Institute
Yoan Mollard 1 FLOWERS
Yoshua Bengio 1 Montreal Institute for Learning Algorithms
Yuhuai Wu 1 OpenAI
Yura Burda 1 OpenAI
Yutaka Matsuo 1 Whole Brain Architecture Initiative
Zac Kenton 1 Montreal Institute for Learning Algorithms
Zain Shah 1 OpenAI
Zara Yaqoob 1 Future of Life Institute
Zoubin Ghahramani 1 Leverhulme Centre for the Future of Intelligence

Positions grouped by organization

Showing 116 organizations.

Organization Number of people List of people
OpenAI 94 Alec Radford, Aleks Kamko, Alex Nichol, Alex Ray, Andrej Karpathy, Anish Athalye, Ankur Handa, Aviv Tamar, Bob McGrew, Bradly Stadie, Catherine Olsson, Christopher Berner, Christopher Hesse, Dario Amodei, Durk Kingma, Elman Mansimov, Eric Price, Erika Reinhardt, Filip Wolski, Geoffrey Irving, Greg Brockman, Harri Edwards, Holden Karnofsky, Ian Goodfellow, Igor Mordatch, Ilya Sutskever, Jack Clark, Jakob Foerster, Jakub Pachocki, Jean Harb, Jeremy Schlatter, Jie Tang, John Schulman, Jon Gauthier, Jonas Schneider, Jonathan Gray, Jonathan Ho, Jonathan Raiman, Josh Tobin, Julia Galef, Kate Miltenberger, Kevin Frans, Lerrel Pinto, Linxi Fan, Ludwig Pettersson, Maran Nelson, Marcin Andrychowicz, Maruan Al-Shedivat, Matthias Plappert, Michael Page, Nicolas Papernot, Oleg Klimov, Pamela Vagata, Parnian Barekatain, Paul Christiano, Peter Chen, Peter Welinder, Pieter Abbeel, Prafulla Dhariwal, Przemyslaw Debiak, Quirin Fischer, Rachel Fong, Rafał Józefowicz, Rein Houthooft, Richard Chen, Rocky Duan, Ryan Lowe, Sam Altman, Sandy Huang, Scott Gray, Shariq Hashme, Shimon Whiteson, Shivon Zi
Machine Intelligence Research Institute 70 Aaron Silverbook, Abram Demski, Alex Altair, Alex Vermeer, Amy Willey, Andrew Critch, Andrew Lapinski-Barker, Anna Salamon, Bart Selman, Ben Goertzel, Benya Fallenstein, Bill Hibbard, Blake Borgeson, Carl Shulman, Christine Peterson, Colm Ó Riain, Daniel Dewey, David Hart, Edwin Evans, Eliezer Yudkowsky, Gary Drescher, Jaan Tallinn, Jack Gallagher, Jan Leike, Jasen Murray, Jed McCaleb, Jesse Liptrap, Jessica Taylor, Kaj Sotala, Katja Grace, Kaya Stechly, Kevin Fischer, Liron Shapira, Louie Helm, Luke Muehlhauser, Malo Bourgon, Marcello Herreshoff, Matthew Fallshaw, Matthew Graves, Max Tegmark, Michael Anissimov, Michael Vassar, Mihaly Barasz, Moshe Looks, Nate Soares, Nate Thomas, Nick Bostrom, Nick Tarleton, Nisan Stiennon, Patrick LaVictoire, Paul Christiano, Pejman Makhfi, Peter de Blanc, Peter Thiel, Ramana Kumar, Ray Kurzweil, Rob Bensinger, Robin Hanson, Roman Yampolskiy, Ryan Carey, Sam Eisenstat, Scott Garrabrant, Steve Omohundro, Steve Rayhawk, Stuart Armstrong, Stuart Russell, Tomer Kagan, Tsvi Benson-Tilsen, Vadim Kosoy, Vipul Naik
Future of Life Institute 45 Alan Alda, Alan Guth, Alan Yan, Ales Flidr, Anthony Aguirre, Ariel Conn, Chase Moores, Christof Koch, Daniel Dewey, Daniel R. Miller, David Stanley, Elon Musk, Eric Gastfriend, Erik Brynjolfsson, Francesca Rossi, Frank Wilczek, George Church, Grzegorz Orwiński, Jaan Tallinn, Jacob Beebe, Jacob Trefethen, Janos Kramar, Jesse Galef, Kazue Evans, Lucas Perry, Martin Rees, Max Kesin, Max Tegmark, Meia Chita-Tegmark, Melody Guan, Morgan Freeman, Na Li, Nick Bostrom, Peter Haas, Rafael Martinez-Galarza, Richard Mallah, Saul Perlmutter, Stephen Hawking, Stuart Russell, Tucker Davey, Vera Koroleva, Victoria Krakovna, Xin Wen, Yishuai Du, Zara Yaqoob
Future of Humanity Institute 40 Allan Dafoe, Anders Sandberg, Andrew Snyder-Beattie, Baobao Zhang, Ben Garfinkel, Carl Shulman, Carrick Flynn, Cecilia Tilli, Chelsea Guo, Christopher Cundy, Clare Lyle, Daniel Dewey, Daniel Filan, David Abel, David Kristoffersson, David Krueger, Eric Drexler, Girish Sastry, Jade Leung, Jan Leike, Jeffrey Ding, John Salvatier, Katja Grace, Kyle Scott, Miles Brundage, Neal Jean, Nick Bostrom, Niel Bowerman, Owain Evans, Paul Christiano, Robin Hanson, Roxanne Heston, Ryan Carey, Seán Ó hÉigeartaigh, Sebastian Farquhar, Stuart Armstrong, Tamay Beriroglu, Tom McGrath, Vincent C. Müller, William Saunders
Leverhulme Centre for the Future of Intelligence 37 Adrian Weller, Alan Winfield, Alison Gopnik, Andrew Snyder-Beattie, Anna Alexandrova, Beth Singler, David Runciman, Demis Hassabis, Francesca Rossi, Heather Roff, Henry Shevlin, Huw Price, José Hernández-Orallo, Kanta Dihal, Karina Vold, Kay Firth-Butterfield, Lucy Cheke, Manuela M. Veloso, Margaret Boden, Marta Halina, Martin Rees, Martina Kunz, Michael A. Osborne, Murray Shanahan, Neil Lawrence, Nick Bostrom, Philip Pettit, Rune Nyrup, Sarah Dillon, Seán Ó hÉigeartaigh, Stephen Cave, Stephen John, Stuart Russell, Susan Gowans, Tameem Adel, Thomas D. Grant, Zoubin Ghahramani
FLOWERS 31 Adrien Matricon, Alexandra Delmas, Alexandre Gepperth, Alvaro Ovalle Castaneda, Anna-Lisa Vollmer, Baptiste Busch, Benjamin Clément, Céline Craye, Cem Karaoguz, Damien Caselli, David Filliat, Didier Roy, Florian Golemo, Freek Stulp, Gennaro Raiola, Guillaume Duceux, Loïc Dauphin, Louis-Charles Caron, Manuel Lopes, Matthieu Lapeyre, Nicolas Jahier, Nicolas Rabault, Panagiotis Papadakis, Pierre Rouanet, Pierre-Yves Oudeyer, Sébastien Forestier, Stéphanie Noirpoudre, Théo Segonds, Thibaut Munzer, William Schueller, Yoan Mollard
Centre for the Study of Existential Risk 25 Adrian Weller, Alison Gopnik, Beth Barnes, Dana Scott, David Chalmers, Elon Musk, Haydn Belfield, Huw Price, Jaan Tallinn, Margaret Boden, Martin Rees, Martina Kunz, Max Tegmark, Murray Shanahan, Nick Bostrom, Ryan Carey, Sean Holden, Seán Ó hÉigeartaigh, Seth Baum, Shahar Avin, Simon Beard, Stephen Hawking, Stuart Russell, Tim Crane, Yang Liu
Center for Applied Rationality 22 Adam Scholl, Adom Hartell, Anna Salamon, Ben Goldhaber, Ben Sancetta, Dan Keys, Duncan Sabien, Eli Tyre, Elizabeth Garrett, Eric Rogstad, Jack Carroll, Jordan Tirrell, Lauren Lee, Matthew Graves, Michael Keenan, Michael Smith, Oliver Habryka, Pete Michaud, Qiaochu Yuan, Stephanie Zolayvar, Timothy Telleen-Lawton, Tsvi Benson-Tilsen
Google DeepMind 18 Andrew Lefrancq, Christiana Figueres, Demis Hassabis, Diane Coyle, Edward W. Felten, James Manyika, Jan Leike, Jeffrey D. Sachs, Laurent Orseau, Miljan Martic, Mustafa Suleyman, Nick Bostrom, Pedro A. Ortega, Sean Legassick, Shane Legg, Tom Everitt, Verity Harding, Victoria Krakovna
Center for Human-Compatible AI 17 Anca Dragan, Andrew Critch, Bart Selman, Beth Barnes, Christopher Cundy, Dmitrii Krasheninnikov, Gillian Hadfield, Joseph Halpern, Mark Nitzberg, Michael Wellman, Pieter Abbeel, Rohin Shah, Satinder Singh Baveja, Siddharth Srivastava, Stuart Russell, Tania Lombrozo, Tom Griffiths
Berkeley Existential Risk Initiative 12 Andrew Critch, Andrew Snyder-Beattie, Gina Stuessy, Jaan Tallinn, Jacob Tsimerman, Kenzi Amodei, Kyle Scott, Malo Bourgon, Michael Keenan, Rebecca Raible, Seán Ó hÉigeartaigh, Stuart Russell
AI Impacts 10 Ben Hoffman, Connor Flexman, Finan Adamson, Jimmy Rintjema, John Salvatier, Justis Mills, Katja Grace, Michael Wulfsohn, Paul Christiano, Stephanie Zolayvar
University of California, Berkeley 10 Anca Dragan, Andrew Critch, Dylan Hadfield-Menell, Paul Christiano, Pieter Abbeel, Qiaochu Yuan, Sergey Levine, Smitha Milli, Stuart Russell, Tom Kalil
Stanford University 9 Alex Aiken, Andreas Stuhlmüller, Jacob Steinhardt, Michael Webb, Noah D. Goodman, Percy Liang, Ray Briggs, Stefano Ermon, Thomas Icard
Whole Brain Architecture Initiative 9 Hideyuki Nakashima, Hiroshi Yamakawa, Hiroyuki Morikawa, Kenji Doya, Kitano Hiroaki, Koichi Takahashi, Koji Morikawa, Masaru Tomita, Yutaka Matsuo
Global Catastrophic Risk Institute 8 Grant Wilson, Mark Fusco, Matthijs Maas, Robert de Neufville, Roman Yampolskiy, Seth Baum, Tony Barrett, Trevor White
Open Philanthropy Project 7 Christopher Olah, Daniel Dewey, Dario Amodei, Helen Toner, Holden Karnofsky, Jacob Steinhardt, Paul Christiano
Australian National University 6 Alan Hájek, Elliot Catt, Jan Leike, Jarryd Martin, Marcus Hutter, Tom Everitt
Foundational Research Institute 6 Brian Tomasik, Caspar Oesterheld, Kaj Sotala, Lukas Gloor, Max Daniel, Tobias Baumann
Convergence Analysis 5 Andrew X Stewart, Claire Abu-Assal, Justin Shovelain, Kristian Rönn, Ozzie Gooen
Google Brain 5 Christopher Olah, Dan Mané, Dario Amodei, Melody Guan, Tom Brown
University of Cambridge 5 Adrian Weller, Arif Ahmed, Huw Price, Ramana Kumar, Yang Liu
EthicsNet 4 Aleksandra Orchowska, Anish Mohammed, Nell Watson, Remco Bloemen
GoodAI 4 Jan Feyereisl, Marek Havrda, Marek Rosa, Olga Afanasjeva
Montreal Institute for Learning Algorithms 4 Doina Precup, Joelle Pineau, Yoshua Bengio, Zac Kenton
Carnegie Mellon University 3 Andre Platzer, David Danks, Manuela Veloso
Cornell University 3 Bart Selman, Jim Babcock, Joseph Halpern
eCortex 3 David J. Jilk, Randall C. O’Reilly, Seth J. Herd
Foresight Institute 3 Allison Duettmann, Christine Peterson, Mark S. Miller
IDSIA 3 Bas R. Steunebrink, Jürgen Schmidhuber, Mark Ring
SUPSI 3 Bas R. Steunebrink, Jürgen Schmidhuber, Mark Ring
Theiss Research 3 Kaj Sotala, Long Ouyang, Paul Christiano
Università della Svizzera italiana 3 Bas R. Steunebrink, Jürgen Schmidhuber, Mark Ring
University of Oxford 3 Heather Roff, Michael Wooldridge, Owain Evans
Yale University 3 Allan Dafoe, Daniel Eth, Wendell Wallach
AIDEUS 2 Alexey Potapov, Sergey Rodionov
Arizona State University 2 Heather Roff, Miles Brundage
Center for a New American Security 2 Gregory C. Allen, Paul Scharre
Center for Human Success 2 David Yu, Wyatt Tessari
Endgame 2 Bobby Filar, Hyrum Anderson
Google 2 Marcello Herreshoff, Vladimir Slepnev
Learning Intelligent Distribution Agent 2 Stan Franklin, Tamas Madl
Linköping University 2 Mikael Böörs, Tobias Wängberg
London School of Economics 2 Katie Steele, Wlodek Rabinowicz
Ludwig Maximilian University of Munich 2 Reuben Stern, Stephan Hartmann
UCLA School of Law 2 Edward Parson, Richard Re
University of Michigan 2 James M. Joyce, Michael Wellman
University of Wisconsin–Madison 2 Patrick LaVictoire, Reuben Stern
Aalto University 1 Jelena Luketina
AgroParisTech 1 Laurent Orseau
Aix-Marseille University 1 Sergey Rodionov
American University 1 Thomas Zeitzoff
Birkbeck, University of London 1 Ulrike Hahn
Brown University 1 David Abel
California Institute of Technology 1 Frederick Eberhardt
Carleton University 1 Andrew MacFie
Center for Analysis & Design of Intelligent Agents 1 Kristinn R. Thórisson
Centre for Effective Altruism 1 Owen Cotton-Barratt
CogPrime 1 Ben Goertzel
Data61 1 Ramana Kumar
Duke University 1 Vincent Conitzer
Effective Altruism Foundation 1 Max Daniel
Electronic Frontier Foundation 1 Peter Eckersley
ETH Zurich 1 Felix Berkenkamp
George Mason University 1 Robin Hanson
Georgia Institute of Technology 1 Fuxin Li
Global Politics of Artificial Intelligence Research Group at Yale University and University of Oxford 1 Matthijs Maas
Hague Centre for Strategic Studies 1 Matthijs Maas
Harvard University 1 David Parkes
Icelandic Institute for Intelligent Machines 1 Kristinn R. Thórisson
Information Society Project 1 Rebecca Crootof
INRA 1 Laurent Orseau
Institute for Future Studies 1 H. Orri Stefánsson
Institute for Theoretical Studies at ETH Zurich 1 Will Sawin
Institute of Ethics and Emerging Technologies 1 Steven Umbrello
Internet Archive 1 Brewster Kahle
ITMO University 1 Alexey Potapov
Lingnan University 1 Jiji Zhang
Moscow Institute of Physics and Technology 1 Vladimir Shakirov
Munich Center for Mathematical Philosophy 1 Catrin Campbell-Moore
Nanyang Technological University 1 Preston Greene
NARS 1 Pei Wang
National Bureau of Economic Research 1 Joshua Gans
New America Foundation 1 Heather Roff
Nilcons 1 Mihaly Barasz
NNAISENSE 1 Bas R. Steunebrink
Oregon State University 1 Thomas Dietterich
Princeton University 1 Will Sawin
Quixey 1 Patrick LaVictoire
Real AI 1 Jonathan Yan
Rice University 1 Moshe Vardi
Self-Aware Systems 1 Steve Omohundro
Social & Environmental Entrepreneurs 1 Seth Baum
Susaro 1 Richard Loosemore
Texas A&M University 1 Kenny Easwaran
The New School 1 Peter Asaro
Université de Montréal 1 Jelena Luketina
University of Alberta 1 Tor Lattimore
University of Arizona 1 Jenann Ismael
University of Bath 1 Joanna Bryson
University of Bristol 1 Benya Fallenstein
University of Colorado 1 Seth Herd
University of Colorado Boulder 1 Randall C. O’Reilly
University of Copenhagen 1 Matthijs Maas
University of Denver 1 Heather Roff
University of Edinburgh 1 Angelo Frank De Bellis
University of Illinois at Chicago 1 Brian Ziebart
University of Louisville 1 Roman Yampolskiy
University of Melbourne 1 Benjamin Rubinstein
University of Montreal 1 Janos Kramar
University of New Hampshire 1 Andrew Ware
University of Padova 1 Francesca Rossi
University of Southern California 1 Stephen J. Read
University of Toronto 1 Joshua Gans
University of Washington 1 Daniel Weld
Washington University in St. Louis 1 Julia Haas

Individuals not affiliated with any organization

Showing 30 people.

Organization Website Source
Wei Dai http://www.weidai.com/ [1], [2]
Toon Alfrink [3], [4], [5]
Iceman [6], [7], [8], [9], [10]
Max Harms http://raelifin.com/ [11], [10]
Jeff Kaufman https://www.jefftk.com [12], [10]
Gwern Branwen https://www.gwern.net/ [10], [13], [14]
Federico Pistono http://federicopistono.org/ [15]
Chris Pasek [16], [17]
Peter Scheyer [16]
Alex Mennen http://alexmennen.com/ [18]
Sune Kristian Jakobsen
Alex Appel
Vladimir Nesov
Sören Mindermann
Amanda Askell http://www.amandaaskell.com [19]
Hilary Greaves [19]
Sophie-Charlotte Fischer [20]
Alexey Turchin https://avturchin.livejournal.com/ [21], [22], [23], [24]
Dustin Juliano http://dustinjuliano.com/ [25], [26]
Matteo Turchetta [27]
Angela P. Schoellig [27]
Andreas Krause [27]
Jim O’Neill [28]
Alex Flint http://www.alexflint.io/ [29]
Alex Zhu [30]
Gordon Irlam http://www.gordoni.com/ [31]
Remmelt Ellen [17]
Linda Linsefors [32], [17]
Nandi Schoots [17]
John Maxwell [18]

Products

This section lists AI safety-related “products”: interactive tools, websites, flowcharts, datasets, etc. Unlike documents, products tend to be interactive, are updated continually, or require inputs from the consumer.

Showing 30 products.

Name Type Creator Creation date Description
AI Safety Research Camp workshop Tom McGrath, Remmelt Ellen, Linda Linsefors, Nandi Schoots, David Kristoffersson, Chris Pasek 2018-02-01 A research camp to take place in Gran Canaria in April 2018 and in the United Kingdom in July–August 2018. Facebook group at [33]. The creation date is the date of announcement on LessWrong 2.0.
“Levels of defense” in AI safety flowchart Alexey Turchin 2017-12-12 A flowchart applying multilevel defense to AI safety. There is an accompanying post on LessWrong at [34].
AI Alignment Prize contest Zvi Mowshowitz, Vladimir Slepnev, Paul Christiano 2017-11-03 A prize for work that advances understanding in alignment of smarter-than-human artificial intelligence. Winners for the first round, as well as announcement of the second round, can be found at [18].
AI Watch interactive application Issa Rice 2017-10-23 A website to track people and organizations working on AI safety.
AI Safety Open Discussion discussion group Mati Roy 2017-10-23 A Facebook discussion group about AI safety. This is an open group.
AI safety resources list Victoria Krakovna 2017-10-01 A list of resources for long-term AI safety. Seems to have been first announced at [35].
Map of the AI Safety Community graphic Søren Elverlin 2017-09-26 A pictorial map that lists organizations and individuals in the AI safety community.
Open Philanthropy Project AI Fellows Program fellowship Open Philanthropy Project 2017-09-12 A fellowship to support PhD students in AI and machine learning. For the creation date, see [36].
Road to AI Safety Excellence course Toon Alfrink 2017-06-15 A proposed course that is designed to produce AI safety researchers. It used to be called “Accelerating AI Safety Adoption in Academia” and was announced on LessWrong at [37]. The Facebook group was created on 2017-06-30 [38].
Annotated bibliography of recommended materials list Center for Human-Compatible AI 2016-12-01 An annotated and interactive bibliography of AI safety-related course materials, textbooks, videos, papers, etc.
Extinction Risk from Artificial Intelligence blog Michael Cohen 2016-06-01 A series of pages exploring arguments for and against working on AI safety. The creation date is inferred from the URLs of images (example: [39]).
AI Alignment blog Paul Christiano 2016-05-28 Paul Christiano’s blog about AI alignment.
AISafety.com Reading Group discussion group Søren Elverlin, Erik B. Jacobsen, Volkan Erdogan 2016-05-24 A weekly reading group covering topics in AI safety.
Cause prioritization app interactive application Michael Dickens, Buck Shlegeris 2016-05-18 An interactive app for quantitative cause prioritization. The app includes a section [40] on AI safety intervention. The creation date is the date of the first commit in the Git repository [41].
Arbital AI alignment domain wiki Arbital, Eliezer Yudkowsky 2016-03-04 A collection of wiki-like pages on topics in AI alignment. The creation date is the date of the launch announcement for Arbital [42]; it’s unclear when the AI alignment domain itself was created.
Introductory resources on AI safety research list Victoria Krakovna 2016-02-28 A list of readings on long-term AI safety. Mirrored at [43]. There is an updated list at [44].
AI Safety Discussion discussion group Victoria Krakovna 2016-02-21 A Facebook discussion group about AI safety. This is a closed group so one needs to request access to see posts.
Reinforce.js implementation of Stuart Armstrong’s toy control problem interactive application Gwern Branwen, FeepingCreature 2016-02-03 A live demo of Stuart Armstrong’s toy control problem [45]. gwern introduced the demo in a LessWrong comment [46].
AI Policies Wiki wiki Gordon Irlam 2015-12-14 A wiki on AI policy. The wiki creation date can be seen in the revision history of the main page [47].
The Control Problem discussion group CyberPersona 2015-08-29 A subreddit about AI safety and control. For the subreddit creation date, see [48].
AGI Failures Modes and Levels map flowchart Alexey Turchin 2015-01-01 A flowchart about failure modes of artificial general intelligence, grouped by the stage of development. There is an accompanying post on LessWrong at [24].
AGI Safety Solutions Map flowchart Alexey Turchin 2015-01-01 A flowchart on potential solutions to AI safety. There is an accompanying post on LessWrong at [49].
Intelligent Agent Foundations Forum discussion group Machine Intelligence Research Institute 2014-11-04 A forum for technical AI safety research. The source code is hosted on GitHub [50]. The timestamp on the introductory post [51] gives the launch date.
A flowchart of AI safety considerations flowchart Eliezer Yudkowsky 2014-11-02 The flowchart was posted to Eliezer Yudkowsky’s Essays (a Facebook group) and has no title.
Effective Altruism Forum blog Centre for Effective Altruism, Rethink Charity, Ryan Carey 2014-09-10 A community blog about effective altruism which often has posts about AI safety. The forum was announced on LessWrong by Ryan Carey [52].
How to study superintelligence strategy list Luke Muehlhauser 2014-07-03 A list of project ideas in superintelligence strategy.
Ordinary Ideas blog Paul Christiano 2011-12-21 Paul Christiano’s blog about “weird AI stuff” [53].
The Uncertain Future interactive application Machine Intelligence Research Institute 2009-10-01 A tool to model future technology and its effect on civilization. For more about the history of the site, see [54].
LessWrong Wiki wiki Machine Intelligence Research Institute 2009-03-12 A companion wiki to the community blog LessWrong. The wiki has pages about AI safety.
LessWrong blog Machine Intelligence Research Institute 2009-02-01 A community blog about rationality, decision theory, AI, updates to MIRI, among other topics.