Information for Rohin Shah

Table of contents

Basic information

Item Value

List of positions (2 positions)

Organization Title Start date End date AI safety relation Subject Employment type Source Notes
Center for Human-Compatible AI Spotlighted Student 2019-01-19 2020-12-03 graduate student [1], [2]
Center for Human-Compatible AI Alumnus 2022-01-22 graduate student [3], [4]

Products (1 product)

Name Creation date Description
Clarifying some key hypotheses in AI alignment 2019-08-15 With Ben Cottier. A diagram collecting several hypotheses in AI alignment and their relationships to existing research agendas.

Organization documents (0 documents)

Title Publication date Author Publisher Affected organizations Affected people Document scope Cause area Notes

Documents (2 documents)

Title Publication date Author Publisher Affected organizations Affected people Affected agendas Notes
AI Alignment Podcast: An Overview of Technical AI Alignment with Rohin Shah (Part 2) 2019-04-25 Lucas Perry Future of Life Institute Rohin Shah, Dylan Hadfield-Menell, Gillian Hadfield Embedded agency, Cooperative inverse reinforcement learning, inverse reinforcement learning, deep reinforcement learning from human preferences, recursive reward modeling, iterated amplification Part two of a podcast episode that goes into detail about some technical approaches to AI alignment.
AI Alignment Podcast: An Overview of Technical AI Alignment with Rohin Shah (Part 1) 2019-04-11 Lucas Perry Future of Life Institute Rohin Shah iterated amplification Part one of an interview with Rohin Shah that goes covers some technical agendas for AI alignment.

Similar people

Showing at most 20 people who are most similar in terms of which organizations they have worked at.

Person Number of organizations in common List of organizations in common
Beth Barnes 1 Center for Human-Compatible AI
Alison Gopnik 1 Center for Human-Compatible AI
Stuart Russell 1 Center for Human-Compatible AI
Jacob Steinhardt 1 Center for Human-Compatible AI
Anca Dragan 1 Center for Human-Compatible AI
Dylan Hadfield-Menell 1 Center for Human-Compatible AI
Smitha Milli 1 Center for Human-Compatible AI
Pieter Abbeel 1 Center for Human-Compatible AI
Bart Selman 1 Center for Human-Compatible AI
Michael Wellman 1 Center for Human-Compatible AI
Andrew Critch 1 Center for Human-Compatible AI
Joseph Halpern 1 Center for Human-Compatible AI
Johannes Treutlein 1 Center for Human-Compatible AI
Dorsa Sadigh 1 Center for Human-Compatible AI
Dmitrii Krasheninnikov 1 Center for Human-Compatible AI
Michael Cohen 1 Center for Human-Compatible AI
Brandon Perry 1 Center for Human-Compatible AI
David Lindner 1 Center for Human-Compatible AI
Elizabeth Barnes 1 Center for Human-Compatible AI
Lawrence Chan 1 Center for Human-Compatible AI