Information for Debate

Basic information

Associated people: Paul Christiano

Associated organizations: OpenAI


Goals of the agenda

Assumptions the agenda makes

AI timelines

Nature of intelligence



Title Publication date Author Publisher Affected organizations Affected people Notes
Scalable agent alignment via reward modeling: a research direction 2018-11-19 Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg arXiv Google DeepMind This paper introduces the (recursive) reward modeling agenda, discussing its basic outline, challenges, and ways to overcome those challenges. The paper also discusses alternative agendas and their relation to reward modeling.