Title | Publication date | Author | Publisher | Affected organizations | Affected people | Notes |
---|---|---|---|---|---|---|
AI Alignment Podcast: An Overview of Technical AI Alignment with Rohin Shah (Part 2) | 2019-04-25 | Lucas Perry | Future of Life Institute | | Rohin Shah, Dylan Hadfield-Menell, Gillian Hadfield | Part two of a podcast episode that goes into detail about some technical approaches to AI alignment. |
Scalable agent alignment via reward modeling: a research direction | 2018-11-19 | Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg | arXiv | Google DeepMind | | This paper introduces the (recursive) reward modeling agenda, discussing its basic outline, its challenges, and ways to overcome those challenges. The paper also discusses alternative agendas and their relation to reward modeling. |