Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2011.08820
Cited By
REALab: An Embedded Perspective on Tampering
17 November 2020
Ramana Kumar
J. Uesato
Richard Ngo
Tom Everitt
Victoria Krakovna
Shane Legg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"REALab: An Embedded Perspective on Tampering"
5 / 5 papers shown
Title
Defining and Characterizing Reward Hacking
Joar Skalse
Nikolaus H. R. Howe
Dmitrii Krasheninnikov
David M. Krueger
59
55
0
27 Sep 2022
Estimating and Penalizing Induced Preference Shifts in Recommender Systems
Micah Carroll
Anca Dragan
Stuart J. Russell
Dylan Hadfield-Menell
OffRL
14
41
0
25 Apr 2022
Safe Deep RL in 3D Environments using Human Feedback
Matthew Rahtz
Vikrant Varma
Ramana Kumar
Zachary Kenton
Shane Legg
Jan Leike
29
4
0
20 Jan 2022
On the Expressivity of Markov Reward
David Abel
Will Dabney
A. Harutyunyan
Mark K. Ho
Michael L. Littman
Doina Precup
Satinder Singh
26
82
0
01 Nov 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
340
1,960
0
04 May 2020
1