2011.08820
Cited By
REALab: An Embedded Perspective on Tampering
17 November 2020
Ramana Kumar
J. Uesato
Richard Ngo
Tom Everitt
Victoria Krakovna
Shane Legg
Papers citing "REALab: An Embedded Perspective on Tampering"
7 / 7 papers shown
Solving math word problems with process- and outcome-based feedback
J. Uesato
Nate Kushman
Ramana Kumar
Francis Song
Noah Y. Siegel
L. Wang
Antonia Creswell
G. Irving
I. Higgins
FaML
ReLM
AIMat
LRM
425
640
0
25 Nov 2022
Defining and Characterizing Reward Hacking
Joar Skalse
Nikolaus H. R. Howe
Dmitrii Krasheninnikov
David M. Krueger
498
113
0
27 Sep 2022
Is Power-Seeking AI an Existential Risk?
Joseph Carlsmith
ELM
237
138
0
16 Jun 2022
Estimating and Penalizing Induced Preference Shifts in Recommender Systems
International Conference on Machine Learning (ICML), 2022
Micah Carroll
Anca Dragan
Stuart J. Russell
Dylan Hadfield-Menell
OffRL
399
49
0
25 Apr 2022
Safe Deep RL in 3D Environments using Human Feedback
Matthew Rahtz
Vikrant Varma
Ramana Kumar
Zachary Kenton
Shane Legg
Jan Leike
262
6
0
20 Jan 2022
On the Expressivity of Markov Reward
Neural Information Processing Systems (NeurIPS), 2021
David Abel
Will Dabney
Anna Harutyunyan
Mark K. Ho
Michael L. Littman
Doina Precup
Satinder Singh
293
98
0
01 Nov 2021
Counterfactual Planning in AGI Systems
K. Holtman
182
4
0
29 Jan 2021