ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.08820
  4. Cited By
REALab: An Embedded Perspective on Tampering

REALab: An Embedded Perspective on Tampering

17 November 2020
Ramana Kumar
J. Uesato
Richard Ngo
Tom Everitt
Victoria Krakovna
Shane Legg
ArXiv (abs)PDFHTML

Papers citing "REALab: An Embedded Perspective on Tampering"

7 / 7 papers shown
Solving math word problems with process- and outcome-based feedback
Solving math word problems with process- and outcome-based feedback
J. Uesato
Nate Kushman
Ramana Kumar
Francis Song
Noah Y. Siegel
L. Wang
Antonia Creswell
G. Irving
I. Higgins
FaMLReLMAIMatLRM
425
640
0
25 Nov 2022
Defining and Characterizing Reward Hacking
Defining and Characterizing Reward Hacking
Joar Skalse
Nikolaus H. R. Howe
Dmitrii Krasheninnikov
David M. Krueger
498
113
0
27 Sep 2022
Is Power-Seeking AI an Existential Risk?
Is Power-Seeking AI an Existential Risk?
Joseph Carlsmith
ELM
237
138
0
16 Jun 2022
Estimating and Penalizing Induced Preference Shifts in Recommender
  Systems
Estimating and Penalizing Induced Preference Shifts in Recommender SystemsInternational Conference on Machine Learning (ICML), 2022
Micah Carroll
Anca Dragan
Stuart J. Russell
Dylan Hadfield-Menell
OffRL
399
49
0
25 Apr 2022
Safe Deep RL in 3D Environments using Human Feedback
Safe Deep RL in 3D Environments using Human Feedback
Matthew Rahtz
Vikrant Varma
Ramana Kumar
Zachary Kenton
Shane Legg
Jan Leike
262
6
0
20 Jan 2022
On the Expressivity of Markov Reward
On the Expressivity of Markov RewardNeural Information Processing Systems (NeurIPS), 2021
David Abel
Will Dabney
Anna Harutyunyan
Mark K. Ho
Michael L. Littman
Doina Precup
Satinder Singh
293
98
0
01 Nov 2021
Counterfactual Planning in AGI Systems
Counterfactual Planning in AGI Systems
K. Holtman
182
4
0
29 Jan 2021
1
Page 1 of 1