ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.04257
  4. Cited By
Deep Reinforcement Learning from Policy-Dependent Human Feedback

Deep Reinforcement Learning from Policy-Dependent Human Feedback

12 February 2019
Dilip Arumugam
Jun Ki Lee
S. Saskin
Michael L. Littman
ArXiv (abs)PDFHTML

Papers citing "Deep Reinforcement Learning from Policy-Dependent Human Feedback"

15 / 65 papers shown
GAN-Based Interactive Reinforcement Learning from Demonstration and
  Human Evaluative Feedback
GAN-Based Interactive Reinforcement Learning from Demonstration and Human Evaluative FeedbackIEEE International Conference on Robotics and Automation (ICRA), 2021
Jie Huang
Rongshun Juan
R. Gomez
Keisuke Nakamura
Q. Sha
Bo He
Guangliang Li
210
13
0
14 Apr 2021
Learning Online from Corrective Feedback: A Meta-Algorithm for Robotics
Learning Online from Corrective Feedback: A Meta-Algorithm for Robotics
Matt Schmittle
Sanjiban Choudhury
S. Srinivasa
120
3
0
02 Apr 2021
An overview of 11 proposals for building safe advanced AI
An overview of 11 proposals for building safe advanced AI
Evan Hubinger
AAML
161
27
0
04 Dec 2020
Avoiding Tampering Incentives in Deep RL via Decoupled Approval
Avoiding Tampering Incentives in Deep RL via Decoupled Approval
J. Uesato
Ramana Kumar
Victoria Krakovna
Tom Everitt
Richard Ngo
Shane Legg
228
18
0
17 Nov 2020
Human Engagement Providing Evaluative and Informative Advice for
  Interactive Reinforcement Learning
Human Engagement Providing Evaluative and Informative Advice for Interactive Reinforcement Learning
Adam Bignold
Francisco Cruz
Richard Dazeley
Peter Vamplew
Cameron Foale
256
21
0
21 Sep 2020
Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning
  Systems
Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning Systems
Vinicius G. Goecks
249
12
0
30 Aug 2020
Battlesnake Challenge: A Multi-agent Reinforcement Learning Playground
  with Human-in-the-loop
Battlesnake Challenge: A Multi-agent Reinforcement Learning Playground with Human-in-the-loop
Jonathan Chung
Anna Luo
Xavier Raffin
Scott Perry
OffRL
120
3
0
20 Jul 2020
Widening the Pipeline in Human-Guided Reinforcement Learning with
  Explanation and Context-Aware Data Augmentation
Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation
L. Guan
Mudit Verma
Sihang Guo
Ruohan Zhang
Subbarao Kambhampati
358
53
0
26 Jun 2020
Retrospective Analysis of the 2019 MineRL Competition on Sample
  Efficient Reinforcement Learning
Retrospective Analysis of the 2019 MineRL Competition on Sample Efficient Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2020
Stephanie Milani
Nicholay Topin
Brandon Houghton
William H. Guss
Sharada Mohanty
Keisuke Nakata
Oriol Vinyals
N. Kuno
OffRL
488
28
0
10 Mar 2020
Facial Feedback for Reinforcement Learning: A Case Study and Offline
  Analysis Using the TAMER Framework
Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER FrameworkAutonomous Agents and Multi-Agent Systems (AAMAS), 2020
Guangliang Li
H. Dibeklioğlu
Shimon Whiteson
Hayley Hung
OffRLCVBM
81
26
0
23 Jan 2020
FRESH: Interactive Reward Shaping in High-Dimensional State Spaces using
  Human Feedback
FRESH: Interactive Reward Shaping in High-Dimensional State Spaces using Human FeedbackAdaptive Agents and Multi-Agent Systems (AAMAS), 2020
Baicen Xiao
Qifan Lu
Bhaskar Ramasubramanian
Andrew Clark
L. Bushnell
Radha Poovendran
190
26
0
19 Jan 2020
Learning to Interactively Learn and Assist
Learning to Interactively Learn and AssistAAAI Conference on Artificial Intelligence (AAAI), 2019
Mark P. Woodward
Chelsea Finn
Karol Hausman
280
35
0
24 Jun 2019
Robot Learning via Human Adversarial Games
Robot Learning via Human Adversarial GamesIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2019
Jiali Duan
Qian Wang
Lerrel Pinto
C.-C. Jay Kuo
Stefanos Nikolaidis
AAMLSSL
133
8
0
02 Mar 2019
Scalable agent alignment via reward modeling: a research direction
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
379
527
0
19 Nov 2018
DQN-TAMER: Human-in-the-Loop Reinforcement Learning with Intractable
  Feedback
DQN-TAMER: Human-in-the-Loop Reinforcement Learning with Intractable Feedback
Riku Arakawa
Sosuke Kobayashi
Y. Unno
Yuta Tsuboi
S. Maeda
168
86
0
28 Oct 2018
Previous
12
Page 2 of 2