Deep Reinforcement Learning from Policy-Dependent Human Feedback

12 February 2019

Michael L. Littman

Papers citing "Deep Reinforcement Learning from Policy-Dependent Human Feedback"

15 / 65 papers shown

GAN-Based Interactive Reinforcement Learning from Demonstration and Human Evaluative FeedbackIEEE International Conference on Robotics and Automation (ICRA), 2021

210

14 Apr 2021

Learning Online from Corrective Feedback: A Meta-Algorithm for Robotics

Matt Schmittle

Sanjiban Choudhury

S. Srinivasa

120

02 Apr 2021

An overview of 11 proposals for building safe advanced AI

Evan Hubinger

AAML

161

04 Dec 2020

Avoiding Tampering Incentives in Deep RL via Decoupled Approval

228

17 Nov 2020

Human Engagement Providing Evaluative and Informative Advice for Interactive Reinforcement Learning

256

21 Sep 2020

Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning Systems

Vinicius G. Goecks

249

30 Aug 2020

Battlesnake Challenge: A Multi-agent Reinforcement Learning Playground with Human-in-the-loop

120

20 Jul 2020

Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation

Ruohan Zhang

358

26 Jun 2020

Retrospective Analysis of the 2019 MineRL Competition on Sample Efficient Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2020

488

10 Mar 2020

Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER FrameworkAutonomous Agents and Multi-Agent Systems (AAMAS), 2020

23 Jan 2020

FRESH: Interactive Reward Shaping in High-Dimensional State Spaces using Human FeedbackAdaptive Agents and Multi-Agent Systems (AAMAS), 2020

Baicen Xiao

Qifan Lu

Bhaskar Ramasubramanian

Andrew Clark

L. Bushnell

Radha Poovendran

190

19 Jan 2020

Learning to Interactively Learn and AssistAAAI Conference on Artificial Intelligence (AAAI), 2019

Mark P. Woodward

Chelsea Finn

Karol Hausman

280

24 Jun 2019

Robot Learning via Human Adversarial GamesIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2019

133

02 Mar 2019

Scalable agent alignment via reward modeling: a research direction

379

527

19 Nov 2018

DQN-TAMER: Human-in-the-Loop Reinforcement Learning with Intractable Feedback

168

28 Oct 2018