ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1701.04079
  4. Cited By
Agent-Agnostic Human-in-the-Loop Reinforcement Learning

Agent-Agnostic Human-in-the-Loop Reinforcement Learning

15 January 2017
David Abel
J. Salvatier
Andreas Stuhlmuller
Owain Evans
ArXivPDFHTML

Papers citing "Agent-Agnostic Human-in-the-Loop Reinforcement Learning"

19 / 19 papers shown
Title
Learning from Active Human Involvement through Proxy Value Propagation
Learning from Active Human Involvement through Proxy Value Propagation
Zhenghao Peng
Wenjie Mo
Chenda Duan
Quanyi Li
Bolei Zhou
114
14
0
05 Feb 2025
Constrained Exploration in Reinforcement Learning with Optimality
  Preservation
Constrained Exploration in Reinforcement Learning with Optimality Preservation
Peter C. Y. Chen
16
0
0
05 Apr 2023
Guarded Policy Optimization with Imperfect Online Demonstrations
Guarded Policy Optimization with Imperfect Online Demonstrations
Zhenghai Xue
Zhenghao Peng
Quanyi Li
Zhihan Liu
Bolei Zhou
OffRL
53
10
0
03 Mar 2023
Computational Charisma -- A Brick by Brick Blueprint for Building
  Charismatic Artificial Intelligence
Computational Charisma -- A Brick by Brick Blueprint for Building Charismatic Artificial Intelligence
Björn W. Schuller
Shahin Amiriparian
A. Batliner
Alexander Gebhard
Maurice Gerczuk
Vincent Karas
Alexander Kathan
Lennart Seizer
Johanna Löchner
56
4
0
31 Dec 2022
Human Decision Makings on Curriculum Reinforcement Learning with
  Difficulty Adjustment
Human Decision Makings on Curriculum Reinforcement Learning with Difficulty Adjustment
Yilei Zeng
Jiali Duan
Yongqian Li
Emilio Ferrara
Lerrel Pinto
Chloe Kuo
Stefanos Nikolaidis
51
3
0
04 Aug 2022
A Reinforcement Learning-based Offensive semantics Censorship System for
  Chatbots
A Reinforcement Learning-based Offensive semantics Censorship System for Chatbots
Shaokang Cai
Dezhi Han
Zibin Zheng
Dun Li
NoelCrespi
AAML
28
1
0
13 Jul 2022
Recent Advances in Leveraging Human Guidance for Sequential
  Decision-Making Tasks
Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks
Ruohan Zhang
F. Torabi
Garrett A. Warnell
Peter Stone
88
28
0
13 Jul 2021
A Survey on Interactive Reinforcement Learning: Design Principles and
  Open Challenges
A Survey on Interactive Reinforcement Learning: Design Principles and Open Challenges
Christian Arzate Cruz
Takeo Igarashi
OffRL
17
94
0
27 May 2021
Novelty Search in Representational Space for Sample Efficient
  Exploration
Novelty Search in Representational Space for Sample Efficient Exploration
Ruo Yu Tao
Vincent François-Lavet
Joelle Pineau
35
43
0
28 Sep 2020
AI Research Considerations for Human Existential Safety (ARCHES)
AI Research Considerations for Human Existential Safety (ARCHES)
Andrew Critch
David M. Krueger
30
50
0
30 May 2020
A Survey on Dialog Management: Recent Advances and Challenges
A Survey on Dialog Management: Recent Advances and Challenges
Yinpei Dai
Huihua Yu
Yixuan Jiang
Chengguang Tang
Yongbin Li
Jian Sun
OffRL
VLM
35
20
0
05 May 2020
Firearm Detection and Segmentation Using an Ensemble of Semantic Neural
  Networks
Firearm Detection and Segmentation Using an Ensemble of Semantic Neural Networks
Alexander Egiazarov
Vasileios Mavroeidis
Fabio Massimo Zennaro
Kamer Vishi
18
19
0
11 Feb 2020
Leveraging Human Guidance for Deep Reinforcement Learning Tasks
Leveraging Human Guidance for Deep Reinforcement Learning Tasks
Ruohan Zhang
F. Torabi
L. Guan
D. Ballard
Peter Stone
19
87
0
21 Sep 2019
Reinforcement Learning in Healthcare: A Survey
Reinforcement Learning in Healthcare: A Survey
Chao Yu
Jiming Liu
S. Nemati
LM&MA
OffRL
33
551
0
22 Aug 2019
Scalable agent alignment via reward modeling: a research direction
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
36
397
0
19 Nov 2018
Learning Shaping Strategies in Human-in-the-loop Interactive
  Reinforcement Learning
Learning Shaping Strategies in Human-in-the-loop Interactive Reinforcement Learning
Chao Yu
Tianpei Yang
Wenxuan Zhu
Dongxu Wang
Guangliang Li
OffRL
19
7
0
10 Nov 2018
Unity: A General Platform for Intelligent Agents
Unity: A General Platform for Intelligent Agents
Arthur Juliani
Vincent-Pierre Berges
Esh Vckay
Andrew Cohen
Jonathan Harper
...
Chris Goy
Yuan Gao
Hunter Henry
Marwan Mattar
Danny Lange
39
808
0
07 Sep 2018
Trial without Error: Towards Safe Reinforcement Learning via Human
  Intervention
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention
William Saunders
Girish Sastry
Andreas Stuhlmuller
Owain Evans
OffRL
35
229
0
17 Jul 2017
Safe Exploration in Markov Decision Processes
Safe Exploration in Markov Decision Processes
T. Moldovan
Pieter Abbeel
83
308
0
22 May 2012
1