ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.06709
  4. Cited By
Active Reinforcement Learning: Observing Rewards at a Cost
v1v2 (latest)

Active Reinforcement Learning: Observing Rewards at a Cost

13 November 2020
David M. Krueger
Jan Leike
Owain Evans
J. Salvatier
ArXiv (abs)PDFHTML

Papers citing "Active Reinforcement Learning: Observing Rewards at a Cost"

21 / 21 papers shown
Efficient Reinforcement Learning from Human Feedback via Bayesian Preference Inference
Efficient Reinforcement Learning from Human Feedback via Bayesian Preference Inference
Matteo Cercola
Valeria Capretti
Simone Formentin
272
1
0
06 Nov 2025
Active Measuring in Reinforcement Learning With Delayed Negative Effects
Active Measuring in Reinforcement Learning With Delayed Negative Effects
Daiqi Gao
Ziping Xu
Aseel Rawashdeh
P. Klasnja
Susan Murphy
OffRL
113
1
0
16 Oct 2025
Which Rewards Matter? Reward Selection for Reinforcement Learning under Limited Feedback
Which Rewards Matter? Reward Selection for Reinforcement Learning under Limited Feedback
Shreyas Chaudhari
Renhao Zhang
Philip S. Thomas
Bruno Castro da Silva
OffRL
233
1
0
30 Sep 2025
Beyond Optimism: Exploration With Partially Observable Rewards
Beyond Optimism: Exploration With Partially Observable Rewards
Simone Parisi
Alireza Kazemipour
Michael Bowling
OffRL
246
7
0
20 Jun 2024
Batch Active Learning of Reward Functions from Human Preferences
Batch Active Learning of Reward Functions from Human Preferences
Erdem Biyik
Nima Anari
Dorsa Sadigh
365
15
0
24 Feb 2024
Reinforcement Learning from Human Feedback with Active Queries
Reinforcement Learning from Human Feedback with Active Queries
Kaixuan Ji
Jiafan He
Quanquan Gu
445
36
0
14 Feb 2024
Monitored Markov Decision Processes
Monitored Markov Decision ProcessesAdaptive Agents and Multi-Agent Systems (AAMAS), 2024
Simone Parisi
Montaser Mohammedalamen
Alireza Kazemipour
Matthew E. Taylor
Michael Bowling
OffRL
328
9
0
09 Feb 2024
Learning Computational Efficient Bots with Costly Features
Learning Computational Efficient Bots with Costly Features
Anthony Kobanda
Valliappan C. A.
Joshua Romoff
Ludovic Denoyer
OffRL
167
2
0
18 Aug 2023
Active Learning for Video Classification with Frame Level Queries
Active Learning for Video Classification with Frame Level QueriesIEEE International Joint Conference on Neural Network (IJCNN), 2023
D. Goswami
Shayok Chakraborty
VLM
171
2
0
10 Jul 2023
Active Vision Reinforcement Learning under Limited Visual Observability
Active Vision Reinforcement Learning under Limited Visual ObservabilityNeural Information Processing Systems (NeurIPS), 2023
Jinghuan Shang
Michael S. Ryoo
345
0
0
01 Jun 2023
Embodied Active Learning of Relational State Abstractions for Bilevel
  Planning
Embodied Active Learning of Relational State Abstractions for Bilevel Planning
Amber Li
Tom Silver
228
17
0
08 Mar 2023
Scientific Discovery and the Cost of Measurement -- Balancing
  Information and Cost in Reinforcement Learning
Scientific Discovery and the Cost of Measurement -- Balancing Information and Cost in Reinforcement Learning
C. Bellinger
Andriy Drozdyuk
Mark Crowley
Isaac Tamblyn
OffRL
235
10
0
14 Dec 2021
Reinforcement Learning for Selective Key Applications in Power Systems:
  Recent Advances and Future Challenges
Reinforcement Learning for Selective Key Applications in Power Systems: Recent Advances and Future ChallengesIEEE Transactions on Smart Grid (IEEE Trans. Smart Grid), 2021
Xin Chen
Guannan Qu
Yujie Tang
S. Low
Na Li
586
334
0
27 Jan 2021
AI Research Considerations for Human Existential Safety (ARCHES)
AI Research Considerations for Human Existential Safety (ARCHES)
Andrew Critch
David M. Krueger
236
65
0
30 May 2020
Active Measure Reinforcement Learning for Observation Cost Minimization
Active Measure Reinforcement Learning for Observation Cost Minimization
C. Bellinger
Rory Coles
Mark Crowley
Isaac Tamblyn
OffRL
170
28
0
26 May 2020
Learning to Request Guidance in Emergent Communication
Learning to Request Guidance in Emergent CommunicationConference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Benjamin Kolb
Leon Lang
H. Bartsch
Arwin Gansekoele
Raymond Koopmanschap
Leonardo Romor
David Speck
Mathijs Mul
Elia Bruni
165
0
0
11 Dec 2019
Self-Regulated Interactive Sequence-to-Sequence Learning
Self-Regulated Interactive Sequence-to-Sequence LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Julia Kreutzer
Stefan Riezler
137
8
0
11 Jul 2019
Scalable agent alignment via reward modeling: a research direction
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
496
546
0
19 Nov 2018
Batch Active Preference-Based Learning of Reward Functions
Batch Active Preference-Based Learning of Reward Functions
Erdem Biyik
Dorsa Sadigh
351
130
0
10 Oct 2018
Active Reinforcement Learning with Monte-Carlo Tree Search
Active Reinforcement Learning with Monte-Carlo Tree Search
Sebastian Schulze
Owain Evans
272
20
0
13 Mar 2018
Trial without Error: Towards Safe Reinforcement Learning via Human
  Intervention
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention
William Saunders
Girish Sastry
Andreas Stuhlmuller
Owain Evans
OffRL
264
262
0
17 Jul 2017
1
Page 1 of 1