v1v2 (latest)

Active Reinforcement Learning: Observing Rewards at a Cost

13 November 2020

Papers citing "Active Reinforcement Learning: Observing Rewards at a Cost"

21 / 21 papers shown

Efficient Reinforcement Learning from Human Feedback via Bayesian Preference Inference

Matteo Cercola

Valeria Capretti

Simone Formentin

272

06 Nov 2025

Active Measuring in Reinforcement Learning With Delayed Negative Effects

113

16 Oct 2025

Which Rewards Matter? Reward Selection for Reinforcement Learning under Limited Feedback

Shreyas Chaudhari

Renhao Zhang

Philip S. Thomas

Bruno Castro da Silva

OffRL

233

30 Sep 2025

Beyond Optimism: Exploration With Partially Observable Rewards

246

20 Jun 2024

Batch Active Learning of Reward Functions from Human Preferences

Erdem Biyik

Nima Anari

Dorsa Sadigh

365

24 Feb 2024

Reinforcement Learning from Human Feedback with Active Queries

Kaixuan Ji

Jiafan He

Quanquan Gu

445

14 Feb 2024

Monitored Markov Decision ProcessesAdaptive Agents and Multi-Agent Systems (AAMAS), 2024

Simone Parisi

Montaser Mohammedalamen

328

09 Feb 2024

Learning Computational Efficient Bots with Costly Features

167

18 Aug 2023

Active Learning for Video Classification with Frame Level QueriesIEEE International Joint Conference on Neural Network (IJCNN), 2023

D. Goswami

Shayok Chakraborty

VLM

171

10 Jul 2023

Active Vision Reinforcement Learning under Limited Visual ObservabilityNeural Information Processing Systems (NeurIPS), 2023

Jinghuan Shang

Michael S. Ryoo

345

01 Jun 2023

Embodied Active Learning of Relational State Abstractions for Bilevel Planning

Amber Li

Tom Silver

228

08 Mar 2023

Scientific Discovery and the Cost of Measurement -- Balancing Information and Cost in Reinforcement Learning

235

14 Dec 2021

Reinforcement Learning for Selective Key Applications in Power Systems: Recent Advances and Future ChallengesIEEE Transactions on Smart Grid (IEEE Trans. Smart Grid), 2021

Na Li

586

334

27 Jan 2021

AI Research Considerations for Human Existential Safety (ARCHES)

Andrew Critch

David M. Krueger

236

30 May 2020

Active Measure Reinforcement Learning for Observation Cost Minimization

170

26 May 2020

Learning to Request Guidance in Emergent CommunicationConference on Empirical Methods in Natural Language Processing (EMNLP), 2019

165

11 Dec 2019

Self-Regulated Interactive Sequence-to-Sequence LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

Julia Kreutzer

Stefan Riezler

137

11 Jul 2019

Scalable agent alignment via reward modeling: a research direction

496

546

19 Nov 2018

Batch Active Preference-Based Learning of Reward Functions

Erdem Biyik

Dorsa Sadigh

351

130

10 Oct 2018

Active Reinforcement Learning with Monte-Carlo Tree Search

Sebastian Schulze

Owain Evans

272

13 Mar 2018

Trial without Error: Towards Safe Reinforcement Learning via Human Intervention

264

262

17 Jul 2017