ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.09497
  4. Cited By
Task-agnostic Exploration in Reinforcement Learning

Task-agnostic Exploration in Reinforcement Learning

16 June 2020
Xuezhou Zhang
Yuzhe Ma
Adish Singla
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Task-agnostic Exploration in Reinforcement Learning"

37 / 37 papers shown
Statistical and Algorithmic Foundations of Reinforcement Learning
Statistical and Algorithmic Foundations of Reinforcement Learning
Yuejie Chi
Yuxin Chen
Yuting Wei
OffRL
275
2
0
19 Jul 2025
DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems
DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems
Se-Wook Yoo
Seung-Woo Seo
431
0
0
30 Jan 2025
Efficient Reinforcement Learning in Probabilistic Reward Machines
Efficient Reinforcement Learning in Probabilistic Reward MachinesAAAI Conference on Artificial Intelligence (AAAI), 2024
Xiaofeng Lin
Xuezhou Zhang
310
2
0
19 Aug 2024
Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis
Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis
Qining Zhang
Honghao Wei
Lei Ying
OffRL
462
3
0
11 Jun 2024
Offline Multi-task Transfer RL with Representational Penalization
Offline Multi-task Transfer RL with Representational Penalization
Avinandan Bose
S. S. Du
Maryam Fazel
OffRL
381
13
0
19 Feb 2024
Informativeness of Reward Functions in Reinforcement Learning
Informativeness of Reward Functions in Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2024
R. Devidze
Parameswaran Kamalaruban
Adish Singla
276
3
0
10 Feb 2024
Near-Optimal Reinforcement Learning with Self-Play under Adaptivity
  Constraints
Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints
Dan Qiao
Yu Wang
OffRL
331
4
0
02 Feb 2024
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate
  Exploration Bias
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRLOnRL
340
4
0
12 Oct 2023
Between accurate prediction and poor decision making: the AI/ML gap
Between accurate prediction and poor decision making: the AI/ML gapInternational Conference on Machine Learning, Optimization, and Data Science (MOD), 2023
Gianluca Bontempi
163
1
0
03 Oct 2023
Learning to Make Adherence-Aware Advice
Learning to Make Adherence-Aware AdviceInternational Conference on Learning Representations (ICLR), 2023
Guanting Chen
Xiaocheng Li
Chunlin Sun
Hanzhao Wang
279
15
0
01 Oct 2023
Policy Finetuning in Reinforcement Learning via Design of Experiments
  using Offline Data
Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline DataNeural Information Processing Systems (NeurIPS), 2023
Ruiqi Zhang
Andrea Zanette
OffRLOnRL
341
11
0
10 Jul 2023
On the Importance of Exploration for Generalization in Reinforcement
  Learning
On the Importance of Exploration for Generalization in Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Yiding Jiang
J. Zico Kolter
Roberta Raileanu
UQCVOffRL
229
40
0
08 Jun 2023
Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid
  Reinforcement Learning
Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Gen Li
Wenhao Zhan
Jason D. Lee
Yuejie Chi
Yuxin Chen
OffRLOnRL
333
17
0
17 May 2023
Provably Feedback-Efficient Reinforcement Learning via Active Reward
  Learning
Provably Feedback-Efficient Reinforcement Learning via Active Reward LearningNeural Information Processing Systems (NeurIPS), 2023
Dingwen Kong
Lin F. Yang
280
18
0
18 Apr 2023
Minimax-Optimal Reward-Agnostic Exploration in Reinforcement Learning
Minimax-Optimal Reward-Agnostic Exploration in Reinforcement LearningAnnual Conference Computational Learning Theory (COLT), 2023
Gen Li
Yuling Yan
Yuxin Chen
Jianqing Fan
OffRL
368
16
0
14 Apr 2023
The Role of Coverage in Online Reinforcement Learning
The Role of Coverage in Online Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2022
Tengyang Xie
Dylan J. Foster
Yu Bai
Nan Jiang
Sham Kakade
OffRL
419
77
0
09 Oct 2022
Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning
  with Linear Function Approximation
Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning with Linear Function ApproximationInternational Conference on Learning Representations (ICLR), 2022
Dan Qiao
Yu Wang
OffRL
338
15
0
03 Oct 2022
Task-Agnostic Learning to Accomplish New Tasks
Task-Agnostic Learning to Accomplish New TasksIEEE Transactions on Cognitive and Developmental Systems (IEEE TCDS), 2022
Xianqi Zhang
Xingtao Wang
Xu Liu
Wenrui Wang
Xiaopeng Fan
Debin Zhao
OffRL
619
0
0
09 Sep 2022
On the Statistical Efficiency of Reward-Free Exploration in Non-Linear
  RL
On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RLNeural Information Processing Systems (NeurIPS), 2022
Jinglin Chen
Aditya Modi
A. Krishnamurthy
Nan Jiang
Alekh Agarwal
415
29
0
21 Jun 2022
SEREN: Knowing When to Explore and When to Exploit
SEREN: Knowing When to Explore and When to Exploit
Changmin Yu
D. Mguni
Dong Li
Aivar Sootla
Jun Wang
Neil Burgess
154
1
0
30 May 2022
Provable Benefits of Representational Transfer in Reinforcement Learning
Provable Benefits of Representational Transfer in Reinforcement LearningAnnual Conference Computational Learning Theory (COLT), 2022
Alekh Agarwal
Yuda Song
Wen Sun
Kaiwen Wang
Mengdi Wang
Xuezhou Zhang
OffRL
358
40
0
29 May 2022
Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost
Sample-Efficient Reinforcement Learning with loglog(T) Switching CostInternational Conference on Machine Learning (ICML), 2022
Dan Qiao
Ming Yin
Ming Min
Yu Wang
291
35
0
13 Feb 2022
Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement
  Learning
Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement Learning
Ziyang Tang
Yihao Feng
Qiang Liu
OffRL
301
1
0
01 Jan 2022
Unsupervised Reinforcement Learning in Multiple Environments
Unsupervised Reinforcement Learning in Multiple Environments
Mirco Mutti
Mattia Mancassola
Marcello Restelli
OffRL
213
30
0
16 Dec 2021
Adaptive Multi-Goal Exploration
Adaptive Multi-Goal Exploration
Jean Tarbouriech
O. D. Domingues
Pierre Ménard
Matteo Pirotta
Michal Valko
A. Lazaric
361
4
0
23 Nov 2021
Gap-Dependent Unsupervised Exploration for Reinforcement Learning
Gap-Dependent Unsupervised Exploration for Reinforcement LearningInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Jingfeng Wu
Vladimir Braverman
Lin F. Yang
285
12
0
11 Aug 2021
A Simple Reward-free Approach to Constrained Reinforcement Learning
A Simple Reward-free Approach to Constrained Reinforcement Learning
Sobhan Miryoosefi
Chi Jin
298
35
0
12 Jul 2021
Deep Learning for Embodied Vision Navigation: A Survey
Deep Learning for Embodied Vision Navigation: A Survey
Fengda Zhu
Yi Zhu
Vincent CS Lee
Xiaodan Liang
Xiaojun Chang
EgoVLM&Ro
591
0
0
07 Jul 2021
Optimal Uniform OPE and Model-based Offline Reinforcement Learning in
  Time-Homogeneous, Reward-Free and Task-Agnostic Settings
Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic SettingsNeural Information Processing Systems (NeurIPS), 2021
Ming Yin
Yu Wang
OffRL
316
19
0
13 May 2021
Reward Poisoning in Reinforcement Learning: Attacks Against Unknown
  Learners in Unknown Environments
Reward Poisoning in Reinforcement Learning: Attacks Against Unknown Learners in Unknown Environments
Amin Rakhsha
Xuezhou Zhang
Xiaojin Zhu
Adish Singla
AAMLOffRL
234
43
0
16 Feb 2021
Accommodating Picky Customers: Regret Bound and Exploration Complexity
  for Multi-Objective Reinforcement Learning
Accommodating Picky Customers: Regret Bound and Exploration Complexity for Multi-Objective Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2020
Jingfeng Wu
Vladimir Braverman
Lin F. Yang
322
11
0
25 Nov 2020
Nearly Minimax Optimal Reward-free Reinforcement Learning
Nearly Minimax Optimal Reward-free Reinforcement Learning
Zihan Zhang
S. Du
Xiangyang Ji
OffRL
248
32
0
12 Oct 2020
A Sharp Analysis of Model-based Reinforcement Learning with Self-Play
A Sharp Analysis of Model-based Reinforcement Learning with Self-PlayInternational Conference on Machine Learning (ICML), 2020
Qinghua Liu
Tiancheng Yu
Yu Bai
Chi Jin
357
136
0
04 Oct 2020
Fast active learning for pure exploration in reinforcement learning
Fast active learning for pure exploration in reinforcement learningInternational Conference on Machine Learning (ICML), 2020
Pierre Ménard
O. D. Domingues
Anders Jonsson
E. Kaufmann
Edouard Leurent
Michal Valko
256
108
0
27 Jul 2020
A Provably Efficient Sample Collection Strategy for Reinforcement
  Learning
A Provably Efficient Sample Collection Strategy for Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2020
Jean Tarbouriech
Matteo Pirotta
Michal Valko
A. Lazaric
OffRL
318
20
0
13 Jul 2020
On Reward-Free Reinforcement Learning with Linear Function Approximation
On Reward-Free Reinforcement Learning with Linear Function Approximation
Ruosong Wang
S. Du
Lin F. Yang
Ruslan Salakhutdinov
OffRL
290
115
0
19 Jun 2020
Adaptive Reward-Free Exploration
Adaptive Reward-Free ExplorationInternational Conference on Algorithmic Learning Theory (ALT), 2020
E. Kaufmann
Pierre Ménard
O. D. Domingues
Anders Jonsson
Edouard Leurent
Michal Valko
266
91
0
11 Jun 2020
1
Page 1 of 1