arXiv:2002.02794
Reward-Free Exploration for Reinforcement Learning
International Conference on Machine Learning (ICML), 2020
7 February 2020
Chi Jin
A. Krishnamurthy
Max Simchowitz
Tiancheng Yu
OffRL
Papers citing "Reward-Free Exploration for Reinforcement Learning"
50 / 159 papers shown
Rate optimal learning of equilibria from data
Till Freihaut
Luca Viano
Emanuele Nevali
Volkan Cevher
Matthieu Geist
Giorgia Ramponi
141
0
0
10 Oct 2025
Q-Learning with Fine-Grained Gap-Dependent Regret
Haochen Zhang
Zhong Zheng
Lingzhou Xue
206
1
0
08 Oct 2025
Instance-Dependent Continuous-Time Reinforcement Learning via Maximum Likelihood Estimation
Runze Zhao
Yue Yu
Ruhan Wang
Chunfeng Huang
Dongruo Zhou
268
1
0
04 Aug 2025
Mixture of Autoencoder Experts Guidance using Unlabeled and Incomplete Data for Exploration in Reinforcement Learning
Elias Malomgré
Pieter Simoens
OffRL
191
1
0
21 Jul 2025
Statistical and Algorithmic Foundations of Reinforcement Learning
Yuejie Chi
Yuxin Chen
Yuting Wei
OffRL
278
3
0
19 Jul 2025
Learning Policy Committees for Effective Personalization in MDPs with Diverse Tasks
Luise Ge
Michael Lanier
Anindya Sarkar
Bengisu Guresti
Yevgeniy Vorobeychik
Chongjie Zhang
470
3
0
26 Feb 2025
DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems
Se-Wook Yoo
Seung-Woo Seo
433
0
0
30 Jan 2025
Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data
International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Chengrui Qu
Laixi Shi
Kishan Panaganti
Pengcheng You
Adam Wierman
OffRL
OnRL
312
8
0
06 Nov 2024
Learning in Markov Games with Adaptive Adversaries: Policy Regret, Fundamental Barriers, and Efficient Algorithms
Neural Information Processing Systems (NeurIPS), 2024
Thanh Nguyen-Tang
Raman Arora
447
1
0
01 Nov 2024
Can we hop in general? A discussion of benchmark selection and design using the Hopper environment
C. Voelcker
Marcel Hussing
Eric Eaton
OffRL
389
7
0
11 Oct 2024
Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition
International Conference on Learning Representations (ICLR), 2024
Zhong Zheng
Haochen Zhang
Lingzhou Xue
OffRL
457
9
0
10 Oct 2024
Task-Agnostic Pre-training and Task-Guided Fine-tuning for Versatile Diffusion Planner
Chenyou Fan
Chenjia Bai
Zhao Shan
Haoran He
Yang Zhang
Zhen Wang
462
4
0
30 Sep 2024
Advances in Preference-based Reinforcement Learning: A Review
IEEE International Conference on Systems, Man and Cybernetics (SMC), 2022
Youssef Abdelkareem
Shady Shehata
Fakhri Karray
OffRL
304
18
0
21 Aug 2024
Efficient Reinforcement Learning in Probabilistic Reward Machines
AAAI Conference on Artificial Intelligence (AAAI), 2024
Xiaofeng Lin
Xuezhou Zhang
317
3
0
19 Aug 2024
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Huajian Xin
Zhaochun Ren
Junxiao Song
Zhihong Shao
Wanjia Zhao
...
Dejian Yang
Zhibin Gou
Z. F. Wu
Fuli Luo
Chong Ruan
AIMat
LRM
370
154
0
15 Aug 2024
Problem Solving Through Human-AI Preference-Based Cooperation
Computational Linguistics (CL), 2024
Subhabrata Dutta
Timo Kaufmann
Goran Glavaš
Ivan Habernal
Kristian Kersting
Frauke Kreuter
Mira Mezini
Iryna Gurevych
Eyke Hüllermeier
Hinrich Schuetze
1.1K
9
0
14 Aug 2024
A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or Subgoals
International Conference on Learning Representations (ICLR), 2024
Grace Liu
Michael Tang
Benjamin Eysenbach
OffRL
451
16
0
11 Aug 2024
Uncertainty-Aware Reward-Free Exploration with General Function Approximation
Junkai Zhang
Weitong Zhang
Dongruo Zhou
Q. Gu
498
6
0
24 Jun 2024
Beyond Optimism: Exploration With Partially Observable Rewards
Simone Parisi
Alireza Kazemipour
Michael Bowling
OffRL
300
7
0
20 Jun 2024
Hybrid Reinforcement Learning from Offline Observation Alone
Yuda Song
J. Andrew Bagnell
Aarti Singh
OffRL
353
6
0
11 Jun 2024
How to Explore with Belief: State Entropy Maximization in POMDPs
Riccardo Zamboni
Duilio Cirino
Marcello Restelli
Mirco Mutti
287
6
0
04 Jun 2024
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation
Jeongyeol Kwon
Shie Mannor
Constantine Caramanis
Yonathan Efroni
OffRL
453
6
0
03 Jun 2024
Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff
Jian Qian
Haichen Hu
David Simchi-Levi
292
6
0
28 May 2024
What Are the Odds? Improving the foundations of Statistical Model Checking
Tobias Meggendorfer
Maximilian Weininger
Patrick Wienhoft
507
8
0
08 Apr 2024
Multiple-policy Evaluation via Density Estimation
Yilei Chen
Aldo Pacchiano
I. Paschalidis
OffRL
444
1
0
29 Mar 2024
Horizon-Free Regret for Linear Markov Decision Processes
Zihan Zhang
Jason D. Lee
Yuxin Chen
Simon S. Du
260
4
0
15 Mar 2024
Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
Ziping Xu
Zifan Xu
Runxuan Jiang
Peter Stone
Ambuj Tewari
431
2
0
03 Mar 2024
Scale-free Adversarial Reinforcement Learning
Mingyu Chen
Xuezhou Zhang
352
2
0
01 Mar 2024
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
Tianying Ji
Yongyuan Liang
Yan Zeng
Yu-Juan Luo
Guowei Xu
Jiawei Guo
Ruijie Zheng
Furong Huang
Gang Hua
Huazhe Xu
CML
386
20
0
22 Feb 2024
Offline Multi-task Transfer RL with Representational Penalization
Avinandan Bose
S. S. Du
Maryam Fazel
OffRL
386
13
0
19 Feb 2024
Compressing Deep Reinforcement Learning Networks with a Dynamic Structured Pruning Method for Autonomous Driving
Wensheng Su
Zhenni Li
Minrui Xu
Jiawen Kang
Dusit Niyato
Shengli Xie
228
16
0
07 Feb 2024
Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints
Dan Qiao
Yu Wang
OffRL
335
5
0
02 Feb 2024
Experiment Planning with Function Approximation
Neural Information Processing Systems (NeurIPS), 2024
Aldo Pacchiano
Jonathan Lee
Emma Brunskill
OffRL
239
6
0
10 Jan 2024
Accelerating Exploration with Unlabeled Prior Data
Qiyang Li
Jason Zhang
Dibya Ghosh
Amy Zhang
Sergey Levine
OffRL
OnRL
473
18
0
09 Nov 2023
DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization
International Conference on Learning Representations (ICLR), 2023
Guowei Xu
Ruijie Zheng
Yongyuan Liang
Xiyao Wang
Zhecheng Yuan
...
Shuzhen Li
Yanjie Ze
Hal Daumé
Furong Huang
Huazhe Xu
394
53
0
30 Oct 2023
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRL
OnRL
343
4
0
12 Oct 2023
Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning
International Conference on Learning Representations (ICLR), 2023
Mirco Mutti
Ric De Santi
Marcello Restelli
Alexander Marx
Giorgia Ramponi
CML
374
6
0
11 Oct 2023
Spectral Entry-wise Matrix Estimation for Low-Rank Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Stefan Stojanovic
Yassir Jedra
Alexandre Proutière
363
5
0
10 Oct 2023
When is Agnostic Reinforcement Learning Statistically Tractable?
Neural Information Processing Systems (NeurIPS), 2023
Zeyu Jia
Gene Li
Alexander Rakhlin
Ayush Sekhari
Nathan Srebro
OffRL
383
7
0
09 Oct 2023
Learning to Make Adherence-Aware Advice
International Conference on Learning Representations (ICLR), 2023
Guanting Chen
Xiaocheng Li
Chunlin Sun
Hanzhao Wang
280
16
0
01 Oct 2023
Zero-Shot Reinforcement Learning from Low Quality Data
Neural Information Processing Systems (NeurIPS), 2023
Scott Jeen
Tom Bewley
Jonathan M. Cullen
OffRL
OnRL
404
17
0
26 Sep 2023
FoX: Formation-aware exploration in multi-agent reinforcement learning
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yonghyeon Jo
Sunwoo Lee
Junghyuk Yum
Seungyul Han
383
20
0
22 Aug 2023
Settling the Sample Complexity of Online Reinforcement Learning
Annual Conference Computational Learning Theory (COLT), 2023
Zihan Zhang
Yuxin Chen
Jason D. Lee
S. Du
OffRL
907
42
0
25 Jul 2023
Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline Data
Neural Information Processing Systems (NeurIPS), 2023
Ruiqi Zhang
Andrea Zanette
OffRL
OnRL
347
11
0
10 Jul 2023
Is RLHF More Difficult than Standard RL?
Neural Information Processing Systems (NeurIPS), 2023
Yuanhao Wang
Qinghua Liu
Chi Jin
OffRL
413
89
0
25 Jun 2023
Offline Policy Evaluation for Reinforcement Learning with Adaptively Collected Data
Sunil Madhow
Dan Xiao
Ming Yin
Yu-Xiang Wang
OffRL
321
0
0
24 Jun 2023
Active Coverage for PAC Reinforcement Learning
Annual Conference Computational Learning Theory (COLT), 2023
Aymen Al Marjani
Andrea Tirinzoni
E. Kaufmann
OffRL
263
7
0
23 Jun 2023
Optimistic Active Exploration of Dynamical Systems
Neural Information Processing Systems (NeurIPS), 2023
Bhavya Sukhija
Lenart Treven
Cansu Sancaktar
Sebastian Blaes
Stelian Coros
Andreas Krause
624
32
0
21 Jun 2023
Provably Efficient Adversarial Imitation Learning with Unknown Transitions
Conference on Uncertainty in Artificial Intelligence (UAI), 2023
Tian Xu
Ziniu Li
Yang Yu
Zhimin Luo
186
11
0
11 Jun 2023
Provable Reward-Agnostic Preference-Based Reinforcement Learning
International Conference on Learning Representations (ICLR), 2023
Wenhao Zhan
Masatoshi Uehara
Wen Sun
Jason D. Lee
468
14
0
29 May 2023