ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2108.02717
  4. Cited By
Beyond No Regret: Instance-Dependent PAC Reinforcement Learning

Beyond No Regret: Instance-Dependent PAC Reinforcement Learning

5 August 2021
Andrew Wagenmaker
Max Simchowitz
Kevin G. Jamieson
ArXivPDFHTML

Papers citing "Beyond No Regret: Instance-Dependent PAC Reinforcement Learning"

29 / 29 papers shown
Title
Minimax Optimal Reinforcement Learning with Quasi-Optimism
Harin Lee
Min-hwan Oh
OffRL
64
1
0
02 Mar 2025
Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition
Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition
Zhong Zheng
Haochen Zhang
Lingzhou Xue
OffRL
78
2
0
10 Oct 2024
State-free Reinforcement Learning
State-free Reinforcement Learning
Mingyu Chen
Aldo Pacchiano
Xuezhou Zhang
68
0
0
27 Sep 2024
Model-Free Active Exploration in Reinforcement Learning
Model-Free Active Exploration in Reinforcement Learning
Alessio Russo
Alexandre Proutiere
OffRL
25
2
0
30 Jun 2024
Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis
Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis
Qining Zhang
Honghao Wei
Lei Ying
OffRL
67
1
0
11 Jun 2024
Test-Time Regret Minimization in Meta Reinforcement Learning
Test-Time Regret Minimization in Meta Reinforcement Learning
Mirco Mutti
Aviv Tamar
26
4
0
04 Jun 2024
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in
  Tabular MDP
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Subhojyoti Mukherjee
Josiah P. Hanna
Robert Nowak
OffRL
53
0
0
04 Jun 2024
What Are the Odds? Improving the foundations of Statistical Model Checking
What Are the Odds? Improving the foundations of Statistical Model Checking
Tobias Meggendorfer
Maximilian Weininger
Patrick Wienhoft
44
4
0
08 Apr 2024
Sample Efficient Reinforcement Learning with Partial Dynamics Knowledge
Sample Efficient Reinforcement Learning with Partial Dynamics Knowledge
Meshal Alharbi
Mardavij Roozbehani
M. Dahleh
29
0
0
19 Dec 2023
Towards Instance-Optimality in Online PAC Reinforcement Learning
Towards Instance-Optimality in Online PAC Reinforcement Learning
Aymen Al Marjani
Andrea Tirinzoni
Emilie Kaufmann
OffRL
23
4
0
31 Oct 2023
Maximum diffusion reinforcement learning
Maximum diffusion reinforcement learning
Thomas A. Berrueta
Allison Pinosky
Todd D. Murphey
AI4CE
DiffM
19
5
0
26 Sep 2023
Active Coverage for PAC Reinforcement Learning
Active Coverage for PAC Reinforcement Learning
Aymen Al Marjani
Andrea Tirinzoni
E. Kaufmann
OffRL
21
4
0
23 Jun 2023
Variance-aware robust reinforcement learning with linear function
  approximation under heavy-tailed rewards
Variance-aware robust reinforcement learning with linear function approximation under heavy-tailed rewards
Xiang Li
Qiang Sun
34
8
0
09 Mar 2023
Leveraging Offline Data in Online Reinforcement Learning
Leveraging Offline Data in Online Reinforcement Learning
Andrew Wagenmaker
Aldo Pacchiano
OffRL
OnRL
35
38
0
09 Nov 2022
Hardness in Markov Decision Processes: Theory and Practice
Hardness in Markov Decision Processes: Theory and Practice
Michelangelo Conserva
Paulo E. Rauber
39
3
0
24 Oct 2022
The Role of Coverage in Online Reinforcement Learning
The Role of Coverage in Online Reinforcement Learning
Tengyang Xie
Dylan J. Foster
Yu Bai
Nan Jiang
Sham Kakade
OffRL
38
57
0
09 Oct 2022
Optimistic PAC Reinforcement Learning: the Instance-Dependent View
Optimistic PAC Reinforcement Learning: the Instance-Dependent View
Andrea Tirinzoni
Aymen Al Marjani
E. Kaufmann
19
12
0
12 Jul 2022
Instance-Dependent Near-Optimal Policy Identification in Linear MDPs via
  Online Experiment Design
Instance-Dependent Near-Optimal Policy Identification in Linear MDPs via Online Experiment Design
Andrew Wagenmaker
Kevin G. Jamieson
OffRL
34
26
0
06 Jul 2022
Instance-optimal PAC Algorithms for Contextual Bandits
Instance-optimal PAC Algorithms for Contextual Bandits
Zhao Li
Lillian J. Ratliff
Houssam Nassif
Kevin G. Jamieson
Lalit P. Jain
20
17
0
05 Jul 2022
Guarantees for Epsilon-Greedy Reinforcement Learning with Function
  Approximation
Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation
Christoph Dann
Yishay Mansour
M. Mohri
Ayush Sekhari
Karthik Sridharan
38
50
0
19 Jun 2022
Stabilizing Q-learning with Linear Architectures for Provably Efficient
  Learning
Stabilizing Q-learning with Linear Architectures for Provably Efficient Learning
Andrea Zanette
Martin J. Wainwright
OOD
45
5
0
01 Jun 2022
On Gap-dependent Bounds for Offline Reinforcement Learning
On Gap-dependent Bounds for Offline Reinforcement Learning
Xinqi Wang
Qiwen Cui
S. Du
OffRL
73
12
0
01 Jun 2022
ReVar: Strengthening Policy Evaluation via Reduced Variance Sampling
ReVar: Strengthening Policy Evaluation via Reduced Variance Sampling
Subhojyoti Mukherjee
Josiah P. Hanna
Robert D. Nowak
OffRL
29
12
0
09 Mar 2022
Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov
  Decision Processes
Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes
Andrew Wagenmaker
Yifang Chen
Max Simchowitz
S. Du
Kevin G. Jamieson
21
48
0
26 Jan 2022
Can Q-Learning be Improved with Advice?
Can Q-Learning be Improved with Advice?
Noah Golowich
Ankur Moitra
OffRL
19
12
0
25 Oct 2021
Gap-Dependent Unsupervised Exploration for Reinforcement Learning
Gap-Dependent Unsupervised Exploration for Reinforcement Learning
Jingfeng Wu
Vladimir Braverman
Lin F. Yang
33
12
0
11 Aug 2021
Navigating to the Best Policy in Markov Decision Processes
Navigating to the Best Policy in Markov Decision Processes
Aymen Al Marjani
Aurélien Garivier
Alexandre Proutiere
37
21
0
05 Jun 2021
Provably Efficient Reinforcement Learning with Linear Function
  Approximation Under Adaptivity Constraints
Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints
Chi Jin
Zhuoran Yang
Zhaoran Wang
OffRL
122
167
0
06 Jan 2021
Reward-Free Exploration for Reinforcement Learning
Reward-Free Exploration for Reinforcement Learning
Chi Jin
A. Krishnamurthy
Max Simchowitz
Tiancheng Yu
OffRL
112
194
0
07 Feb 2020
1