Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2007.05456
Cited By
Improved Analysis of UCRL2 with Empirical Bernstein Inequality
10 July 2020
Ronan Fruit
Matteo Pirotta
A. Lazaric
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Improved Analysis of UCRL2 with Empirical Bernstein Inequality"
19 / 19 papers shown
Tail Distribution of Regret in Optimistic Reinforcement Learning
Sajad Khodadadian
Mehrdad Moharrami
150
0
0
23 Nov 2025
Statistical Guarantees for Offline Domain Randomization
Arnaud Fickinger
Abderrahim Bendahi
Stuart J. Russell
OffRL
345
0
0
11 Jun 2025
Model Selection for Average Reward RL with Application to Utility Maximization in Repeated Games
Alireza Masoumian
James R. Wright
587
2
0
09 Nov 2024
Learning Infinite-Horizon Average-Reward Linear Mixture MDPs of Bounded Span
International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Woojin Chae
Kihyuk Hong
Yufan Zhang
Ambuj Tewari
Dabeen Lee
208
1
0
19 Oct 2024
Achieving Tractable Minimax Optimal Regret in Average Reward MDPs
Victor Boone
Zihan Zhang
232
10
0
03 Jun 2024
Reinforcement Learning for Infinite-Horizon Average-Reward Linear MDPs via Approximation by Discounted-Reward MDPs
International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Kihyuk Hong
Yufan Zhang
Ambuj Tewari
Dabeen Lee
Ambuj Tewari
471
1
0
23 May 2024
On Reward Structures of Markov Decision Processes
Falcon Z. Dai
308
1
0
28 Aug 2023
A Cover Time Study of a non-Markovian Algorithm
Guanhua Fang
G. Samorodnitsky
Zhiqiang Xu
315
0
0
08 Jun 2023
Horizon-Free and Variance-Dependent Reinforcement Learning for Latent Markov Decision Processes
International Conference on Machine Learning (ICML), 2022
Runlong Zhou
Ruosong Wang
S. Du
428
3
0
20 Oct 2022
Optimism and Delays in Episodic Reinforcement Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Benjamin Howson
Ciara Pike-Burke
Sarah Filippi
268
8
0
15 Nov 2021
Understanding Domain Randomization for Sim-to-real Transfer
Xiaoyu Chen
Jiachen Hu
Chi Jin
Lihong Li
Liwei Wang
473
164
0
07 Oct 2021
Nearly Minimax Optimal Regret for Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Yue Wu
Dongruo Zhou
Quanquan Gu
227
23
0
15 Feb 2021
Improved Sample Complexity for Incremental Autonomous Exploration in MDPs
Neural Information Processing Systems (NeurIPS), 2020
Jean Tarbouriech
Matteo Pirotta
Michal Valko
A. Lazaric
222
13
0
29 Dec 2020
Local Differential Privacy for Regret Minimization in Reinforcement Learning
Evrard Garcelon
Vianney Perchet
Ciara Pike-Burke
Matteo Pirotta
397
42
0
15 Oct 2020
Improved Exploration in Factored Average-Reward MDPs
International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
M. S. Talebi
Anders Jonsson
Odalric-Ambrym Maillard
259
9
0
09 Sep 2020
Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation
International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Rahul Jain
385
53
0
23 Jul 2020
A Provably Efficient Sample Collection Strategy for Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2020
Jean Tarbouriech
Matteo Pirotta
Michal Valko
A. Lazaric
OffRL
334
20
0
13 Jul 2020
Tightening Exploration in Upper Confidence Reinforcement Learning
International Conference on Machine Learning (ICML), 2020
Hippolyte Bourel
Odalric-Ambrym Maillard
M. S. Talebi
355
38
0
20 Apr 2020
No-Regret Exploration in Goal-Oriented Reinforcement Learning
International Conference on Machine Learning (ICML), 2019
Jean Tarbouriech
Evrard Garcelon
Michal Valko
Matteo Pirotta
A. Lazaric
339
48
0
07 Dec 2019
1
Page 1 of 1