1905.11527
Cited By
Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies
27 May 2019
Yonathan Efroni
Nadav Merlis
Mohammad Ghavamzadeh
Shie Mannor
OffRL
Papers citing "Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies"
16 / 16 papers shown
Settling the Sample Complexity of Online Reinforcement Learning
Zihan Zhang
Yuxin Chen
Jason D. Lee
S. Du
OffRL
95
21
0
25 Jul 2023
When to Update Your Model: Constrained Model-based Reinforcement Learning
Tianying Ji
Yu-Juan Luo
Fuchun Sun
Mingxuan Jing
Fengxiang He
Wen-bing Huang
16
18
0
15 Oct 2022
Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions
Shuang Qiu
Xiaohan Wei
Jieping Ye
Zhaoran Wang
Zhuoran Yang
OffRL
24
11
0
25 Jul 2022
Differentially Private Regret Minimization in Episodic Markov Decision Processes
Sayak Ray Chowdhury
Xingyu Zhou
21
21
0
20 Dec 2021
Adaptive Discretization in Online Reinforcement Learning
Sean R. Sinclair
Siddhartha Banerjee
C. Yu
OffRL
40
15
0
29 Oct 2021
Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning
Gen Li
Laixi Shi
Yuxin Chen
Yuejie Chi
OffRL
37
50
0
09 Oct 2021
Reinforcement Learning in Reward-Mixing MDPs
Jeongyeol Kwon
Yonathan Efroni
C. Caramanis
Shie Mannor
30
15
0
07 Oct 2021
Efficient Model-Based Multi-Agent Mean-Field Reinforcement Learning
Barna Pásztor
Ilija Bogunovic
Andreas Krause
23
41
0
08 Jul 2021
Learning Policies with Zero or Bounded Constraint Violation for Constrained MDPs
Tao-Wen Liu
Ruida Zhou
D. Kalathil
P. R. Kumar
Chao Tian
29
78
0
04 Jun 2021
Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings
Ming Yin
Yu-Xiang Wang
OffRL
29
19
0
13 May 2021
Cautiously Optimistic Policy Optimization and Exploration with Linear Function Approximation
Andrea Zanette
Ching-An Cheng
Alekh Agarwal
32
52
0
24 Mar 2021
UCB Momentum Q-learning: Correcting the bias without forgetting
Pierre Menard
O. D. Domingues
Xuedong Shang
Michal Valko
79
40
0
01 Mar 2021
Online Apprenticeship Learning
Lior Shani
Tom Zahavy
Shie Mannor
OffRL
24
25
0
13 Feb 2021
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
Sebastian Curi
Felix Berkenkamp
Andreas Krause
25
82
0
15 Jun 2020
Tightening Exploration in Upper Confidence Reinforcement Learning
Hippolyte Bourel
Odalric-Ambrym Maillard
M. S. Talebi
11
31
0
20 Apr 2020
Exploration-Exploitation in Constrained MDPs
Yonathan Efroni
Shie Mannor
Matteo Pirotta
14
169
0
04 Mar 2020