arXiv:1808.01813
Regret Bounds for Reinforcement Learning via Markov Chain Concentration
6 August 2018
R. Ortner
Papers citing "Regret Bounds for Reinforcement Learning via Markov Chain Concentration"
6 / 6 papers shown
Improved Estimation of Relaxation Time in Non-reversible Markov Chains
Geoffrey Wolfer, A. Kontorovich
63 7 0
01 Sep 2022

Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints
Liyu Chen, R. Jain, Haipeng Luo
57 25 0
31 Jan 2022

Dueling RL: Reinforcement Learning with Trajectory Preferences
Aldo Pacchiano, Aadirupa Saha, Jonathan Lee
33 82 0
08 Nov 2021

Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods
Xin Guo, Anran Hu, Junzi Zhang
OffRL
28 6 0
13 Sep 2021

A Provably Efficient Sample Collection Strategy for Reinforcement Learning
Jean Tarbouriech, Matteo Pirotta, Michal Valko, A. Lazaric
OffRL
25 16 0
13 Jul 2020

Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei, Mehdi Jafarnia-Jahromi, Haipeng Luo, Hiteshi Sharma, R. Jain
107 100 0
15 Oct 2019