Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1808.01813
Cited By
Regret Bounds for Reinforcement Learning via Markov Chain Concentration
6 August 2018
R. Ortner
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Regret Bounds for Reinforcement Learning via Markov Chain Concentration"
5 / 5 papers shown
Title
Improved Estimation of Relaxation Time in Non-reversible Markov Chains
Geoffrey Wolfer
A. Kontorovich
45
7
0
01 Sep 2022
Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints
Liyu Chen
R. Jain
Haipeng Luo
57
25
0
31 Jan 2022
Dueling RL: Reinforcement Learning with Trajectory Preferences
Aldo Pacchiano
Aadirupa Saha
Jonathan Lee
33
81
0
08 Nov 2021
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods
Xin Guo
Anran Hu
Junzi Zhang
OffRL
25
6
0
13 Sep 2021
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
107
99
0
15 Oct 2019
1