Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.00832
Cited By
Square-root regret bounds for continuous-time episodic Markov decision processes
3 October 2022
Xuefeng Gao
X. Zhou
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Square-root regret bounds for continuous-time episodic Markov decision processes"
5 / 5 papers shown
Title
Reinforcement Learning for Intensity Control: An Application to Choice-Based Network Revenue Management
Huiling Meng
Ningyuan Chen
Xuefeng Gao
55
1
0
08 Jun 2024
ε
ε
ε
-Policy Gradient for Online Pricing
Lukasz Szpruch
Tanut Treetanthiploet
Yufei Zhang
OffRL
44
1
0
06 May 2024
Statistical Learning with Sublinear Regret of Propagator Models
Eyal Neuman
Yufei Zhang
32
7
0
12 Jan 2023
Optimal scheduling of entropy regulariser for continuous-time linear-quadratic reinforcement learning
Lukasz Szpruch
Tanut Treetanthiploet
Yufei Zhang
8
8
0
08 Aug 2022
Logarithmic regret bounds for continuous-time average-reward Markov decision processes
Xuefeng Gao
X. Zhou
26
8
0
23 May 2022
1