ResearchTrend.AI
Square-root regret bounds for continuous-time episodic Markov decision processes

3 October 2022
Xuefeng Gao
X. Zhou

Papers citing "Square-root regret bounds for continuous-time episodic Markov decision processes"

5 / 5 papers shown
Reinforcement Learning for Intensity Control: An Application to Choice-Based Network Revenue Management
Huiling Meng
Ningyuan Chen
Xuefeng Gao
55
1
0
08 Jun 2024
ε-Policy Gradient for Online Pricing
Lukasz Szpruch
Tanut Treetanthiploet
Yufei Zhang
OffRL
49
1
0
06 May 2024
Statistical Learning with Sublinear Regret of Propagator Models
Eyal Neuman
Yufei Zhang
35
7
0
12 Jan 2023
Optimal scheduling of entropy regulariser for continuous-time linear-quadratic reinforcement learning
Lukasz Szpruch
Tanut Treetanthiploet
Yufei Zhang
11
8
0
08 Aug 2022
Logarithmic regret bounds for continuous-time average-reward Markov decision processes
Xuefeng Gao
X. Zhou
29
8
0
23 May 2022