ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2208.04466
  4. Cited By
Optimal scheduling of entropy regulariser for continuous-time
  linear-quadratic reinforcement learning

Optimal scheduling of entropy regulariser for continuous-time linear-quadratic reinforcement learning

8 August 2022
Lukasz Szpruch
Tanut Treetanthiploet
Yufei Zhang
ArXivPDFHTML

Papers citing "Optimal scheduling of entropy regulariser for continuous-time linear-quadratic reinforcement learning"

4 / 4 papers shown
Title
Policy Gradient Converges to the Globally Optimal Policy for Nearly Linear-Quadratic Regulators
Policy Gradient Converges to the Globally Optimal Policy for Nearly Linear-Quadratic Regulators
Yin-Huan Han
Meisam Razaviyayn
Renyuan Xu
22
5
0
15 Mar 2023
Statistical Learning with Sublinear Regret of Propagator Models
Statistical Learning with Sublinear Regret of Propagator Models
Eyal Neuman
Yufei Zhang
32
7
0
12 Jan 2023
Square-root regret bounds for continuous-time episodic Markov decision
  processes
Square-root regret bounds for continuous-time episodic Markov decision processes
Xuefeng Gao
X. Zhou
40
6
0
03 Oct 2022
Logarithmic regret bounds for continuous-time average-reward Markov
  decision processes
Logarithmic regret bounds for continuous-time average-reward Markov decision processes
Xuefeng Gao
X. Zhou
29
8
0
23 May 2022
1