ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.03585
  4. Cited By
On the Convergence of the Monte Carlo Exploring Starts Algorithm for
  Reinforcement Learning
v1v2 (latest)

On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning

10 February 2020
Che Wang
Shuhan Yuan
Kai Shao
George Andriopoulos
ArXiv (abs)PDFHTML

Papers citing "On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning"

8 / 8 papers shown
Finite-Sample Analysis of the Monte Carlo Exploring Starts Algorithm for
  Reinforcement Learning
Finite-Sample Analysis of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning
Suei-Wen Chen
Keith Ross
Pierre Youssef
276
1
0
03 Oct 2024
Online Learning of Decision Trees with Thompson Sampling
Online Learning of Decision Trees with Thompson SamplingInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Ayman Chaouki
Jesse Read
Nikolaos Perrakis
155
3
0
09 Apr 2024
On The Convergence Of Policy Iteration-Based Reinforcement Learning With
  Monte Carlo Policy Evaluation
On The Convergence Of Policy Iteration-Based Reinforcement Learning With Monte Carlo Policy EvaluationInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Anna Winnicki
R. Srikant
272
12
0
23 Jan 2023
On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Zixuan Dong
Che Wang
George Andriopoulos
321
3
0
07 Sep 2022
Applications of Reinforcement Learning in Finance -- Trading with a
  Double Deep Q-Network
Applications of Reinforcement Learning in Finance -- Trading with a Double Deep Q-Network
Frensi Zejnullahu
Maurice Moser
Joerg Osterrieder
AIFin
197
7
0
28 Jun 2022
Slowly Changing Adversarial Bandit Algorithms are Efficient for
  Discounted MDPs
Slowly Changing Adversarial Bandit Algorithms are Efficient for Discounted MDPsInternational Conference on Algorithmic Learning Theory (ALT), 2022
Ian A. Kash
L. Reyzin
Zishun Yu
453
1
0
18 May 2022
Improved Exploring Starts by Kernel Density Estimation-Based State-Space
  Coverage Acceleration in Reinforcement Learning
Improved Exploring Starts by Kernel Density Estimation-Based State-Space Coverage Acceleration in Reinforcement Learning
Maximilian Schenke
Oliver Wallscheid
OffRL
276
7
0
19 May 2021
On the Convergence of Reinforcement Learning with Monte Carlo Exploring
  Starts
On the Convergence of Reinforcement Learning with Monte Carlo Exploring Starts
Jun Liu
107
17
0
21 Jul 2020
1
Page 1 of 1