ResearchTrend.AI
arXiv: 2501.05407
On-line Policy Improvement using Monte-Carlo Search

9 January 2025
Gerald Tesauro
Gregory R. Galperin

Papers citing "On-line Policy Improvement using Monte-Carlo Search"

2 / 52 papers shown
Title
Fast Reinforcement Learning with Large Action Sets using Error-Correcting Output Codes for MDP Factorization
Gabriel Dulac-Arnold
Ludovic Denoyer
Philippe Preux
Patrick Gallinari
OffRL
60
24
0
29 Feb 2012
Approximate Policy Iteration with a Policy Language Bias: Solving Relational Markov Decision Processes
Alan Fern
R. Givan
S. Yoon
61
63
0
09 Sep 2011