ResearchTrend.AI
Unifying Value Iteration, Advantage Learning, and Dynamic Policy Programming

30 October 2017
Tadashi Kozuno
E. Uchibe
Kenji Doya

Papers citing "Unifying Value Iteration, Advantage Learning, and Dynamic Policy Programming"

1 / 1 papers shown
An Alternative Softmax Operator for Reinforcement Learning
Kavosh Asadi
Michael L. Littman
16 Dec 2016