ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2007.10297
  4. Cited By
A Short Note on Soft-max and Policy Gradients in Bandits Problems

A Short Note on Soft-max and Policy Gradients in Bandits Problems

20 July 2020
N. Walton
ArXivPDFHTML

Papers citing "A Short Note on Soft-max and Policy Gradients in Bandits Problems"

1 / 1 papers shown
Title
On the Global Convergence Rates of Softmax Policy Gradient Methods
On the Global Convergence Rates of Softmax Policy Gradient Methods
Jincheng Mei
Chenjun Xiao
Csaba Szepesvári
Dale Schuurmans
30
275
0
13 May 2020
1