Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2007.10297
Cited By
A Short Note on Soft-max and Policy Gradients in Bandits Problems
20 July 2020
N. Walton
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Short Note on Soft-max and Policy Gradients in Bandits Problems"
1 / 1 papers shown
Title
On the Global Convergence Rates of Softmax Policy Gradient Methods
Jincheng Mei
Chenjun Xiao
Csaba Szepesvári
Dale Schuurmans
30
275
0
13 May 2020
1