A Short Note on Soft-max and Policy Gradients in Bandits Problems

20 July 2020

Papers citing "A Short Note on Soft-max and Policy Gradients in Bandits Problems"

Title
On the Global Convergence Rates of Softmax Policy Gradient Methods
Jincheng Mei, Chenjun Xiao, Csaba Szepesvári, Dale Schuurmans
30 275 0
13 May 2020