Classification-based Approximate Policy Iteration: Experiments and Extended Discussions

2 July 2014

Papers citing "Classification-based Approximate Policy Iteration: Experiments and Extended Discussions"

Title
On-line Policy Improvement using Monte-Carlo Search; Gerald Tesauro, Gregory R. Galperin; 92 270 0; 09 Jan 2025
Neural PPO-Clip Attains Global Optimality: A Hinge Loss Perspective; Nai-Chieh Huang, Ping-Chun Hsieh, Kuo-Hao Ho, Hsuan-Yu Yao, Kai-Chun Hu, Liang-Chun Ouyang, I-Chen Wu; 30 1 0; 26 Oct 2021