Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2509.02737
Cited By
Imitate Optimal Policy: Prevail and Induce Action Collapse in Policy Gradient
2 September 2025
Zhongzhu Zhou
Yibo Yang
Ziyan Chen
Fengxiang Bie
Haojun Xia
Xiaoxia Wu
Robert Wu
Ben Athiwaratkun
Bernard Ghanem
Shuaiwen Leon Song
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Imitate Optimal Policy: Prevail and Induce Action Collapse in Policy Gradient"
0 / 0 papers shown
Title
No papers found