Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2502.03854
Cited By
v1
v2 (latest)
Mirror Descent Actor Critic via Bounded Advantage Learning
6 February 2025
Ryo Iwaki
Re-assign community
ArXiv (abs)
PDF
HTML
Github (93922★)
Papers citing
"Mirror Descent Actor Critic via Bounded Advantage Learning"
1 / 1 papers shown
Title
Divergence-Augmented Policy Optimization
Neural Information Processing Systems (NeurIPS), 2025
Qing Wang
Yingru Li
Jiechao Xiong
Tong Zhang
OffRL
249
17
0
28 Jan 2025
1