Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2510.14967
Cited By
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents
16 October 2025
Guoqing Wang
Sunhao Dai
Guangze Ye
Zeyu Gan
Wei Yao
Yong Deng
Xiaofeng Wu
ZhenZhe Ying
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (32 upvotes)
Github
Papers citing
"Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents"
1 / 1 papers shown
Title
The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Long Li
Jiaran Hao
Jason Klein Liu
Zhijian Zhou
Yanting Miao
...
Wei Chu
Zhe Wang
Shirui Pan
Chao Qu
Yuan Qi
147
5
0
09 Sep 2025
1