Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents


16 October 2025
Guoqing Wang
Sunhao Dai
Guangze Ye
Zeyu Gan
Wei Yao
Yong Deng
Xiaofeng Wu
ZhenZhe Ying
    OffRL
ArXiv (abs) | PDF | HTML | HuggingFace (32 upvotes) | GitHub

Papers citing "Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents"

1 / 1 papers shown
The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Long Li
Jiaran Hao
Jason Klein Liu
Zhijian Zhou
Yanting Miao
...
Wei Chu
Zhe Wang
Shirui Pan
Chao Qu
Yuan Qi
09 Sep 2025