Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

11 September 2025
Jiawei Wang
Jiacai Liu
Y. Fu
Y. Li
Xintao Wang
Yuan Lin
Yu Yue
L. Zhang
Y. X. R. Wang
Ke Wang
ArXiv (abs) | PDF | HTML | HuggingFace (41 upvotes)

Papers citing "Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents"

7 / 7 papers shown
Revisiting Entropy in Reinforcement Learning for Large Reasoning Models
Renren Jin
Pengzhi Gao
Yuqi Ren
Zhuowen Han
Tongxuan Zhang
Wuwei Huang
Wei Liu
Jian Luan
Deyi Xiong
LRM
126
1
0
08 Nov 2025
SALT: Step-level Advantage Assignment for Long-horizon Agents via Trajectory Graph
Jiazheng Li
Y. X. R. Wang
David Yan
Yijun Tian
Zhichao Xu
Huan Song
Panpan Xu
Lin Lee Cheong
127
0
0
22 Oct 2025
Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization
Yang Li
Z. Dong
Yuhan Sun
Weixun Wang
Shaopan Xiong
...
Han Lu
Jiamang Wang
Wenbo Su
Bo Zheng
Junchi Yan
LRM
113
4
0
15 Oct 2025
Entropy Meets Importance: A Unified Head Importance-Entropy Score for Stable and Efficient Transformer Pruning
Minsik Choi
Hyegang Son
Changhoon Kim
Young Geun Kim
AAML
117
0
0
10 Oct 2025
RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning
Sicheng Feng
Kaiwen Tuo
Song Wang
Lingdong Kong
Jianke Zhu
Huan Wang
LRM
203
2
0
02 Oct 2025
Gradient Coupling: The Hidden Barrier to Generalization in Agentic Reinforcement Learning
Jingyu Liu
Xiaopeng Wu
Jingquan Peng
Kehan Chen
Chuan Yu
Lizhong Ding
Yong Liu
173
0
0
28 Sep 2025
Quantile Advantage Estimation for Entropy-Safe Reasoning
Junkang Wu
Kexin Huang
Jiancan Wu
An Zhang
Xiang Wang
Xiangnan He
129
4
0
26 Sep 2025