ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2509.26628
  4. Cited By
Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models

Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models

30 September 2025
Runze Liu
Jiakang Wang
Yuling Shi
Zhihui Xie
Chenxin An
Kaiyan Zhang
Jian Zhao
Xiaodong Gu
Lei Lin
Wenping Hu
Xiu Li
Fuzheng Zhang
Guorui Zhou
Kun Gai
    OffRLLRM
ArXiv (abs)PDFHTMLHuggingFace (11 upvotes)Github (952★)

Papers citing "Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models"

1 / 1 papers shown
Title
MixReasoning: Switching Modes to Think
MixReasoning: Switching Modes to Think
Haiquan Lu
Gongfan Fang
Xinyin Ma
Qi Li
Xinchao Wang
LRM
0
0
0
07 Oct 2025
1