2507.23317
Good Learners Think Their Thinking: Generative PRM Makes Large Reasoning Model More Efficient Math Learner
31 July 2025
Tao He
Rongchuan Mu
Lizi Liao
Yixin Cao
Ming Liu
Bing Qin
OffRL
LRM
Papers citing "Good Learners Think Their Thinking: Generative PRM Makes Large Reasoning Model More Efficient Math Learner"
5 / 5 papers shown
A Survey of Process Reward Models: From Outcome Signals to Process Supervisions for Large Language Models
Congming Zheng
Jiachen Zhu
Zhuoying Ou
Yuxiang Chen
Kangning Zhang
...
Zeyu Zheng
Mengyue Yang
Jianghao Lin
Yong Yu
Weinan Zhang
LRM
214
1
0
09 Oct 2025
Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought
Guijin Son
Donghun Yang
Hitesh Laxmichand Patel
Amit Agarwal
Hyunwoo Ko
...
Minhyuk Kim
Nikunj Drolia
Dasol Choi
Kyong-Ha Lee
Youngjae Yu
LRM
161
1
0
05 Oct 2025
Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models
Runze Liu
Jiakang Wang
Yuling Shi
Zhihui Xie
Chenxin An
...
Wenping Hu
Xiu Li
Fuzheng Zhang
Guorui Zhou
Kun Gai
OffRL
LRM
153
3
0
30 Sep 2025
Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning
Yoonjeon Kim
Doohyuk Jang
Eunho Yang
ReLM
AIFin
LRM
206
1
0
26 Sep 2025
StepWiser: Stepwise Generative Judges for Wiser Reasoning
Wei Xiong
Wenting Zhao
Weizhe Yuan
O. Yu. Golovneva
Tong Zhang
Jason Weston
Sainbayar Sukhbaatar
LRM
147
13
0
26 Aug 2025