Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2510.17923
Cited By
v1
v2
v3
v4 (latest)
Rewarding the Journey, Not Just the Destination: A Composite Path and Answer Self-Scoring Reward Mechanism for Test-Time Reinforcement Learning
20 October 2025
Chenwei Tang
Jingyu Xing
Xinyu Liu
Wei Ju
Jiancheng Lv
Fan Zhang
Deng Xiong
Ziyue Qiao
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Rewarding the Journey, Not Just the Destination: A Composite Path and Answer Self-Scoring Reward Mechanism for Test-Time Reinforcement Learning"
0 / 0 papers shown
No papers found
Page 1 of 0