Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.06606
Cited By
Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program
9 April 2025
Minghe Gao
Xuqi Liu
Zhongqi Yue
Y. Wu
Shuang Chen
Juncheng Billy Li
Siliang Tang
Fei Wu
Tat-Seng Chua
Yueting Zhuang
OffRL
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program"
1 / 1 papers shown
Title
Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models
Xiaobao Wu
LRM
60
0
0
05 May 2025
1