DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning

26 May 2025
Qi Cao
Ruiyi Wang
Ruiyi Zhang
Sai Ashish Somayajula
P. Xie
    LRM

Papers citing "DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning"

8 / 8 papers shown
A Survey of Process Reward Models: From Outcome Signals to Process Supervisions for Large Language Models
Congming Zheng
Jiachen Zhu
Zhuoying Ou
Yuxiang Chen
Kangning Zhang
...
Zeyu Zheng
Mengyue Yang
Jianghao Lin
Yong Yu
Weinan Zhang
LRM
214
1
0
09 Oct 2025
DreamPRM-1.5: Unlocking the Potential of Each Instance for Multimodal Process Reward Model Training
Qi Cao
P. Xie
OffRL
159
0
0
05 Sep 2025
GM-PRM: A Generative Multimodal Process Reward Model for Multimodal Mathematical Reasoning
Jianghangfan Zhang
Yibo Yan
Kening Zheng
Xin Zou
Song Dai
Xuming Hu
LRM
282
4
0
06 Aug 2025
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency
Dongzhi Jiang
Renrui Zhang
Ziyu Guo
Yanwei Li
Yu Qi
...
Shen Yan
Bo Zhang
Chaoyou Fu
Peng Gao
Jiaming Song
MLLM, LRM
455
89
0
13 Feb 2025
Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining
International Conference on Learning Representations (ICLR), 2025
Daouda Sow
Herbert Woisetschläger
Saikiran Bulusu
Shiqiang Wang
Hans-Arno Jacobsen
Yingbin Liang
376
13
0
10 Feb 2025
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
Weiyun Wang
Zhe Chen
Wenhai Wang
Yue Cao
Yangzhou Liu
...
Jinguo Zhu
X. Zhu
Lewei Lu
Yu Qiao
Jifeng Dai
LRM
536
186
1
15 Nov 2024
Self-Correction is More than Refinement: A Learning Framework for Visual and Language Reasoning Tasks
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Jiayi He
Hehai Lin
Q. Wang
Yi R. Fung
Chenhui Xu
ReLM, LRM
594
27
0
05 Oct 2024
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance
Jiasheng Ye
Peiju Liu
Tianxiang Sun
Yunhua Zhou
Jun Zhan
Xipeng Qiu
394
111
0
25 Mar 2024