Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2505.20241
Cited By

DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning

v1v2v3 (latest)

DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning

26 May 2025

Sai Ashish Somayajula

ArXiv (abs)PDF HTML Github (15★)

Papers citing "DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning"

8 / 8 papers shown

A Survey of Process Reward Models: From Outcome Signals to Process Supervisions for Large Language Models

A Survey of Process Reward Models: From Outcome Signals to Process Supervisions for Large Language Models

...

214

1

0

09 Oct 2025

DreamPRM-1.5: Unlocking the Potential of Each Instance for Multimodal Process Reward Model Training

DreamPRM-1.5: Unlocking the Potential of Each Instance for Multimodal Process Reward Model Training

159

0

0

05 Sep 2025

GM-PRM: A Generative Multimodal Process Reward Model for Multimodal Mathematical Reasoning

GM-PRM: A Generative Multimodal Process Reward Model for Multimodal Mathematical Reasoning

Jianghangfan Zhang

282

4

0

06 Aug 2025

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

...

455

89

0

13 Feb 2025

Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining

Dynamic Loss-Based Sample Reweighting for Improved Large Language Model PretrainingInternational Conference on Learning Representations (ICLR), 2025

Herbert Woisetschläger

Saikiran Bulusu

Hans-Arno Jacobsen

376

13

0

10 Feb 2025

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

...

536

186

1

15 Nov 2024

Self-Correction is More than Refinement: A Learning Framework for Visual and Language Reasoning Tasks

Self-Correction is More than Refinement: A Learning Framework for Visual and Language Reasoning TasksAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

594

27

0

05 Oct 2024

Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance

Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance

Jiasheng Ye

Tianxiang Sun

Xipeng Qiu

394

111

0

25 Mar 2024

Page 1 of 1