ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.23905
  4. Cited By
Boosting MLLM Reasoning with Text-Debiased Hint-GRPO

Boosting MLLM Reasoning with Text-Debiased Hint-GRPO

31 March 2025
Qihan Huang
Long Chan
Jinlong Liu
Wanggui He
Hao Jiang
Mingli Song
Jingyuan Chen
Chang Yao
Jie Song
    LRM
ArXivPDFHTML

Papers citing "Boosting MLLM Reasoning with Text-Debiased Hint-GRPO"

9 / 9 papers shown
Title
Unveiling the Compositional Ability Gap in Vision-Language Reasoning Model
Unveiling the Compositional Ability Gap in Vision-Language Reasoning Model
Tianle Li
Jihai Zhang
Yongming Rao
Yu Cheng
CoGe
LRM
VLM
65
0
0
26 May 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLM
VLM
OffRL
AI4TS
LRM
303
1,503
0
22 Jan 2025
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Omkar Thawakar
Dinura Dissanayake
Ketan More
Ritesh Thawkar
Ahmed Heakl
...
Hisham Cholakkal
Ivan Laptev
Mubarak Shah
Fahad Shahbaz Khan
Salman Khan
VLM
LRM
93
46
0
10 Jan 2025
Diving into Self-Evolving Training for Multimodal Reasoning
Diving into Self-Evolving Training for Multimodal Reasoning
Wei Liu
Junlong Li
Xiwen Zhang
Fan Zhou
Yu Cheng
Junxian He
LRM
ReLM
111
15
0
23 Dec 2024
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model
Jiahui Gao
Renjie Pi
Jipeng Zhang
Jiacheng Ye
Wanjun Zhong
...
Lanqing Hong
Jianhua Han
Hang Xu
Zhenguo Li
Lingpeng Kong
SyDa
ReLM
LRM
81
105
0
18 Dec 2023
Classifier-Free Diffusion Guidance
Classifier-Free Diffusion Guidance
Jonathan Ho
Tim Salimans
FaML
131
3,830
0
26 Jul 2022
Denoising Diffusion Probabilistic Models
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
386
17,550
0
19 Jun 2020
AI2D-RST: A multimodal corpus of 1000 primary school science diagrams
AI2D-RST: A multimodal corpus of 1000 primary school science diagrams
Tuomo Hiippala
Malihe Alikhani
Jonas Haverinen
Timo Kalliokoski
E. Logacheva
Serafina Orekhova
Aino Tuomainen
Matthew Stone
J. Bateman
46
50
0
09 Dec 2019
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
285
18,685
0
20 Jul 2017
1