Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.23905
Cited By
Boosting MLLM Reasoning with Text-Debiased Hint-GRPO
31 March 2025
Qihan Huang
Long Chan
Jinlong Liu
Wanggui He
Hao Jiang
Mingli Song
Jingyuan Chen
Chang Yao
Jie Song
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Boosting MLLM Reasoning with Text-Debiased Hint-GRPO"
9 / 9 papers shown
Title
Unveiling the Compositional Ability Gap in Vision-Language Reasoning Model
Tianle Li
Jihai Zhang
Yongming Rao
Yu Cheng
CoGe
LRM
VLM
65
0
0
26 May 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLM
VLM
OffRL
AI4TS
LRM
303
1,503
0
22 Jan 2025
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Omkar Thawakar
Dinura Dissanayake
Ketan More
Ritesh Thawkar
Ahmed Heakl
...
Hisham Cholakkal
Ivan Laptev
Mubarak Shah
Fahad Shahbaz Khan
Salman Khan
VLM
LRM
93
46
0
10 Jan 2025
Diving into Self-Evolving Training for Multimodal Reasoning
Wei Liu
Junlong Li
Xiwen Zhang
Fan Zhou
Yu Cheng
Junxian He
LRM
ReLM
111
15
0
23 Dec 2024
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model
Jiahui Gao
Renjie Pi
Jipeng Zhang
Jiacheng Ye
Wanjun Zhong
...
Lanqing Hong
Jianhua Han
Hang Xu
Zhenguo Li
Lingpeng Kong
SyDa
ReLM
LRM
81
105
0
18 Dec 2023
Classifier-Free Diffusion Guidance
Jonathan Ho
Tim Salimans
FaML
131
3,830
0
26 Jul 2022
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
386
17,550
0
19 Jun 2020
AI2D-RST: A multimodal corpus of 1000 primary school science diagrams
Tuomo Hiippala
Malihe Alikhani
Jonas Haverinen
Timo Kalliokoski
E. Logacheva
Serafina Orekhova
Aino Tuomainen
Matthew Stone
J. Bateman
46
50
0
09 Dec 2019
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
285
18,685
0
20 Jul 2017
1