2506.09736
Cited By
v1
v2 (latest)
Revisiting Visual Understanding in Multimodal Reasoning through a Lens of Image Perturbation
11 June 2025
Yuting Li
Lai Wei
Kaipeng Zheng
Jingyuan Huang
Linghe Kong
Shunian Chen
Weiran Huang
Lichao Sun
AAML
LRM
VLM
ArXiv (abs)
PDF
HTML
HuggingFace (10 upvotes)
Github (57★)
Papers citing
"Revisiting Visual Understanding in Multimodal Reasoning through a Lens of Image Perturbation"
4 / 4 papers shown
Perception-Consistency Multimodal Large Language Models Reasoning via Caption-Regularized Policy Optimization
Songjun Tu
Qichao Zhang
Jingbo Sun
Y. Fu
Linjing Li
X. Lan
Dongmei Jiang
Yaowei Wang
Dongbin Zhao
OffRL
LRM
129
1
0
26 Sep 2025
MAPO: Mixed Advantage Policy Optimization
Wenke Huang
Quan Zhang
Yiyang Fang
Jian Liang
Xuankun Rong
...
Mingjun Li
Leszek Rutkowski
Mang Ye
Bo Du
Dacheng Tao
235
4
0
23 Sep 2025
Empowering Multimodal LLMs with External Tools: A Comprehensive Survey
Wenbin An
Jiahao Nie
Yaqiang Wu
Feng Tian
Shijian Lu
Q. Zheng
MLLM
182
1
0
14 Aug 2025
Perception-Aware Policy Optimization for Multimodal Reasoning
Zhenhailong Wang
Xuehang Guo
Sofia Stoica
Haiyang Xu
Hongru Wang
...
Xiusi Chen
Yangyi Chen
Ming Yan
Fei Huang
Mengyue Yang
OffRL
LRM
419
22
0
08 Jul 2025