Revisiting Visual Understanding in Multimodal Reasoning through a Lens of Image Perturbation

11 June 2025
Yuting Li
Lai Wei
Kaipeng Zheng
Jingyuan Huang
Linghe Kong
Shunian Chen
Weiran Huang
Lichao Sun
AAML, LRM, VLM
ArXiv (abs) | PDF | HTML | HuggingFace (10 upvotes) | Github (57★)

Papers citing "Revisiting Visual Understanding in Multimodal Reasoning through a Lens of Image Perturbation"

4 / 4 papers shown
Perception-Consistency Multimodal Large Language Models Reasoning via Caption-Regularized Policy Optimization
Songjun Tu
Qichao Zhang
Jingbo Sun
Y. Fu
Linjing Li
X. Lan
Dongmei Jiang
Yaowei Wang
Dongbin Zhao
OffRL, LRM
129
1
0
26 Sep 2025
MAPO: Mixed Advantage Policy Optimization
Wenke Huang
Quan Zhang
Yiyang Fang
Jian Liang
Xuankun Rong
...
Mingjun Li
Leszek Rutkowski
Mang Ye
Bo Du
Dacheng Tao
235
4
0
23 Sep 2025
Empowering Multimodal LLMs with External Tools: A Comprehensive Survey
Wenbin An
Jiahao Nie
Yaqiang Wu
Feng Tian
Shijian Lu
Q. Zheng
MLLM
182
1
0
14 Aug 2025
Perception-Aware Policy Optimization for Multimodal Reasoning
Zhenhailong Wang
Xuehang Guo
Sofia Stoica
Haiyang Xu
Hongru Wang
...
Xiusi Chen
Yangyi Chen
Ming Yan
Fei Huang
Mengyue Yang
OffRL, LRM
419
22
0
08 Jul 2025