arXiv: 2505.14677
Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning
Versions: v1, v2, v3 (latest)

20 May 2025
Jiaer Xia
Yuhang Zang
Peng Gao
Shouqing Yang
Kaiyang Zhou
    OffRL, ReLM, AI4TS, VLM, LRM
ArXiv (abs) | PDF | HTML | HuggingFace (15 upvotes) | Github (40★)

Papers citing "Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning"

32 / 32 papers shown
Reinforcement Learning for Large Model: A Survey
Weijia Wu
Chen Gao
Joya Chen
Kevin Lin
Qingwei Meng
Yiming Zhang
Yuke Qiu
Hong Zhou
Mike Zheng Shou
323
2
0
24 Dec 2025
TempR1: Improving Temporal Understanding of MLLMs via Temporal-Aware Multi-Task Reinforcement Learning
Tao Wu
Li Yang
Gen Zhan
Y. Zhang
Yiting Liao
Junlin Li
Deliang Fu
Li Zhang
Limin Wang
AI4TS, VLM, LRM
258
0
0
03 Dec 2025
Be My Eyes: Extending Large Language Models to New Modalities Through Multi-Agent Collaboration
James Y. Huang
Sheng Zhang
Qianchu Liu
Guanghui Qin
Tinghui Zhu
Tristan Naumann
Muhao Chen
Hoifung Poon
VLM, LRM
154
0
0
24 Nov 2025
Syn-GRPO: Self-Evolving Data Synthesis for MLLM Perception Reasoning
Qihan Huang
H. Zhang
Rong Wei
Yi Wang
Rui Tang
Mingli Song
Jie Song
143
0
0
24 Nov 2025
Perceptual-Evidence Anchored Reinforced Learning for Multimodal Reasoning
Chi Zhang
Haibo Qiu
Qiming Zhang
Y. Xu
Zhixiong Zeng
Siqi Yang
Peng Shi
Lin Ma
Jing Zhang
OffRL, ReLM, LRM
252
0
0
23 Nov 2025
Learning to Think Fast and Slow for Visual Language Models
Chenyu Lin
Cheng Chi
Jinlin Wu
Sharon Li
Kaiyang Zhou
ReLM, VLM
226
0
0
20 Nov 2025
VisPlay: Self-Evolving Vision-Language Models from Images
Yicheng He
Chengsong Huang
Zongxia Li
Jiaxin Huang
Yonghui Yang
OffRL, LRM, VLM
403
7
0
19 Nov 2025
SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards
Hunar Batra
Haoqin Tu
Hardy Chen
Yuanze Lin
Cihang Xie
Ronald Clark
OffRL, ReLM, LRM
376
0
0
10 Nov 2025
Visual Attention Reasoning via Hierarchical Search and Self-Verification
Wei Cai
Jian Zhao
Yuchen Yuan
T. Zhang
Ming Zhu
Haichuan Tang
Chi Zhang
OffRL, LRM
200
0
0
21 Oct 2025
Proactive Reasoning-with-Retrieval Framework for Medical Multimodal Large Language Models
Lehan Wang
Yi Qin
Honglong Yang
Xiaomeng Li
LRM
164
1
0
21 Oct 2025
A Survey on Agentic Multimodal Large Language Models
Huanjin Yao
Ruifei Zhang
Jiaxing Huang
Jingyi Zhang
Yibo Wang
...
Ruolin Zhu
Yongcheng Jing
Shunyu Liu
Guanbin Li
Dacheng Tao
LM&Ro, AIFin, AI4TS, LRM, AI4CE
250
6
0
13 Oct 2025
Spotlight on Token Perception for Multimodal Reinforcement Learning
Siyuan Huang
Xiaoye Qu
Yafu Li
Yun Luo
Zefeng He
Daizong Liu
Yu Cheng
OffRL, LRM
135
2
0
10 Oct 2025
More Thought, Less Accuracy? On the Dual Nature of Reasoning in Vision-Language Models
Xinyu Tian
Shu Zou
Zhaoyuan Yang
Mengqi He
Fabian Waschkowski
Lukas Wesemann
Peter Tu
Jing Zhang
OffRL, LRM
247
5
0
30 Sep 2025
Latent Visual Reasoning
Bangzheng Li
Ximeng Sun
Jiang-Long Liu
Ze Wang
Jialian Wu
Xiaodong Yu
Hao Chen
Emad Barsoum
Muhao Chen
Zicheng Liu
LRM, VLM
203
5
0
29 Sep 2025
VTPerception-R1: Enhancing Multimodal Reasoning via Explicit Visual and Textual Perceptual Grounding
Yizhuo Ding
M. Ben-Chen
Zhibang Feng
Tong Xiao
Wanying Qu
Wenqi Shao
Yanwei Fu
LRM, VLM
127
0
0
29 Sep 2025
Perception Before Reasoning: Two-Stage Reinforcement Learning for Visual Reasoning in Vision-Language Models
Yan Chen
Long Li
Teng Xi
Long Zeng
Jingdong Wang
OffRL, ReLM, LRM, VLM
209
6
0
16 Sep 2025
Towards Secure and Explainable Smart Contract Generation with Security-Aware Group Relative Policy Optimization
Lei Yu
Jingyuan Zhang
Xin Wang
Jiajia Ma
Li Yang
Fengjun Zhang
ELM, LRM
196
0
0
12 Sep 2025
Measuring Epistemic Humility in Multimodal Large Language Models
Bingkui Tong
Jiaer Xia
Sifeng Shang
Kaiyang Zhou
HILM
143
2
0
11 Sep 2025
MMSearch-Plus: Benchmarking Provenance-Aware Search for Multimodal Browsing Agents
Xijia Tao
Yihua Teng
Xinxing Su
Xinyu Fu
Jihao Wu
Chaofan Tao
Ziru Liu
Haoli Bai
Rui Liu
Lingpeng Kong
VLM, LRM
166
0
0
29 Aug 2025
Self-Rewarding Vision-Language Model via Reasoning Decomposition
Zongxia Li
Wenhao Yu
Chengsong Huang
Rui Liu
Zhenwen Liang
...
Jingxi Che
Dian Yu
Jordan L. Boyd-Graber
Haitao Mi
Dong Yu
ReLM, VLM, LRM
149
42
0
27 Aug 2025
Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision
Luozheng Qin
Jia Gong
Yuqing Sun
Tianjiao Li
Mengping Yang
Xiaomeng Yang
Chao Qu
Zhiyu Tan
Hao Li
MLLM, LRM
228
0
0
07 Aug 2025
Don't Overthink It: A Survey of Efficient R1-style Large Reasoning Models
Linan Yue
Yichao Du
Yizhi Wang
W. Gao
Fangzhou Yao
...
Ye Liu
Ziyu Xu
Qi Liu
Shimin Di
Xiaoshi Zhong
LRM
216
16
0
04 Aug 2025
VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning
Ruifeng Yuan
Chenghao Xiao
Sicong Leng
Jianyu Wang
Long Li
...
Deli Zhao
Qifeng Bai
Zhongyu Wei
H. Zhang
Yu Rong
OffRL, ReLM, LRM
262
13
0
30 Jul 2025
Perception-Aware Policy Optimization for Multimodal Reasoning
Zhenhailong Wang
Xuehang Guo
Sofia Stoica
Haiyang Xu
Hongru Wang
...
Xiusi Chen
Yangyi Chen
Ming Yan
Fei Huang
Mengyue Yang
OffRL, LRM
423
22
0
08 Jul 2025
Seed1.5-VL Technical Report
D. Guo
Faming Wu
Feida Zhu
Fuxing Leng
Guang Shi
...
Kai Hua
Kai Liu
Kai Shen
Jianchao Tan
Ke Shen
MLLM, VLM, LRM
230
172
0
11 May 2025
Video-R1: Reinforcing Video Reasoning in MLLMs
Kaituo Feng
Kaixiong Gong
Yangqiu Song
Zonghao Guo
Yibing Wang
Tianshuo Peng
Jian Wu
Xiaoying Zhang
Benyou Wang
Xiangyu Yue
AI4TS, SyDa, LRM
603
235
0
27 Mar 2025
MMCR: Advancing Visual Language Model in Multimodal Multi-Turn Contextual Reasoning
Dawei Yan
Yangfu Li
Qing-Guo Chen
Weihua Luo
Peng Wang
Han Zhang
Chunhua Shen
VGen, VLM, LRM
223
5
0
24 Mar 2025
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
Jingyi Zhang
Jiaxing Huang
Huanjin Yao
Shunyu Liu
Xikun Zhang
Shijian Lu
Dacheng Tao
LRM
425
209
0
17 Mar 2025
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models
Hao Wu
Bohan Jia
Zijie Zhai
Shaosheng Cao
Zheyu Ye
Fei Zhao
Zhe Xu
Yao Hu
Shaohui Lin
MU, OffRL, LRM, MLLM, ReLM, VLM
600
361
0
09 Mar 2025
Qwen2.5-VL Technical Report
S. Bai
Keqin Chen
Xuejing Liu
Jialin Wang
Wenbin Ge
...
Zesen Cheng
Hang Zhang
Zhibo Yang
Haiyang Xu
Junyang Lin
VLM
725
2,990
0
20 Feb 2025
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Tianzhe Chu
Yuexiang Zhai
Jihan Yang
Shengbang Tong
Saining Xie
Dale Schuurmans
Quoc V. Le
Sergey Levine
Yi-An Ma
OffRL
693
419
0
28 Jan 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
OffRL, AI4TS, LRM, ReLM, VLM
1.3K
5,342
0
22 Jan 2025