Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2505.23179
Cited By
v1
v2 (latest)
DIP-R1: Deep Inspection and Perception with RL Looking Through and Understanding Complex Scenes
29 May 2025
Sungjune Park
Hyunjun Kim
J. Kim
S. T. Kim
Y. Ro
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"DIP-R1: Deep Inspection and Perception with RL Looking Through and Understanding Complex Scenes"
13 / 13 papers shown
Reinforcement Learning for Large Model: A Survey
Weijia Wu
Chen Gao
Joya Chen
Kevin Lin
Qingwei Meng
Yiming Zhang
Yuke Qiu
Hong Zhou
Mike Zheng Shou
316
2
0
24 Dec 2025
Learning What to Attend First: Modality-Importance-Guided Reasoning for Reliable Multimodal Emotion Understanding
Hyeongseop Rha
Jeong Hun Yeo
Junil Won
Se Jin Park
Yong Man Ro
LRM
92
0
0
02 Dec 2025
Emotion-Coherent Reasoning for Multimodal LLMs via Emotional Rationale Verifier
Hyeongseop Rha
Jeong Hun Yeo
Yeonju Kim
Y. Ro
LRM
261
1
0
27 Oct 2025
HieroAction: Hierarchically Guided VLM for Fine-Grained Action Analysis
Junhao Wu
Xiuer Gu
Zhiying Li
Yeying Jin
Yunfeng Diao
Zhiyu Li
Zhenbo Song
Xiaomei Zhang
Zhaoxin Fan
104
1
0
23 Aug 2025
3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding
Ting Huang
Zeyu Zhang
Hao Tang
LRM
126
12
0
31 Jul 2025
Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement
Yuqi Liu
Bohao Peng
Zhisheng Zhong
Zihao Yue
Fanbin Lu
Bei Yu
Jiaya Jia
LRM
VLM
385
46
0
01 Jul 2025
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models
Jinguo Zhu
Weiyun Wang
Zhe Chen
Ziwei Liu
Shenglong Ye
...
Dahua Lin
Yu Qiao
Jifeng Dai
Wenhai Wang
Wei Wang
MLLM
VLM
549
790
1
14 Apr 2025
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning
Nvidia
A. Azzolini
Junjie Bai
Prithvijit Chattopadhyay
Huayu Chen
...
Xiaodong Yang
Zhuolin Yang
Jing Zhang
Xiaohui Zeng
Zhe Zhang
AI4CE
LM&Ro
LRM
625
69
0
18 Mar 2025
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
Jingyi Zhang
Jiaxing Huang
Huanjin Yao
Shunyu Liu
Xikun Zhang
Shijian Lu
Dacheng Tao
LRM
385
200
0
17 Mar 2025
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models
Hao Wu
Bohan Jia
Zijie Zhai
Shaosheng Cao
Zheyu Ye
Fei Zhao
Zhe Xu
Yao Hu
Shaohui Lin
MU
OffRL
LRM
MLLM
ReLM
VLM
565
353
0
09 Mar 2025
MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Jiazhen Pan
Che Liu
Junde Wu
Fenglin Liu
Jiayuan Zhu
Hongwei Bran Li
Chen Chen
Cheng Ouyang
Daniel Rueckert
LRM
LM&MA
VLM
462
107
0
26 Feb 2025
Qwen2.5-VL Technical Report
S. Bai
Keqin Chen
Xuejing Liu
Jialin Wang
Wenbin Ge
...
Zesen Cheng
Hang Zhang
Zhibo Yang
Haiyang Xu
Junyang Lin
VLM
719
2,841
0
20 Feb 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
OffRL
AI4TS
LRM
ReLM
VLM
1.2K
5,342
0
22 Jan 2025
1