Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2406.15768
Cited By
MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception
22 June 2024
Guanqun Wang
Xinyu Wei
Jiaming Liu
Ray Zhang
Yichi Zhang
Kevin Zhang
Maurice Chong
Shanghang Zhang
VLM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Github
Papers citing
"MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception"
2 / 2 papers shown
From Perception to Reasoning: Deep Thinking Empowers Multimodal Large Language Models
Wenxin Zhu
Andong Chen
Yuchen Song
Kehai Chen
Conghui Zhu
Ziyan Chen
Tiejun Zhao
LRM
533
1
0
17 Nov 2025
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection
IEEE International Conference on Computer Vision (ICCV), 2022
Renrui Zhang
Han Qiu
Tai Wang
Ziyu Guo
Xuan Xu
Xuanzhuo Xu
Ziteng Cui
Shiyang Feng
Jiaming Song
Hongsheng Li
ViT
MDE
667
168
0
24 Mar 2022
1
Page 1 of 1