Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.13607
Cited By
CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models
21 February 2024
Fuwen Luo
Chi Chen
Zihao Wan
Zhaolu Kang
Qidong Yan
Yingjie Li
Xiaolong Wang
Siyu Wang
Ziyue Wang
Xiaoyue Mi
Peng Li
Ning Ma
Maosong Sun
Yang Janet Liu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models"
8 / 8 papers shown
Title
CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models
Yiqi Zhu
Z. Wang
C. Zhang
Peng Li
Yang Liu
CoGe
VLM
63
0
0
18 Mar 2025
Large Language Model Benchmarks in Medical Tasks
Lawrence K. Q. Yan
Ming Li
Y. Zhang
Caitlyn Heqi Yin
Cheng Fei
...
Ziqian Bi
Pohsun Feng
Keyu Chen
Junyu Liu
Qian Niu
LM&MA
AI4MH
51
4
0
28 Oct 2024
ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models
Ziyue Wang
Chi Chen
Fuwen Luo
Yurui Dong
Yuanchi Zhang
Yuzhuang Xu
Xiaolong Wang
Peng Li
Yang Liu
LRM
28
3
0
07 Oct 2024
Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations
Nick Jiang
Anish Kachinthaya
Suzie Petryk
Yossi Gandelsman
VLM
24
14
0
03 Oct 2024
A Survey on Multimodal Benchmarks: In the Era of Large AI Models
Lin Li
Guikun Chen
Hanrong Shi
Jun Xiao
Long Chen
34
8
0
21 Sep 2024
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration
Qinghao Ye
Haiyang Xu
Jiabo Ye
Mingshi Yan
Anwen Hu
Haowei Liu
Qi Qian
Ji Zhang
Fei Huang
Jingren Zhou
MLLM
VLM
114
367
0
07 Nov 2023
Ambiguous Images With Human Judgments for Robust Visual Event Classification
Kate Sanders
Reno Kriz
Anqi Liu
Benjamin Van Durme
53
12
0
06 Oct 2022
Audio-Visual Floorplan Reconstruction
Senthil Purushwalkam
S. V. A. Garí
V. Ithapu
Carl Schissler
Philip Robinson
Abhinav Gupta
Kristen Grauman
VGen
3DV
51
41
0
31 Dec 2020
1