Title |
---|
![]() Multimodal Structured Generation: CVPR's 2nd MMFM Challenge Technical Report Franz Louis Cesista |
![]() MuirBench: A Comprehensive Benchmark for Robust Multi-image
Understanding Fei Wang Xingyu Fu James Y. Huang Zekun Li Qin Liu ...Kai-Wei Chang Dan Roth Sheng Zhang Hoifung Poon Muhao Chen |
![]() Rethinking Overlooked Aspects in Vision-Language Models Yuan Liu Le Tian Xiao Zhou Jie Zhou |