
Title |
|---|
![]() OMGM: Orchestrate Multiple Granularities and Modalities for Efficient Multimodal RetrievalAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() Fine-tuning Vision Language Models with Graph-based Knowledge for Explainable Medical Image AnalysisInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025 |
![]() MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language ModelsInternational Conference on Learning Representations (ICLR), 2024 Peng Xia Siwei Han Shi Qiu Yiyang Zhou Zhaoyang Wang ...Chenhang Cui Mingyu Ding Linjie Li Lijuan Wang Huaxiu Yao |
![]() CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision
Language ModelsNeural Information Processing Systems (NeurIPS), 2024 |
![]() Calibrated Self-Rewarding Vision Language ModelsNeural Information Processing Systems (NeurIPS), 2024 |