Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger. Annual Meeting of the Association for Computational Linguistics (ACL), 2025 |
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models. International Conference on Learning Representations (ICLR), 2024. Peng Xia, Siwei Han, Shi Qiu, Yiyang Zhou, Zhaoyang Wang, ..., Chenhang Cui, Mingyu Ding, Linjie Li, Lijuan Wang, Huaxiu Yao |
MuRAR: A Simple and Effective Multimodal Retrieval and Answer Refinement Framework for Multimodal Question Answering. International Conference on Computational Linguistics (COLING), 2024 |