MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning Haotian Zhang Mingfei Gao Zhe Gan Philipp Dufter Nina Wenzel ...Haoxuan You Zirui Wang Afshin Dehghan Peter Grasch Yinfei Yang |
A Survey on Evaluation of Multimodal Large Language Models Jiaxing Huang Jingyi Zhang |
mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal
Large Language ModelsInternational Conference on Learning Representations (ICLR), 2024 |
Enhancing Descriptive Image Quality Assessment with A Large-scale Multi-modal DatasetIEEE Transactions on Image Processing (TIP), 2024 |
Depicting Beyond Scores: Advancing Image Quality Assessment through
Multi-modal Language ModelsEuropean Conference on Computer Vision (ECCV), 2023 |