Title |
---|
Artwork Explanation in Large-scale Vision Language Models Kazuki Hayashi Yusuke Sakai Hidetaka Kamigaito Katsuhiko Hayashi Taro Watanabe |
Towards Open-ended Visual Quality Comparison Haoning Wu Hanwei Zhu Zicheng Zhang Erli Zhang Chaofeng Chen ...Qiong Yan Xiaohong Liu Guangtao Zhai Shiqi Wang Weisi Lin |
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences Xiyao Wang Yuhang Zhou Xiaoyu Liu Hongjin Lu Yuancheng Xu ...Taixi Lu Gedas Bertasius Mohit Bansal Huaxiu Yao Furong Huang |
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model Jiahui Gao Renjie Pi Jipeng Zhang Jiacheng Ye Wanjun Zhong ...Lanqing Hong Jianhua Han Hang Xu Zhenguo Li Lingpeng Kong |
Does Pre-trained Language Model Actually Infer Unseen Links in Knowledge Graph Completion? Yusuke Sakai Hidetaka Kamigaito Katsuhiko Hayashi Taro Watanabe |