Title |
---|
![]() MouSi: Poly-Visual-Expert Vision-Language Models Xiaoran Fan Tao Ji Changhao Jiang Shuo Li Senjie Jin ...Qi Zhang Xipeng Qiu Xuanjing Huang Zuxuan Wu Yunchun Jiang |
![]() Merlin:Empowering Multimodal LLMs with Foresight Minds En Yu Liang Zhao Yana Wei Jinrong Yang Dongming Wu ...Haoran Wei Tiancai Wang Zheng Ge Xiangyu Zhang Wenbing Tao |