Title |
---|
Merlin: Empowering Multimodal LLMs with Foresight Minds En Yu Liang Zhao Yana Wei Jinrong Yang Dongming Wu ...Haoran Wei Tiancai Wang Zheng Ge Xiangyu Zhang Wenbing Tao |
GLaMM: Pixel Grounding Large Multimodal Model H. Rasheed Muhammad Maaz Sahal Shaji Mullappilly Abdelrahman M. Shaker Salman Khan Hisham Cholakkal Rao M. Anwer Eric Xing Ming-Hsuan Yang Fahad S. Khan |
DreamLLM: Synergistic Multimodal Comprehension and Creation Runpei Dong Chunrui Han Yuang Peng Zekun Qi Zheng Ge ...Hao-Ran Wei Xiangwen Kong Xiangyu Zhang Kaisheng Ma Li Yi |