Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2311.17963
Cited By
M
2
^{2}
2
Chat: Empowering VLM for Multimodal LLM Interleaved Text-Image Generation
29 November 2023
Xiaowei Chi
Rongyu Zhang
Zhengkai Jiang
Yijiang Liu
Ziyi Lin
Renrui Zhang
Chaoyou Fu
Peng Gao
Shanghang Zhang
Qi-fei Liu
Yi-Ting Guo
MLLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"M$^{2}$Chat: Empowering VLM for Multimodal LLM Interleaved Text-Image Generation"
2 / 2 papers shown
Title
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Yanwei Li
Yuechen Zhang
Chengyao Wang
Zhisheng Zhong
Yixin Chen
Ruihang Chu
Shaoteng Liu
Jiaya Jia
VLM
MLLM
MoE
29
210
0
27 Mar 2024
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
244
4,186
0
30 Jan 2023
1