Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2509.16197
Cited By
MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
19 September 2025
Yanghao Li
Rui Qian
Bowen Pan
Haotian Zhang
Haoshuo Huang
Bowen Zhang
Jialing Tong
Haoxuan You
Xianzhi Du
Zhe Gan
H. Kim
Chao Jia
Zhenbang Wang
Yinfei Yang
Mingfei Gao
Zi-Yi Dou
Wenze Hu
Chang Gao
Dongxu Li
Philipp Dufter
Zirui Wang
Guoli Yin
Zhengdong Zhang
Chen Chen
Yang Zhao
Ruoming Pang
Zhifeng Chen
MLLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (48 upvotes)
Github (1236★)
Papers citing
"MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer"
1 / 1 papers shown
Title
BLIP3o-NEXT: Next Frontier of Native Image Generation
Jiuhai Chen
Le Xue
Zhiyang Xu
Xichen Pan
Shusheng Yang
...
Tianyi Zhou
Junnan Li
Silvio Savarese
Caiming Xiong
Ran Xu
52
1
0
17 Oct 2025
1