ResearchTrend.AI


Paper ID: 2509.16197
MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

19 September 2025
Yanghao Li
Rui Qian
Bowen Pan
Haotian Zhang
Haoshuo Huang
Bowen Zhang
Jialing Tong
Haoxuan You
Xianzhi Du
Zhe Gan
H. Kim
Chao Jia
Zhenbang Wang
Yinfei Yang
Mingfei Gao
Zi-Yi Dou
Wenze Hu
Chang Gao
Dongxu Li
Philipp Dufter
Zirui Wang
Guoli Yin
Zhengdong Zhang
Chen Chen
Yang Zhao
Ruoming Pang
Zhifeng Chen
    MLLM
ArXiv (abs) | PDF | HTML | HuggingFace (48 upvotes) | GitHub (1236★)

Papers citing "MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer"

1 / 1 papers shown
BLIP3o-NEXT: Next Frontier of Native Image Generation
Jiuhai Chen
Le Xue
Zhiyang Xu
Xichen Pan
Shusheng Yang
...
Tianyi Zhou
Junnan Li
Silvio Savarese
Caiming Xiong
Ran Xu
17 Oct 2025