Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.18193
Cited By
MammothModa: Multi-Modal Large Language Model
26 June 2024
Qi She
Junwen Pan
Xin Wan
Rui Zhang
Dawei Lu
Kai Huang
MLLM
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MammothModa: Multi-Modal Large Language Model"
2 / 2 papers shown
Title
UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model
Jiabo Ye
Anwen Hu
Haiyang Xu
Qinghao Ye
Mingshi Yan
...
Ji Zhang
Qin Jin
Liang He
Xin Lin
Feiyan Huang
VLM
MLLM
121
83
0
08 Oct 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
244
4,186
0
30 Jan 2023
1