Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.11281
Cited By
Can Large Multimodal Models Uncover Deep Semantics Behind Images?
17 February 2024
Yixin Yang
Zheng Li
Qingxiu Dong
Heming Xia
Zhifang Sui
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Can Large Multimodal Models Uncover Deep Semantics Behind Images?"
2 / 2 papers shown
Title
Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions
Zhe Hu
Tuo Liang
Jing Li
Yiren Lu
Yunlai Zhou
Yiran Qiao
Jing Ma
Yu Yin
36
4
0
29 May 2024
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
380
4,010
0
28 Jan 2022
1