Can Large Multimodal Models Uncover Deep Semantics Behind Images?

17 February 2024

Qingxiu Dong

Zhifang Sui

Papers citing "Can Large Multimodal Models Uncover Deep Semantics Behind Images?"

Title
Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions
Zhe Hu, Tuo Liang, Jing Li, Yiren Lu, Yunlai Zhou, Yiran Qiao, Jing Ma, Yu Yin
36 4 0
29 May 2024

BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li, Dongxu Li, Caiming Xiong, S. Hoi
MLLM BDL VLM CLIP
380 4,010 0
28 Jan 2022