Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2404.02145
Cited By
Iterated Learning Improves Compositionality in Large Vision-Language Models
2 April 2024
Chenhao Zheng
Jieyu Zhang
Aniruddha Kembhavi
Ranjay Krishna
VLM
CoGe
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Iterated Learning Improves Compositionality in Large Vision-Language Models"
4 / 4 papers shown
Title
Enhancing Vision-Language Compositional Understanding with Multimodal Synthetic Data
Haoxin Li
Boyang Li
CoGe
60
0
0
03 Mar 2025
Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality
Anuj Diwan
Layne Berry
Eunsol Choi
David F. Harwath
Kyle Mahowald
CoGe
73
41
0
01 Nov 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
378
4,010
0
28 Jan 2022
Iterated learning for emergent systematicity in VQA
Ankit Vani
Max Schwarzer
Yucheng Lu
Eeshan Gunesh Dhekane
Aaron C. Courville
33
19
0
03 May 2021
1