Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2409.17958
Cited By
The Hard Positive Truth about Vision-Language Compositionality
26 September 2024
Amita Kamath
Cheng-Yu Hsieh
Kai-Wei Chang
Ranjay Krishna
CLIP
CoGe
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"The Hard Positive Truth about Vision-Language Compositionality"
5 / 5 papers shown
Title
Decoupled Global-Local Alignment for Improving Compositional Understanding
Xiaoxing Hu
Kaicheng Yang
J. Z. Wang
Haoran Xu
Ziyong Feng
Y. Wang
VLM
38
0
0
23 Apr 2025
VAQUUM: Are Vague Quantifiers Grounded in Visual Data?
Hugh Mee Wong
Rick Nouwen
Albert Gatt
41
0
0
17 Feb 2025
Natural Language Inference Improves Compositionality in Vision-Language Models
Paola Cascante-Bonilla
Yu Hou
Yang Trista Cao
Hal Daumé III
Rachel Rudinger
ReLM
CoGe
VLM
31
3
0
29 Oct 2024
VDebugger: Harnessing Execution Feedback for Debugging Visual Programs
Xueqing Wu
Zongyu Lin
Songyan Zhao
Te-Lin Wu
Pan Lu
Nanyun Peng
Kai-Wei Chang
LRM
42
1
0
19 Jun 2024
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
380
4,010
0
28 Jan 2022
1