Diagnosing the Compositional Knowledge of Vision Language Models from a
Game-Theoretic View

Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic View

27 May 2024

Jin Wang

Ping Luo

Papers citing "Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic View"

8 / 8 papers shown

Title
Equivariant Similarity for Vision-Language Foundation Models Tan Wang Kevin Qinghong Lin Linjie Li Chung-Ching Lin Zhengyuan Yang Hanwang Zhang Zicheng Liu Lijuan Wang CoGe 30 44 0 25 Mar 2023
CyCLIP: Cyclic Contrastive Language-Image Pretraining Shashank Goel Hritik Bansal S. Bhatia Ryan A. Rossi Vishwa Vinay Aditya Grover CLIP VLM 160 131 0 28 May 2022
GroupViT: Semantic Segmentation Emerges from Text Supervision Jiarui Xu Shalini De Mello Sifei Liu Wonmin Byeon Thomas Breuel Jan Kautz X. Wang ViT VLM 175 494 0 22 Feb 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation Junnan Li Dongxu Li Caiming Xiong S. Hoi MLLM BDL VLM CLIP 382 4,010 0 28 Jan 2022
Discovering and Explaining the Representation Bottleneck of DNNs Huiqi Deng Qihan Ren Hao Zhang Quanshi Zhang 17 59 0 11 Nov 2021
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation Xiuye Gu Tsung-Yi Lin Weicheng Kuo Yin Cui VLM ObjD 220 698 0 28 Apr 2021
A Unified Game-Theoretic Interpretation of Adversarial Robustness Jie Ren Die Zhang Yisen Wang Lu Chen Zhanpeng Zhou ... Xu Cheng Xin Eric Wang Meng Zhou Jie Shi Quanshi Zhang AAML 64 22 0 12 Mar 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision Chao Jia Yinfei Yang Ye Xia Yi-Ting Chen Zarana Parekh Hieu H. Pham Quoc V. Le Yun-hsuan Sung Zhen Li Tom Duerig VLM CLIP 293 2,875 0 11 Feb 2021