Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2012.11528
Cited By
Overcoming Language Priors with Self-supervised Learning for Visual Question Answering
International Joint Conference on Artificial Intelligence (IJCAI), 2020
17 December 2020
Xi Zhu
Zhendong Mao
Chunxiao Liu
Peng Zhang
Bin Wang
Yongdong Zhang
SSL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Overcoming Language Priors with Self-supervised Learning for Visual Question Answering"
45 / 45 papers shown
Toward Automatic Safe Driving Instruction: A Large-Scale Vision Language Model Approach
Haruki Sakajo
Hiroshi Takato
Hiroshi Tsutsui
Komei Soda
Hidetaka Kamigaito
Taro Watanabe
MLLM
195
0
0
28 Nov 2025
Integrating Object Interaction Self-Attention and GAN-Based Debiasing for Visual Question Answering
Zhifei Li
Feng Qiu
Yiran Wang
Yujing Xia
Kui Xiao
Miao Zhang
Yan Zhang
239
0
0
25 Sep 2025
QIRL: Boosting Visual Question Answering via Optimized Question-Image Relation Learning
Quanxing Xu
Ling Zhou
Zhuo Zhou
Feifei Zhang
Rubing Huang
Chia-Wen Lin
202
0
0
04 Apr 2025
FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning
Jie Ma
Zhitao Gao
Qi Chai
Jing Liu
Peijie Wang
Jing Tao
Zhou Su
440
6
0
01 Apr 2025
Debiasing Multimodal Large Language Models via Noise-Aware Preference Optimization
Computer Vision and Pattern Recognition (CVPR), 2025
Zefeng Zhang
Hengzhu Tang
Shuaiyi Nie
Ying Tai
Yiming Ren
Zhenyang Li
Dawei Yin
Duohe Ma
Tingwen Liu
377
13
0
23 Mar 2025
Overcoming Language Priors for Visual Question Answering Based on Knowledge Distillation
IEEE International Conference on Multimedia and Expo (ICME), 2024
Daowan Peng
Wei Wei
935
2
0
10 Jan 2025
SAFE-MEME: Structured Reasoning Framework for Robust Hate Speech Detection in Memes
Palash Nandi
Shivam Sharma
Tanmoy Chakraborty
277
5
0
31 Dec 2024
CELLO: Causal Evaluation of Large Vision-Language Models
Meiqi Chen
Bo Peng
Yan Zhang
Chaochao Lu
LRM
ELM
289
9
0
27 Jun 2024
MemeMQA: Multimodal Question Answering for Memes via Rationale-Based Inferencing
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Siddhant Agarwal
Shivam Sharma
Preslav Nakov
Tanmoy Chakraborty
297
12
0
18 May 2024
Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question Answering
Jie Ma
Min Hu
Pinghui Wang
Wangchun Sun
Lingyun Song
Hongbin Pei
Jun Liu
Youtian Du
654
22
0
18 Apr 2024
Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding
Xintong Wang
Jingheng Pan
Liang Ding
Christian Biemann
MLLM
388
180
0
27 Mar 2024
Quantifying and Mitigating Unimodal Biases in Multimodal Large Language Models: A Causal Perspective
Meiqi Chen
Yixin Cao
Yan Zhang
Chaochao Lu
574
37
0
27 Mar 2024
Debiasing Multimodal Large Language Models via Penalization of Language Priors
Yi-Fan Zhang
Weichen Yu
Qingsong Wen
Qingsong Wen
Zhang Zhang
Wenjing Yang
Rong Jin
Tien-Ping Tan
Rong Jin
467
14
0
08 Mar 2024
Improving Data Augmentation for Robust Visual Question Answering with Effective Curriculum Learning
International Conference on Multimedia Retrieval (ICMR), 2024
Yuhang Zheng
Zhen Wang
Long Chen
257
3
0
28 Jan 2024
Object Attribute Matters in Visual Question Answering
Peize Li
Q. Si
Peng Fu
Zheng Lin
Yan Wang
297
1
0
20 Dec 2023
Making the V in Text-VQA Matter
Shamanthak Hegde
Soumya Jahagirdar
Shankar Gangisetty
CoGe
238
4
0
01 Aug 2023
Robust Visual Question Answering: Datasets, Methods, and Future Challenges
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jie Ma
Pinghui Wang
Dechen Kong
Zewei Wang
Jun Liu
Hongbin Pei
Junzhou Zhao
OOD
401
46
0
21 Jul 2023
Unveiling Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA
Ali Vosoughi
Shijian Deng
Songyang Zhang
Yapeng Tian
Chenliang Xu
Jiebo Luo
CML
250
3
0
31 May 2023
MEMEX: Detecting Explanatory Evidence for Memes via Knowledge-Enriched Contextualization
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Shivam Sharma
S Ramaneswaran
Udit Arora
Md. Shad Akhtar
Tanmoy Chakraborty
319
16
0
25 May 2023
Meta Neural Coordination
Yuwei Sun
OOD
212
0
0
20 May 2023
Visual Question Answering: A Survey on Techniques and Common Trends in Recent Literature
Ana Claudia Akemi Matsuki de Faria
Felype de Castro Bastos
Jose Victor Nogueira Alves da Silva
Vitor Lopes Fabris
Valeska Uchôa
Décio Gonccalves de Aguiar Neto
C. F. G. Santos
372
30
0
18 May 2023
An Empirical Study on the Language Modal in Visual Question Answering
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Daowan Peng
Wei Wei
Xian-Ling Mao
Yuanyuan Fu
Dangyang Chen
286
5
0
17 May 2023
SC-ML: Self-supervised Counterfactual Metric Learning for Debiased Visual Question Answering
Xinyao Shu
Shiyang Yan
Xu Yang
Ziheng Wu
Zhongfeng Chen
Zhenyu Lu
SSL
219
0
0
04 Apr 2023
What do you MEME? Generating Explanations for Visual Semantic Role Labelling in Memes
AAAI Conference on Artificial Intelligence (AAAI), 2022
Shivam Sharma
Siddhant Agarwal
Tharun Suresh
Preslav Nakov
Md. Shad Akhtar
Tanmoy Charkraborty
VLM
349
31
0
01 Dec 2022
Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Q. Si
Fandong Meng
Mingyu Zheng
Zheng Lin
Yuanxin Liu
Peng Fu
Yanan Cao
Weiping Wang
Jie Zhou
205
33
0
10 Oct 2022
Towards Robust Visual Question Answering: Making the Most of Biased Samples via Contrastive Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Q. Si
Yuanxin Liu
Fandong Meng
Zheng Lin
Peng Fu
Yanan Cao
Weiping Wang
Jie Zhou
312
29
0
10 Oct 2022
Overcoming Language Priors in Visual Question Answering via Distinguishing Superficially Similar Instances
International Conference on Computational Linguistics (COLING), 2022
Yike Wu
Yu Zhao
Shiwan Zhao
Ying Zhang
Xiaojie Yuan
Guoqing Zhao
Ning Jiang
247
26
0
18 Sep 2022
Bidirectional Contrastive Split Learning for Visual Question Answering
AAAI Conference on Artificial Intelligence (AAAI), 2022
Yuwei Sun
H. Ochiai
363
2
0
24 Aug 2022
Generative Bias for Robust Visual Question Answering
Computer Vision and Pattern Recognition (CVPR), 2022
Jae-Won Cho
Dong-Jin Kim
H. Ryu
In So Kweon
OOD
CML
444
34
0
01 Aug 2022
Visual Perturbation-aware Collaborative Learning for Overcoming the Language Prior Problem
Yudong Han
Liqiang Nie
Jianhua Yin
Yue Yu
Yan Yan
281
26
0
24 Jul 2022
Rethinking Data Augmentation for Robust Visual Question Answering
European Conference on Computer Vision (ECCV), 2022
Long Chen
Yuhang Zheng
Jun Xiao
OOD
297
54
0
18 Jul 2022
Visual Commonsense in Pretrained Unimodal and Multimodal Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Chenyu Zhang
Benjamin Van Durme
Zhuowan Li
Elias Stengel-Eskin
VLM
SSL
244
44
0
04 May 2022
COIN: Counterfactual Image Generation for VQA Interpretation
Zeyd Boukhers
Timo Hartmann
Jan Jurjens
182
7
0
10 Jan 2022
Language bias in Visual Question Answering: A Survey and Taxonomy
Desen Yuan
262
18
0
16 Nov 2021
Introspective Distillation for Robust Question Answering
Neural Information Processing Systems (NeurIPS), 2021
Yulei Niu
Hanwang Zhang
338
72
0
01 Nov 2021
Counterfactual Samples Synthesizing and Training for Robust Visual Question Answering
Long Chen
Yuhang Zheng
Yulei Niu
Hanwang Zhang
Jun Xiao
AAML
OOD
334
48
0
03 Oct 2021
Discovering the Unknown Knowns: Turning Implicit Knowledge in the Dataset into Explicit Training Examples for Visual Question Answering
Jihyung Kil
Cheng Zhang
D. Xuan
Wei-Lun Chao
318
23
0
13 Sep 2021
X-GGM: Graph Generative Modeling for Out-of-Distribution Generalization in Visual Question Answering
ACM Multimedia (ACM MM), 2021
Jingjing Jiang
Zi-yi Liu
Yifan Liu
Jingjing Jiang
N. Zheng
OOD
286
20
0
24 Jul 2021
Check It Again: Progressive Visual Question Answering via Visual Entailment
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Q. Si
Zheng Lin
Mingyu Zheng
Peng Fu
Weiping Wang
167
55
0
08 Jun 2021
LPF: A Language-Prior Feedback Objective Function for De-biased Visual Question Answering
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2021
Zujie Liang
Haifeng Hu
Jiaying Zhu
245
45
0
29 May 2021
Cross-Modal Generative Augmentation for Visual Question Answering
British Machine Vision Conference (BMVC), 2021
Zixu Wang
Yishu Miao
Lucia Specia
251
11
0
11 May 2021
Answer Questions with Right Image Regions: A Visual Attention Regularization Approach
Zichen Liu
Yangyang Guo
Jianhua Yin
Xuemeng Song
Weifeng Liu
Liqiang Nie
212
36
0
03 Feb 2021
Learning content and context with language bias for Visual Question Answering
IEEE International Conference on Multimedia and Expo (ICME), 2020
Chao Yang
Su Feng
Dongsheng Li
Huawei Shen
Guoqing Wang
Bin Jiang
218
25
0
21 Dec 2020
Loss re-scaling VQA: Revisiting the LanguagePrior Problem from a Class-imbalance View
IEEE Transactions on Image Processing (TIP), 2020
Yangyang Guo
Liqiang Nie
Zhiyong Cheng
Q. Tian
Min Zhang
409
83
0
30 Oct 2020
Counterfactual VQA: A Cause-Effect Look at Language Bias
Yulei Niu
Kaihua Tang
Hanwang Zhang
Zhiwu Lu
Xiansheng Hua
Ji-Rong Wen
CML
624
499
0
08 Jun 2020
1
Page 1 of 1