1810.03649
Overcoming Language Priors in Visual Question Answering with Adversarial Regularization
8 October 2018
S. Ramakrishnan
Aishwarya Agrawal
Stefan Lee
AAML
Papers citing
"Overcoming Language Priors in Visual Question Answering with Adversarial Regularization"
50 / 138 papers shown
Title
PAI-Bench: A Comprehensive Benchmark For Physical AI
Fengzhe Zhou
Jiannan Huang
Jialuo Li
Deva Ramanan
Humphrey Shi
VGen
116
0
0
01 Dec 2025
Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts
Ellis L Brown
Jihan Yang
Shusheng Yang
Rob Fergus
Saining Xie
VLM
226
5
0
06 Nov 2025
Integrating Object Interaction Self-Attention and GAN-Based Debiasing for Visual Question Answering
Zhifei Li
Feng Qiu
Yiran Wang
Yujing Xia
Kui Xiao
Miao Zhang
Yan Zhang
140
0
0
25 Sep 2025
Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval
Dohwan Ko
Ji Soo Lee
M. Choi
Zihang Meng
Hyunwoo J. Kim
304
1
0
31 Jul 2025
MM-Prompt: Cross-Modal Prompt Tuning for Continual Visual Question Answering
Xu Li
Fan Lyu
LRM
172
0
0
26 May 2025
MLLMs are Deeply Affected by Modality Bias
Xu Zheng
Chenfei Liao
Yuqian Fu
Kaiyu Lei
Yuanhuiyi Lyu
...
Yu Jiang
Andrii Zadaianchuk
Dacheng Tao
Luc Van Gool
Xuming Hu
288
11
0
24 May 2025
QIRL: Boosting Visual Question Answering via Optimized Question-Image Relation Learning
Quanxing Xu
Ling Zhou
Zhuo Zhou
Feifei Zhang
Rubing Huang
Chia-Wen Lin
166
0
0
04 Apr 2025
FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning
Jie Ma
Zhitao Gao
Qi Chai
Jing Liu
Peijie Wang
Jing Tao
Zhou Su
351
5
0
01 Apr 2025
MASS: Overcoming Language Bias in Image-Text Matching
AAAI Conference on Artificial Intelligence (AAAI), 2025
Jiwan Chung
Seungwon Lim
Sangkyu Lee
Youngjae Yu
VLM
197
0
0
20 Jan 2025
Overcoming Language Priors for Visual Question Answering Based on Knowledge Distillation
IEEE International Conference on Multimedia and Expo (ICME), 2024
Daowan Peng
Wei Wei
903
2
0
10 Jan 2025
A Review of Multimodal Explainable Artificial Intelligence: Past, Present and Future
Shilin Sun
Wenbin An
Feng Tian
Fang Nan
Qidong Liu
Jing Liu
N. Shah
Ping Chen
336
19
0
18 Dec 2024
Task Progressive Curriculum Learning for Robust Visual Question Answering
Ahmed Akl
Abdelwahed Khamis
Zhe Wang
Ali Cheraghian
Sara Khalifa
Kewen Wang
OOD
250
0
0
26 Nov 2024
A Comprehensive Survey on Visual Question Answering Datasets and Algorithms
Raihan Kabir
Naznin Haque
Md. Saiful Islam
Marium-E. Jannat
CoGe
253
8
0
17 Nov 2024
Modality-Fair Preference Optimization for Trustworthy MLLM Alignment
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Songtao Jiang
Yan Zhang
Ruizhe Chen
Yeying Jin
Zuozhu Liu
Qinglin He
Yang Feng
Jian Wu
Zuozhu Liu
MoE
MLLM
287
18
0
20 Oct 2024
NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples
Neural Information Processing Systems (NeurIPS), 2024
Baiqi Li
Zhiqiu Lin
Wenxuan Peng
Jean de Dieu Nyandwi
Daniel Jiang
Zixian Ma
Simran Khanuja
Ranjay Krishna
Graham Neubig
Deva Ramanan
AAML
CoGe
VLM
604
59
0
18 Oct 2024
Leveraging Grammar Induction for Language Understanding and Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jushi Kai
Shengyuan Hou
Yusheng Huang
Zhouhan Lin
126
3
0
07 Oct 2024
Efficient Bias Mitigation Without Privileged Information
European Conference on Computer Vision (ECCV), 2024
Mateo Espinosa Zarlenga
Swami Sankaranarayanan
Jerone T. A. Andrews
Z. Shams
M. Jamnik
Alice Xiang
268
6
0
26 Sep 2024
VolDoGer: LLM-assisted Datasets for Domain Generalization in Vision-Language Tasks
Juhwan Choi
Junehyoung Kwon
Jungmin Yun
Seunguk Yu
Youngbin Kim
269
3
0
29 Jul 2024
Unveiling and Mitigating Bias in Audio Visual Segmentation
Peiwen Sun
Honggang Zhang
Di Hu
216
11
0
23 Jul 2024
Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question Answering
Jie Ma
Min Hu
Pinghui Wang
Wangchun Sun
Lingyun Song
Hongbin Pei
Jun Liu
Youtian Du
463
15
0
18 Apr 2024
FocusCLIP: Multimodal Subject-Level Guidance for Zero-Shot Transfer in Human-Centric Tasks
Muhammad Gul Zain Ali Khan
Muhammad Ferjad Naeem
F. Tombari
Luc Van Gool
Didier Stricker
Muhammad Zeshan Afzal
VLM
CLIP
169
0
0
11 Mar 2024
Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection
Taeheon Kim
Sebin Shin
Youngjoon Yu
Hak Gu Kim
Y. Ro
264
14
0
02 Mar 2024
Grounding Language Models for Visual Entity Recognition
Zilin Xiao
Ming Gong
Paola Cascante-Bonilla
Xingyao Zhang
Jie Wu
Vicente Ordonez
VLM
239
13
0
28 Feb 2024
Revisiting the Dataset Bias Problem from a Statistical Perspective
European Conference on Artificial Intelligence (ECAI), 2024
Kien Do
D. Nguyen
Hung Le
T. Le
Dang Nguyen
Haripriya Harikumar
T. Tran
Santu Rana
Svetha Venkatesh
156
0
0
05 Feb 2024
From Text to Multimodal: A Comprehensive Survey of Adversarial Example Generation in Question Answering Systems
Gulsum Yigit
M. Amasyalı
AAML
153
0
0
26 Dec 2023
Object Attribute Matters in Visual Question Answering
Peize Li
Q. Si
Peng Fu
Zheng Lin
Yan Wang
213
0
0
20 Dec 2023
Understanding Unimodal Bias in Multimodal Deep Linear Networks
International Conference on Machine Learning (ICML), 2023
Yedi Zhang
Peter E. Latham
Andrew Saxe
248
14
0
01 Dec 2023
Debiasing Multimodal Models via Causal Information Minimization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Vaidehi Patil
A. Maharana
Mohit Bansal
CML
223
4
0
28 Nov 2023
Large Language Models are Temporal and Causal Reasoners for Video Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Dohwan Ko
Ji Soo Lee
Wooyoung Kang
Byungseok Roh
Hyunwoo J. Kim
LRM
338
53
0
24 Oct 2023
Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zhecan Wang
Long Chen
Haoxuan You
Keyang Xu
Yicheng He
Wenhao Li
Noel Codella
Kai-Wei Chang
Shih-Fu Chang
282
7
0
23 Oct 2023
Open-vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models
IEEE International Conference on Computer Vision (ICCV), 2023
Dohwan Ko
Ji Soo Lee
M. Choi
Jaewon Chu
Jihwan Park
Hyunwoo J. Kim
151
6
0
18 Aug 2023
Robust Visual Question Answering: Datasets, Methods, and Future Challenges
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jie Ma
Pinghui Wang
Dechen Kong
Zewei Wang
Jun Liu
Hongbin Pei
Junzhou Zhao
OOD
287
43
0
21 Jul 2023
Improving Selective Visual Question Answering by Learning from Your Peers
Computer Vision and Pattern Recognition (CVPR), 2023
Corentin Dancette
Spencer Whitehead
Rishabh Maheshwary
Ramakrishna Vedantam
Stefan Scherer
Xinlei Chen
Matthieu Cord
Marcus Rohrbach
AAML
OOD
194
24
0
14 Jun 2023
Unveiling Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA
Ali Vosoughi
Shijian Deng
Songyang Zhang
Yapeng Tian
Chenliang Xu
Jiebo Luo
CML
200
3
0
31 May 2023
Run Like a Girl! Sports-Related Gender Bias in Language and Vision
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
S. Harrison
Eleonora Gualdoni
Gemma Boleda
121
6
0
23 May 2023
An Empirical Study on the Language Modal in Visual Question Answering
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Daowan Peng
Wei Wei
Xian-Ling Mao
Yuanyuan Fu
Dangyang Chen
204
5
0
17 May 2023
SC-ML: Self-supervised Counterfactual Metric Learning for Debiased Visual Question Answering
Xinyao Shu
Shiyang Yan
Xu Yang
Ziheng Wu
Zhongfeng Chen
Zhenyu Lu
SSL
143
0
0
04 Apr 2023
Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasoning
Shi Chen
Qi Zhao
160
8
0
18 Mar 2023
Debiased Fine-Tuning for Vision-language Models by Prompt Regularization
AAAI Conference on Artificial Intelligence (AAAI), 2023
B. Zhu
Yulei Niu
Saeil Lee
Minhoe Hur
Hanwang Zhang
VLM
VPVLM
345
31
0
29 Jan 2023
Understanding ME? Multimodal Evaluation for Fine-grained Visual Commonsense
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Zhecan Wang
Haoxuan You
Yicheng He
Wenhao Li
Kai-Wei Chang
Shih-Fu Chang
227
6
0
10 Nov 2022
Compressing And Debiasing Vision-Language Pre-Trained Models for Visual Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Q. Si
Yuanxin Liu
Zheng Lin
Peng Fu
Weiping Wang
VLM
245
2
0
26 Oct 2022
Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Q. Si
Fandong Meng
Mingyu Zheng
Zheng Lin
Yuanxin Liu
Peng Fu
Yanan Cao
Weiping Wang
Jie Zhou
137
30
0
10 Oct 2022
Towards Robust Visual Question Answering: Making the Most of Biased Samples via Contrastive Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Q. Si
Yuanxin Liu
Fandong Meng
Zheng Lin
Peng Fu
Yanan Cao
Weiping Wang
Jie Zhou
201
28
0
10 Oct 2022
Overcoming Language Priors in Visual Question Answering via Distinguishing Superficially Similar Instances
International Conference on Computational Linguistics (COLING), 2022
Yike Wu
Yu Zhao
Shiwan Zhao
Ying Zhang
Xiaojie Yuan
Guoqing Zhao
Ning Jiang
203
25
0
18 Sep 2022
Generative Bias for Robust Visual Question Answering
Computer Vision and Pattern Recognition (CVPR), 2022
Jae-Won Cho
Dong-Jin Kim
H. Ryu
In So Kweon
OOD
CML
302
30
0
01 Aug 2022
Visual Perturbation-aware Collaborative Learning for Overcoming the Language Prior Problem
Yudong Han
Liqiang Nie
Jianhua Yin
Yue Yu
Yan Yan
211
23
0
24 Jul 2022
Semantic-aware Modular Capsule Routing for Visual Question Answering
IEEE Transactions on Image Processing (IEEE TIP), 2022
Yudong Han
Jianhua Yin
Yue Yu
Yin-wei Wei
Liqiang Nie
179
10
0
21 Jul 2022
Rethinking Data Augmentation for Robust Visual Question Answering
European Conference on Computer Vision (ECCV), 2022
Long Chen
Yuhang Zheng
Jun Xiao
OOD
168
51
0
18 Jul 2022
PReGAN: Answer Oriented Passage Ranking with Weakly Supervised GAN
Pan Du
J. Nie
Yutao Zhu
Hao Jiang
Lixin Zou
Xiaohui Yan
694
3
0
05 Jul 2022
Guiding Visual Question Answering with Attention Priors
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
T. Le
Vuong Le
Sunil R. Gupta
Svetha Venkatesh
T. Tran
186
8
0
25 May 2022