Overcoming Language Priors in Visual Question Answering with Adversarial Regularization

8 October 2018
S. Ramakrishnan
Aishwarya Agrawal
Stefan Lee
    AAML

Papers citing "Overcoming Language Priors in Visual Question Answering with Adversarial Regularization"

50 / 138 papers shown
PAI-Bench: A Comprehensive Benchmark For Physical AI
Fengzhe Zhou
Jiannan Huang
Jialuo Li
Deva Ramanan
Humphrey Shi
VGen
116
0
0
01 Dec 2025
Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts
Ellis L Brown
Jihan Yang
Shusheng Yang
Rob Fergus
Saining Xie
VLM
226
5
0
06 Nov 2025
Integrating Object Interaction Self-Attention and GAN-Based Debiasing for Visual Question Answering
Zhifei Li
Feng Qiu
Yiran Wang
Yujing Xia
Kui Xiao
Miao Zhang
Yan Zhang
140
0
0
25 Sep 2025
Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval
Dohwan Ko
Ji Soo Lee
M. Choi
Zihang Meng
Hyunwoo J. Kim
304
1
0
31 Jul 2025
MM-Prompt: Cross-Modal Prompt Tuning for Continual Visual Question Answering
Xu Li
Fan Lyu
LRM
172
0
0
26 May 2025
MLLMs are Deeply Affected by Modality Bias
Xu Zheng
Chenfei Liao
Yuqian Fu
Kaiyu Lei
Yuanhuiyi Lyu
...
Yu Jiang
Andrii Zadaianchuk
Dacheng Tao
Luc Van Gool
Xuming Hu
288
11
0
24 May 2025
QIRL: Boosting Visual Question Answering via Optimized Question-Image Relation Learning
Quanxing Xu
Ling Zhou
Zhuo Zhou
Feifei Zhang
Rubing Huang
Chia-Wen Lin
166
0
0
04 Apr 2025
FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning
Jie Ma
Zhitao Gao
Qi Chai
Jing Liu
Peijie Wang
Jing Tao
Zhou Su
351
5
0
01 Apr 2025
MASS: Overcoming Language Bias in Image-Text Matching
AAAI Conference on Artificial Intelligence (AAAI), 2025
Jiwan Chung
Seungwon Lim
Sangkyu Lee
Youngjae Yu
VLM
197
0
0
20 Jan 2025
Overcoming Language Priors for Visual Question Answering Based on Knowledge Distillation
IEEE International Conference on Multimedia and Expo (ICME), 2024
Daowan Peng
Wei Wei
903
2
0
10 Jan 2025
A Review of Multimodal Explainable Artificial Intelligence: Past, Present and Future
Shilin Sun
Wenbin An
Feng Tian
Fang Nan
Qidong Liu
Jing Liu
N. Shah
Ping Chen
336
19
0
18 Dec 2024
Task Progressive Curriculum Learning for Robust Visual Question Answering
Ahmed Akl
Abdelwahed Khamis
Zhe Wang
Ali Cheraghian
Sara Khalifa
Kewen Wang
OOD
250
0
0
26 Nov 2024
A Comprehensive Survey on Visual Question Answering Datasets and Algorithms
Raihan Kabir
Naznin Haque
Md. Saiful Islam
Marium-E. Jannat
CoGe
253
8
0
17 Nov 2024
Modality-Fair Preference Optimization for Trustworthy MLLM Alignment
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Songtao Jiang
Yan Zhang
Ruizhe Chen
Yeying Jin
Zuozhu Liu
Qinglin He
Yang Feng
Jian Wu
Zuozhu Liu
MoE, MLLM
287
18
0
20 Oct 2024
NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples
Neural Information Processing Systems (NeurIPS), 2024
Baiqi Li
Zhiqiu Lin
Wenxuan Peng
Jean de Dieu Nyandwi
Daniel Jiang
Zixian Ma
Simran Khanuja
Ranjay Krishna
Graham Neubig
Deva Ramanan
AAML, CoGe, VLM
604
59
0
18 Oct 2024
Leveraging Grammar Induction for Language Understanding and Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jushi Kai
Shengyuan Hou
Yusheng Huang
Zhouhan Lin
126
3
0
07 Oct 2024
Efficient Bias Mitigation Without Privileged Information
European Conference on Computer Vision (ECCV), 2024
Mateo Espinosa Zarlenga
Swami Sankaranarayanan
Jerone T. A. Andrews
Z. Shams
M. Jamnik
Alice Xiang
268
6
0
26 Sep 2024
VolDoGer: LLM-assisted Datasets for Domain Generalization in Vision-Language Tasks
Juhwan Choi
Junehyoung Kwon
Jungmin Yun
Seunguk Yu
Youngbin Kim
269
3
0
29 Jul 2024
Unveiling and Mitigating Bias in Audio Visual Segmentation
Peiwen Sun
Honggang Zhang
Di Hu
216
11
0
23 Jul 2024
Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question Answering
Jie Ma
Min Hu
Pinghui Wang
Wangchun Sun
Lingyun Song
Hongbin Pei
Jun Liu
Youtian Du
463
15
0
18 Apr 2024
FocusCLIP: Multimodal Subject-Level Guidance for Zero-Shot Transfer in Human-Centric Tasks
Muhammad Gul Zain Ali Khan
Muhammad Ferjad Naeem
F. Tombari
Luc Van Gool
Didier Stricker
Muhammad Zeshan Afzal
VLM, CLIP
169
0
0
11 Mar 2024
Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection
Taeheon Kim
Sebin Shin
Youngjoon Yu
Hak Gu Kim
Y. Ro
264
14
0
02 Mar 2024
Grounding Language Models for Visual Entity Recognition
Zilin Xiao
Ming Gong
Paola Cascante-Bonilla
Xingyao Zhang
Jie Wu
Vicente Ordonez
VLM
239
13
0
28 Feb 2024
Revisiting the Dataset Bias Problem from a Statistical Perspective
European Conference on Artificial Intelligence (ECAI), 2024
Kien Do
D. Nguyen
Hung Le
T. Le
Dang Nguyen
Haripriya Harikumar
T. Tran
Santu Rana
Svetha Venkatesh
156
0
0
05 Feb 2024
From Text to Multimodal: A Comprehensive Survey of Adversarial Example Generation in Question Answering Systems
Gulsum Yigit
M. Amasyalı
AAML
153
0
0
26 Dec 2023
Object Attribute Matters in Visual Question Answering
Peize Li
Q. Si
Peng Fu
Zheng Lin
Yan Wang
213
0
0
20 Dec 2023
Understanding Unimodal Bias in Multimodal Deep Linear Networks
International Conference on Machine Learning (ICML), 2023
Yedi Zhang
Peter E. Latham
Andrew Saxe
248
14
0
01 Dec 2023
Debiasing Multimodal Models via Causal Information Minimization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Vaidehi Patil
A. Maharana
Mohit Bansal
CML
223
4
0
28 Nov 2023
Large Language Models are Temporal and Causal Reasoners for Video Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Dohwan Ko
Ji Soo Lee
Wooyoung Kang
Byungseok Roh
Hyunwoo J. Kim
LRM
338
53
0
24 Oct 2023
Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zhecan Wang
Long Chen
Haoxuan You
Keyang Xu
Yicheng He
Wenhao Li
Noel Codella
Kai-Wei Chang
Shih-Fu Chang
282
7
0
23 Oct 2023
Open-vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models
IEEE International Conference on Computer Vision (ICCV), 2023
Dohwan Ko
Ji Soo Lee
M. Choi
Jaewon Chu
Jihwan Park
Hyunwoo J. Kim
151
6
0
18 Aug 2023
Robust Visual Question Answering: Datasets, Methods, and Future Challenges
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jie Ma
Pinghui Wang
Dechen Kong
Zewei Wang
Jun Liu
Hongbin Pei
Junzhou Zhao
OOD
287
43
0
21 Jul 2023
Improving Selective Visual Question Answering by Learning from Your Peers
Computer Vision and Pattern Recognition (CVPR), 2023
Corentin Dancette
Spencer Whitehead
Rishabh Maheshwary
Ramakrishna Vedantam
Stefan Scherer
Xinlei Chen
Matthieu Cord
Marcus Rohrbach
AAML, OOD
194
24
0
14 Jun 2023
Unveiling Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA
Ali Vosoughi
Shijian Deng
Songyang Zhang
Yapeng Tian
Chenliang Xu
Jiebo Luo
CML
200
3
0
31 May 2023
Run Like a Girl! Sports-Related Gender Bias in Language and Vision
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
S. Harrison
Eleonora Gualdoni
Gemma Boleda
121
6
0
23 May 2023
An Empirical Study on the Language Modal in Visual Question Answering
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Daowan Peng
Wei Wei
Xian-Ling Mao
Yuanyuan Fu
Dangyang Chen
204
5
0
17 May 2023
SC-ML: Self-supervised Counterfactual Metric Learning for Debiased Visual Question Answering
Xinyao Shu
Shiyang Yan
Xu Yang
Ziheng Wu
Zhongfeng Chen
Zhenyu Lu
SSL
143
0
0
04 Apr 2023
Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasoning
Shi Chen
Qi Zhao
160
8
0
18 Mar 2023
Debiased Fine-Tuning for Vision-language Models by Prompt Regularization
AAAI Conference on Artificial Intelligence (AAAI), 2023
B. Zhu
Yulei Niu
Saeil Lee
Minhoe Hur
Hanwang Zhang
VLM, VPVLM
345
31
0
29 Jan 2023
Understanding ME? Multimodal Evaluation for Fine-grained Visual Commonsense
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Zhecan Wang
Haoxuan You
Yicheng He
Wenhao Li
Kai-Wei Chang
Shih-Fu Chang
227
6
0
10 Nov 2022
Compressing And Debiasing Vision-Language Pre-Trained Models for Visual Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Q. Si
Yuanxin Liu
Zheng Lin
Peng Fu
Weiping Wang
VLM
245
2
0
26 Oct 2022
Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Q. Si
Fandong Meng
Mingyu Zheng
Zheng Lin
Yuanxin Liu
Peng Fu
Yanan Cao
Weiping Wang
Jie Zhou
137
30
0
10 Oct 2022
Towards Robust Visual Question Answering: Making the Most of Biased Samples via Contrastive Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Q. Si
Yuanxin Liu
Fandong Meng
Zheng Lin
Peng Fu
Yanan Cao
Weiping Wang
Jie Zhou
201
28
0
10 Oct 2022
Overcoming Language Priors in Visual Question Answering via Distinguishing Superficially Similar Instances
International Conference on Computational Linguistics (COLING), 2022
Yike Wu
Yu Zhao
Shiwan Zhao
Ying Zhang
Xiaojie Yuan
Guoqing Zhao
Ning Jiang
203
25
0
18 Sep 2022
Generative Bias for Robust Visual Question Answering
Computer Vision and Pattern Recognition (CVPR), 2022
Jae-Won Cho
Dong-Jin Kim
H. Ryu
In So Kweon
OOD, CML
302
30
0
01 Aug 2022
Visual Perturbation-aware Collaborative Learning for Overcoming the Language Prior Problem
Yudong Han
Liqiang Nie
Jianhua Yin
Yue Yu
Yan Yan
211
23
0
24 Jul 2022
Semantic-aware Modular Capsule Routing for Visual Question Answering
IEEE Transactions on Image Processing (IEEE TIP), 2022
Yudong Han
Jianhua Yin
Yue Yu
Yin-wei Wei
Liqiang Nie
179
10
0
21 Jul 2022
Rethinking Data Augmentation for Robust Visual Question Answering
European Conference on Computer Vision (ECCV), 2022
Long Chen
Yuhang Zheng
Jun Xiao
OOD
168
51
0
18 Jul 2022
PReGAN: Answer Oriented Passage Ranking with Weakly Supervised GAN
Pan Du
J. Nie
Yutao Zhu
Hao Jiang
Lixin Zou
Xiaohui Yan
694
3
0
05 Jul 2022
Guiding Visual Question Answering with Attention Priors
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
T. Le
Vuong Le
Sunil R. Gupta
Svetha Venkatesh
T. Tran
186
8
0
25 May 2022