Overcoming Language Priors in Visual Question Answering with Adversarial Regularization
arXiv:1810.03649

8 October 2018
S. Ramakrishnan
Aishwarya Agrawal
Stefan Lee
    AAML

Papers citing "Overcoming Language Priors in Visual Question Answering with Adversarial Regularization"

50 / 138 papers shown
QLEVR: A Diagnostic Dataset for Quantificational Language and Elementary Visual Reasoning
Zechen Li
Anders Søgaard
114
7
0
06 May 2022
Multimodal Adaptive Distillation for Leveraging Unimodal Encoders for Vision-Language Tasks
Zhecan Wang
Noel Codella
Yen-Chun Chen
Luowei Zhou
Xiyang Dai
...
Jianwei Yang
Haoxuan You
Kai-Wei Chang
Shih-Fu Chang
Lu Yuan
VLM OffRL
192
27
0
22 Apr 2022
OccamNets: Mitigating Dataset Bias by Favoring Simpler Hypotheses
European Conference on Computer Vision (ECCV), 2022
Robik Shrestha
Kushal Kafle
Christopher Kanan
CML
284
14
0
05 Apr 2022
SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering
Computer Vision and Pattern Recognition (CVPR), 2022
Vipul Gupta
Zhuowan Li
Adam Kortylewski
Chenyu Zhang
Yingwei Li
Alan Yuille
167
53
0
05 Apr 2022
A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach
Xiaohan Lan
Yitian Yuan
Xin Eric Wang
Long Chen
Zhi Wang
Lin Ma
Wenwu Zhu
CML
164
19
0
10 Mar 2022
On Modality Bias Recognition and Reduction
Yangyang Guo
Liqiang Nie
Harry Cheng
Zhiyong Cheng
Mohan S. Kankanhalli
Marco Bertini
254
48
0
25 Feb 2022
Webly Supervised Concept Expansion for General Purpose Vision Models
European Conference on Computer Vision (ECCV), 2022
Amita Kamath
Christopher Clark
Tanmay Gupta
Eric Kolve
Derek Hoiem
Aniruddha Kembhavi
VLM
264
65
0
04 Feb 2022
Grounding Answers for Visual Questions Asked by Visually Impaired People
Computer Vision and Pattern Recognition (CVPR), 2022
Chongyan Chen
Samreen Anjum
Danna Gurari
244
59
0
04 Feb 2022
Language-biased image classification: evaluation based on semantic representations
International Conference on Learning Representations (ICLR), 2022
Yoann Lemesle
Masataka Sawayama
Guillermo Valle Pérez
Maxime Adolphe
Hélène Sauzéon
Pierre-Yves Oudeyer
VLM
122
8
0
26 Jan 2022
Improving the fusion of acoustic and text representations in RNN-T
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Chao Zhang
Yue Liu
Zhiyun Lu
Tara N. Sainath
Shuo-yiin Chang
AI4CE
178
13
0
25 Jan 2022
CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks
Zhecan Wang
Noel Codella
Yen-Chun Chen
Luowei Zhou
Jianwei Yang
Xiyang Dai
Bin Xiao
Haoxuan You
Shih-Fu Chang
Lu Yuan
CLIP VLM
181
44
0
15 Jan 2022
Learning Sample Importance for Cross-Scenario Video Temporal Grounding
International Conference on Multimedia Retrieval (ICMR), 2022
P. Bao
Yadong Mu
130
13
0
08 Jan 2022
General Greedy De-bias Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Xinzhe Han
Shuhui Wang
Chi Su
Qingming Huang
Qi Tian
423
17
0
20 Dec 2021
Medical Visual Question Answering: A Survey
Zhihong Lin
Donghao Zhang
Qingyi Tao
Danli Shi
Gholamreza Haffari
Qi Wu
M. He
Z. Ge
279
171
0
19 Nov 2021
Language bias in Visual Question Answering: A Survey and Taxonomy
Desen Yuan
213
15
0
16 Nov 2021
Towards Debiasing Temporal Sentence Grounding in Video
Hao Zhang
Aixin Sun
Wei Jing
Qiufeng Wang
170
20
0
08 Nov 2021
Introspective Distillation for Robust Question Answering
Neural Information Processing Systems (NeurIPS), 2021
Yulei Niu
Hanwang Zhang
250
72
0
01 Nov 2021
Perceptual Score: What Data Modalities Does Your Model Perceive?
Itai Gat
Idan Schwartz
Alex Schwing
177
42
0
27 Oct 2021
Review-Based Domain Disentanglement without Duplicate Users or Contexts for Cross-Domain Recommendation
International Conference on Information and Knowledge Management (CIKM), 2021
Yoonhyuk Choi
Jiho Choi
Taewook Ko
HyungHo Byun
Qiongxiong Ma
200
19
0
25 Oct 2021
Counterfactual Samples Synthesizing and Training for Robust Visual Question Answering
Long Chen
Yuhang Zheng
Yulei Niu
Hanwang Zhang
Jun Xiao
AAML OOD
251
46
0
03 Oct 2021
Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images
Zhuowan Li
Elias Stengel-Eskin
Yixiao Zhang
Cihang Xie
Q. Tran
Benjamin Van Durme
Alan Yuille
VLM
153
17
0
01 Oct 2021
Raising context awareness in motion forecasting
H. Ben-younes
Éloi Zablocki
Mickaël Chen
P. Pérez
Matthieu Cord
TTA
305
12
0
16 Sep 2021
Discovering the Unknown Knowns: Turning Implicit Knowledge in the Dataset into Explicit Training Examples for Visual Question Answering
Jihyung Kil
Cheng Zhang
D. Xuan
Wei-Lun Chao
228
23
0
13 Sep 2021
On the Significance of Question Encoder Sequence Model in the Out-of-Distribution Performance in Visual Question Answering
K. Gouthaman
Anurag Mittal
CML
206
0
0
28 Aug 2021
Greedy Gradient Ensemble for Robust Visual Question Answering
IEEE International Conference on Computer Vision (ICCV), 2021
Xinzhe Han
Shuhui Wang
Chi Su
Qingming Huang
Q. Tian
198
89
0
27 Jul 2021
Neural Abstructions: Abstractions that Support Construction for Grounded Language Learning
Kaylee Burns
Christopher D. Manning
Li Fei-Fei
178
0
0
20 Jul 2021
Separating Skills and Concepts for Novel Visual Question Answering
Computer Vision and Pattern Recognition (CVPR), 2021
Spencer Whitehead
Hui Wu
Heng Ji
Rogerio Feris
Kate Saenko
CoGe
171
38
0
19 Jul 2021
Check It Again: Progressive Visual Question Answering via Visual Entailment
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Q. Si
Zheng Lin
Mingyu Zheng
Peng Fu
Weiping Wang
139
52
0
08 Jun 2021
LPF: A Language-Prior Feedback Objective Function for De-biased Visual Question Answering
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2021
Zujie Liang
Haifeng Hu
Jiaying Zhu
179
44
0
29 May 2021
AdaVQA: Overcoming Language Priors with Adapted Margin Cosine Loss
International Joint Conference on Artificial Intelligence (IJCAI), 2021
Yangyang Guo
Liqiang Nie
Zhiyong Cheng
Feng Ji
Ji Zhang
Marco Bertini
125
41
0
05 May 2021
Worst of Both Worlds: Biases Compound in Pre-trained Vision-and-Language Models
Tejas Srinivasan
Yonatan Bisk
VLM
288
63
0
18 Apr 2021
Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering
IEEE International Conference on Computer Vision (ICCV), 2021
Corentin Dancette
Rémi Cadène
Damien Teney
Matthieu Cord
CML
291
91
0
07 Apr 2021
Improved and efficient inter-vehicle distance estimation using road gradients of both ego and target vehicles
International Conference on Autonomic and Autonomous Systems (ICAAS), 2021
Robik Shrestha
Jinkyu Lee
Kushal Kafle
S. Hwang
Il Yong Chun
144
13
0
01 Apr 2021
Domain-robust VQA with diverse datasets and methods but no target labels
Computer Vision and Pattern Recognition (CVPR), 2021
Ruotong Wang
Tristan D. Maidment
Ahmad Diab
Adriana Kovashka
R. Hwa
OOD
266
25
0
29 Mar 2021
Detecting Spurious Correlations with Sanity Tests for Artificial Intelligence Guided Radiology Systems
Frontiers in Digital Health (FDH), 2021
U. Mahmood
Robik Shrestha
D. Bates
L. Mannelli
G. Corrias
Y. Erdi
Christopher Kanan
159
19
0
04 Mar 2021
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
Computer Vision and Pattern Recognition (CVPR), 2021
Soravit Changpinyo
P. Sharma
Nan Ding
Radu Soricut
VLM
1.1K
1,348
0
17 Feb 2021
Answer Questions with Right Image Regions: A Visual Attention Regularization Approach
Zichen Liu
Yangyang Guo
Jianhua Yin
Xuemeng Song
Weifeng Liu
Liqiang Nie
165
34
0
03 Feb 2021
Mitigating the Position Bias of Transformer Models in Passage Re-Ranking
European Conference on Information Retrieval (ECIR), 2021
Sebastian Hofstatter
Aldo Lipani
Sophia Althammer
Markus Zlabinger
Allan Hanbury
298
21
0
18 Jan 2021
Explainability of deep vision-based autonomous driving systems: Review and challenges
International Journal of Computer Vision (IJCV), 2021
Éloi Zablocki
H. Ben-younes
P. Pérez
Matthieu Cord
XAI
428
205
0
13 Jan 2021
Object-Centric Diagnosis of Visual Reasoning
Jianwei Yang
Jiayuan Mao
Jiajun Wu
Devi Parikh
David D. Cox
J. Tenenbaum
Chuang Gan
OCL
174
17
0
21 Dec 2020
Learning content and context with language bias for Visual Question Answering
IEEE International Conference on Multimedia and Expo (ICME), 2020
Chao Yang
Su Feng
Dongsheng Li
Huawei Shen
Guoqing Wang
Bin Jiang
148
24
0
21 Dec 2020
Trying Bilinear Pooling in Video-QA
T. Winterbottom
S. Xiao
A. McLean
Noura Al Moubayed
183
4
0
18 Dec 2020
On Modality Bias in the TVQA Dataset
British Machine Vision Conference (BMVC), 2020
T. Winterbottom
S. Xiao
A. McLean
Noura Al Moubayed
169
44
0
18 Dec 2020
Overcoming Language Priors with Self-supervised Learning for Visual Question Answering
International Joint Conference on Artificial Intelligence (IJCAI), 2020
Xi Zhu
Zhendong Mao
Chunxiao Liu
Peng Zhang
Bin Wang
Yongdong Zhang
SSL
151
132
0
17 Dec 2020
Loss re-scaling VQA: Revisiting the LanguagePrior Problem from a Class-imbalance View
IEEE Transactions on Image Processing (TIP), 2020
Yangyang Guo
Liqiang Nie
Zhiyong Cheng
Q. Tian
Min Zhang
327
79
0
30 Oct 2020
Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional Entropies
Neural Information Processing Systems (NeurIPS), 2020
Itai Gat
Idan Schwartz
Alex Schwing
Tamir Hazan
234
98
0
21 Oct 2020
SOrT-ing VQA Models : Contrastive Gradient Learning for Improved Consistency
North American Chapter of the Association for Computational Linguistics (NAACL), 2020
Sameer Dharur
Purva Tendulkar
Dhruv Batra
Devi Parikh
Ramprasaath R. Selvaraju
131
2
0
20 Oct 2020
Multimodal Research in Vision and Language: A Review of Current and Emerging Trends
Shagun Uppal
Sarthak Bhagat
Devamanyu Hazarika
Navonil Majumdar
Soujanya Poria
Roger Zimmermann
Amir Zadeh
269
6
0
19 Oct 2020
Multimodal Speech Recognition with Unstructured Audio Masking
Tejas Srinivasan
Ramon Sanabria
Florian Metze
Desmond Elliott
CVBM
96
10
0
16 Oct 2020
Counterfactual Variable Control for Robust and Interpretable Question Answering
S. Yu
Yulei Niu
Shuohang Wang
Jing Jiang
Qianru Sun
AAML OOD
239
9
0
12 Oct 2020