Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1902.03751
Cited By
v1
v2 (latest)
Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded
IEEE International Conference on Computer Vision (ICCV), 2019
11 February 2019
Ramprasaath R. Selvaraju
Stefan Lee
Yilin Shen
Hongxia Jin
Shalini Ghosh
Larry Heck
Dhruv Batra
Devi Parikh
FAtt
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded"
49 / 149 papers shown
AdaVQA: Overcoming Language Priors with Adapted Margin Cosine Loss
International Joint Conference on Artificial Intelligence (IJCAI), 2021
Yangyang Guo
Liqiang Nie
Zhiyong Cheng
Feng Ji
Ji Zhang
Marco Bertini
144
41
0
05 May 2021
Improved and efficient inter-vehicle distance estimation using road gradients of both ego and target vehicles
International Conference on Autonomic and Autonomous Systems (ICAAS), 2021
Robik Shrestha
Jinkyu Lee
Kushal Kafle
S. Hwang
Il Yong Chun
152
13
0
01 Apr 2021
Towards Interpreting and Mitigating Shortcut Learning Behavior of NLU Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Mengnan Du
Varun Manjunatha
R. Jain
Ruchi Deshpande
Franck Dernoncourt
Jiuxiang Gu
Tong Sun
Helen Zhou
285
118
0
11 Mar 2021
Detecting Spurious Correlations with Sanity Tests for Artificial Intelligence Guided Radiology Systems
Frontiers in Digital Health (FDH), 2021
U. Mahmood
Robik Shrestha
D. Bates
L. Mannelli
G. Corrias
Y. Erdi
Christopher Kanan
186
20
0
04 Mar 2021
EnD: Entangling and Disentangling deep representations for bias correction
Computer Vision and Pattern Recognition (CVPR), 2021
Enzo Tartaglione
C. Barbano
Marco Grangetto
306
137
0
02 Mar 2021
When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data
Peter Hase
Joey Tianyi Zhou
XAI
455
91
0
03 Feb 2021
Answer Questions with Right Image Regions: A Visual Attention Regularization Approach
Zichen Liu
Yangyang Guo
Jianhua Yin
Xuemeng Song
Weifeng Liu
Liqiang Nie
188
34
0
03 Feb 2021
Object-Centric Diagnosis of Visual Reasoning
Jianwei Yang
Jiayuan Mao
Jiajun Wu
Devi Parikh
David D. Cox
J. Tenenbaum
Chuang Gan
OCL
193
17
0
21 Dec 2020
Learning content and context with language bias for Visual Question Answering
IEEE International Conference on Multimedia and Expo (ICME), 2020
Chao Yang
Su Feng
Dongsheng Li
Huawei Shen
Guoqing Wang
Bin Jiang
156
24
0
21 Dec 2020
Overcoming Language Priors with Self-supervised Learning for Visual Question Answering
International Joint Conference on Artificial Intelligence (IJCAI), 2020
Xi Zhu
Zhendong Mao
Chunxiao Liu
Peng Zhang
Bin Wang
Yongdong Zhang
SSL
172
132
0
17 Dec 2020
A Closer Look at the Robustness of Vision-and-Language Pre-trained Models
Linjie Li
Zhe Gan
Jingjing Liu
VLM
264
50
0
15 Dec 2020
Debiased-CAM to mitigate image perturbations with faithful visual explanations of machine learning
International Conference on Human Factors in Computing Systems (CHI), 2020
Wencan Zhang
Mariella Dimiccoli
Brian Y. Lim
FAtt
372
20
0
10 Dec 2020
CASTing Your Model: Learning to Localize Improves Self-Supervised Representations
Ramprasaath R. Selvaraju
Karan Desai
Justin Johnson
Nikhil Naik
SSL
170
84
0
08 Dec 2020
ProtoPShare: Prototype Sharing for Interpretable Image Classification and Similarity Discovery
Knowledge Discovery and Data Mining (KDD), 2020
Dawid Rymarczyk
Lukasz Struski
Jacek Tabor
Bartosz Zieliñski
215
135
0
29 Nov 2020
Right for the Right Concept: Revising Neuro-Symbolic Concepts by Interacting with their Explanations
Computer Vision and Pattern Recognition (CVPR), 2020
Wolfgang Stammer
P. Schramowski
Kristian Kersting
FAtt
518
127
0
25 Nov 2020
mForms : Multimodal Form-Filling with Question Answering
International Conference on Language Resources and Evaluation (LREC), 2020
Larry Heck
S. Heck
Anirudh S. Sundar
357
7
0
24 Nov 2020
Loss re-scaling VQA: Revisiting the LanguagePrior Problem from a Class-imbalance View
IEEE Transactions on Image Processing (TIP), 2020
Yangyang Guo
Liqiang Nie
Zhiyong Cheng
Q. Tian
Min Zhang
362
79
0
30 Oct 2020
Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional Entropies
Neural Information Processing Systems (NeurIPS), 2020
Itai Gat
Idan Schwartz
Alex Schwing
Tamir Hazan
261
100
0
21 Oct 2020
SOrT-ing VQA Models : Contrastive Gradient Learning for Improved Consistency
North American Chapter of the Association for Computational Linguistics (NAACL), 2020
Sameer Dharur
Purva Tendulkar
Dhruv Batra
Devi Parikh
Ramprasaath R. Selvaraju
162
2
0
20 Oct 2020
Room-Across-Room: Multilingual Vision-and-Language Navigation with Dense Spatiotemporal Grounding
Alexander Ku
Peter Anderson
Roma Patel
Eugene Ie
Jason Baldridge
240
417
0
15 Oct 2020
Remembering for the Right Reasons: Explanations Reduce Catastrophic Forgetting
International Conference on Learning Representations (ICLR), 2020
Sayna Ebrahimi
Suzanne Petryk
Akash Gokul
William Gan
Joseph E. Gonzalez
Marcus Rohrbach
Trevor Darrell
CLL
220
51
0
04 Oct 2020
Trustworthy Convolutional Neural Networks: A Gradient Penalized-based Approach
Nicholas F Halliwell
Freddy Lecue
FAtt
226
9
0
29 Sep 2020
AiR: Attention with Reasoning Capability
European Conference on Computer Vision (ECCV), 2020
Shi Chen
Ming Jiang
Jinhui Yang
Qi Zhao
LRM
150
44
0
28 Jul 2020
Comprehensive Image Captioning via Scene Graph Decomposition
European Conference on Computer Vision (ECCV), 2020
Yiwu Zhong
Liwei Wang
Jianshu Chen
Dong Yu
Yin Li
249
138
0
23 Jul 2020
Reducing Language Biases in Visual Question Answering with Visually-Grounded Question Encoder
European Conference on Computer Vision (ECCV), 2020
K. Gouthaman
Anurag Mittal
373
88
0
13 Jul 2020
Improving VQA and its Explanations \\ by Comparing Competing Explanations
Jialin Wu
Liyan Chen
Raymond J. Mooney
FAtt
AAML
210
18
0
28 Jun 2020
Overcoming Statistical Shortcuts for Open-ended Visual Counting
Corentin Dancette
Rémi Cadène
Xinlei Chen
Matthieu Cord
207
3
0
17 Jun 2020
Estimating semantic structure for the VQA answer space
Corentin Kervadec
G. Antipov
M. Baccouche
Christian Wolf
180
5
0
10 Jun 2020
Roses Are Red, Violets Are Blue... but Should Vqa Expect Them To?
Corentin Kervadec
G. Antipov
M. Baccouche
Christian Wolf
OOD
280
99
0
09 Jun 2020
Counterfactual VQA: A Cause-Effect Look at Language Bias
Yulei Niu
Kaihua Tang
Hanwang Zhang
Zhiwu Lu
Xiansheng Hua
Ji-Rong Wen
CML
537
479
0
08 Jun 2020
Hierarchical Class-Based Curriculum Loss
Palash Goyal
Shalini Ghosh
124
9
0
05 Jun 2020
On the Value of Out-of-Distribution Testing: An Example of Goodhart's Law
Damien Teney
Kushal Kafle
Robik Shrestha
Ehsan Abbasnejad
Christopher Kanan
Anton Van Den Hengel
OODD
OOD
240
153
0
19 May 2020
Learning What Makes a Difference from Counterfactual Examples and Gradient Supervision
European Conference on Computer Vision (ECCV), 2020
Damien Teney
Ehsan Abbasnejad
Anton Van Den Hengel
OOD
SSL
CML
223
125
0
20 Apr 2020
Visual Grounding Methods for VQA are Working for the Wrong Reasons!
Robik Shrestha
Kushal Kafle
Christopher Kanan
CML
306
34
0
12 Apr 2020
Egoshots, an ego-vision life-logging dataset and semantic fidelity metric to evaluate diversity in image captioning models
International Conference on Learning Representations (ICLR), 2020
Pranav Agarwal
Alejandro Betancourt
V. Panagiotou
Natalia Díaz Rodríguez
EGVM
223
11
0
26 Mar 2020
Counterfactual Samples Synthesizing for Robust Visual Question Answering
Computer Vision and Pattern Recognition (CVPR), 2020
Long Chen
Xin Yan
Jun Xiao
Hanwang Zhang
Shiliang Pu
Yueting Zhuang
OOD
AAML
386
319
0
14 Mar 2020
Explainable Deep Classification Models for Domain Generalization
Andrea Zunino
Sarah Adel Bargal
Riccardo Volpi
M. Sameki
Jianming Zhang
Stan Sclaroff
Vittorio Murino
Kate Saenko
FAtt
168
45
0
13 Mar 2020
Cross-modal Learning for Multi-modal Video Categorization
Palash Goyal
Saurabh Sahu
Shalini Ghosh
Chul Lee
275
10
0
07 Mar 2020
Exploiting Temporal Coherence for Multi-modal Video Categorization
Palash Goyal
Saurabh Sahu
Shalini Ghosh
Chul Lee
132
1
0
07 Feb 2020
SQuINTing at VQA Models: Introspecting VQA Models with Sub-Questions
Ramprasaath R. Selvaraju
Purva Tendulkar
Devi Parikh
Eric Horvitz
Marco Tulio Ribeiro
Besmira Nushi
Ece Kamar
LRM
160
14
0
20 Jan 2020
Making deep neural networks right for the right scientific reasons by interacting with their explanations
Nature Machine Intelligence (NMI), 2020
P. Schramowski
Wolfgang Stammer
Stefano Teso
Anna Brugger
Xiaoting Shao
Hans-Georg Luigs
Anne-Katrin Mahlein
Kristian Kersting
609
240
0
15 Jan 2020
Explain and Improve: LRP-Inference Fine-Tuning for Image Captioning Models
Information Fusion (Inf. Fusion), 2020
Jiamei Sun
Sebastian Lapuschkin
Wojciech Samek
Alexander Binder
FAtt
631
36
0
04 Jan 2020
Connecting Vision and Language with Localized Narratives
European Conference on Computer Vision (ECCV), 2019
Jordi Pont-Tuset
J. Uijlings
Soravit Changpinyo
Radu Soricut
V. Ferrari
ObjD
498
287
0
06 Dec 2019
Bilinear Graph Networks for Visual Question Answering
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2019
Dalu Guo
Chang Xu
Dacheng Tao
GNN
199
68
0
23 Jul 2019
Learning to Generate Grounded Visual Captions without Localization Supervision
Chih-Yao Ma
Yannis Kalantidis
Ghassan AlRegib
Peter Vajda
Marcus Rohrbach
Z. Kira
SSL
394
10
0
01 Jun 2019
Self-Critical Reasoning for Robust Visual Question Answering
Neural Information Processing Systems (NeurIPS), 2019
Jialin Wu
Raymond J. Mooney
OOD
NAI
238
170
0
24 May 2019
VQA with no questions-answers training
Computer Vision and Pattern Recognition (CVPR), 2018
B. Vatashsky
S. Ullman
238
13
0
20 Nov 2018
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
1.2K
3,820
0
02 Dec 2016
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
International Journal of Computer Vision (IJCV), 2016
Ramprasaath R. Selvaraju
Michael Cogswell
Abhishek Das
Ramakrishna Vedantam
Devi Parikh
Dhruv Batra
FAtt
922
24,286
0
07 Oct 2016
Previous
1
2
3