v1v2 (latest)

Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded

IEEE International Conference on Computer Vision (ICCV), 2019

11 February 2019

Ramprasaath R. Selvaraju

Devi Parikh

Papers citing "Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded"

49 / 149 papers shown

AdaVQA: Overcoming Language Priors with Adapted Margin Cosine LossInternational Joint Conference on Artificial Intelligence (IJCAI), 2021

Ji Zhang

144

05 May 2021

Improved and efficient inter-vehicle distance estimation using road gradients of both ego and target vehiclesInternational Conference on Autonomic and Autonomous Systems (ICAAS), 2021

Robik Shrestha

Jinkyu Lee

Kushal Kafle

S. Hwang

Il Yong Chun

152

01 Apr 2021

Towards Interpreting and Mitigating Shortcut Learning Behavior of NLU ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021

Jiuxiang Gu

285

118

11 Mar 2021

Detecting Spurious Correlations with Sanity Tests for Artificial Intelligence Guided Radiology SystemsFrontiers in Digital Health (FDH), 2021

186

04 Mar 2021

EnD: Entangling and Disentangling deep representations for bias correctionComputer Vision and Pattern Recognition (CVPR), 2021

Enzo Tartaglione

C. Barbano

Marco Grangetto

306

137

02 Mar 2021

When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data

Peter Hase

Joey Tianyi Zhou

XAI

455

03 Feb 2021

Answer Questions with Right Image Regions: A Visual Attention Regularization Approach

188

03 Feb 2021

Object-Centric Diagnosis of Visual Reasoning

Jianwei Yang

Jiayuan Mao

Jiajun Wu

Devi Parikh

David D. Cox

J. Tenenbaum

Chuang Gan

OCL

193

21 Dec 2020

Learning content and context with language bias for Visual Question AnsweringIEEE International Conference on Multimedia and Expo (ICME), 2020

156

21 Dec 2020

Overcoming Language Priors with Self-supervised Learning for Visual Question AnsweringInternational Joint Conference on Artificial Intelligence (IJCAI), 2020

172

132

17 Dec 2020

A Closer Look at the Robustness of Vision-and-Language Pre-trained Models

264

15 Dec 2020

Debiased-CAM to mitigate image perturbations with faithful visual explanations of machine learningInternational Conference on Human Factors in Computing Systems (CHI), 2020

372

10 Dec 2020

CASTing Your Model: Learning to Localize Improves Self-Supervised Representations

Ramprasaath R. Selvaraju

170

08 Dec 2020

ProtoPShare: Prototype Sharing for Interpretable Image Classification and Similarity DiscoveryKnowledge Discovery and Data Mining (KDD), 2020

215

135

29 Nov 2020

Right for the Right Concept: Revising Neuro-Symbolic Concepts by Interacting with their ExplanationsComputer Vision and Pattern Recognition (CVPR), 2020

518

127

25 Nov 2020

mForms : Multimodal Form-Filling with Question AnsweringInternational Conference on Language Resources and Evaluation (LREC), 2020

Larry Heck

S. Heck

Anirudh S. Sundar

357

24 Nov 2020

Loss re-scaling VQA: Revisiting the LanguagePrior Problem from a Class-imbalance ViewIEEE Transactions on Image Processing (TIP), 2020

Min Zhang

362

30 Oct 2020

Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional EntropiesNeural Information Processing Systems (NeurIPS), 2020

261

100

21 Oct 2020

SOrT-ing VQA Models : Contrastive Gradient Learning for Improved ConsistencyNorth American Chapter of the Association for Computational Linguistics (NAACL), 2020

Sameer Dharur

Purva Tendulkar

Dhruv Batra

Devi Parikh

Ramprasaath R. Selvaraju

162

20 Oct 2020

Room-Across-Room: Multilingual Vision-and-Language Navigation with Dense Spatiotemporal Grounding

240

417

15 Oct 2020

Remembering for the Right Reasons: Explanations Reduce Catastrophic ForgettingInternational Conference on Learning Representations (ICLR), 2020

220

04 Oct 2020

Trustworthy Convolutional Neural Networks: A Gradient Penalized-based Approach

Nicholas F Halliwell

Freddy Lecue

FAtt

226

29 Sep 2020

AiR: Attention with Reasoning CapabilityEuropean Conference on Computer Vision (ECCV), 2020

150

28 Jul 2020

Comprehensive Image Captioning via Scene Graph DecompositionEuropean Conference on Computer Vision (ECCV), 2020

Yiwu Zhong

Liwei Wang

Jianshu Chen

Dong Yu

Yin Li

249

138

23 Jul 2020

Reducing Language Biases in Visual Question Answering with Visually-Grounded Question EncoderEuropean Conference on Computer Vision (ECCV), 2020

K. Gouthaman

Anurag Mittal

373

13 Jul 2020

$Improving VQA and its Explanations \\ by Comparing Competing Explanations$

Improving VQA and its Explanations \\ by Comparing Competing Explanations

Jialin Wu

Liyan Chen

Raymond J. Mooney

FAtt AAML

210

28 Jun 2020

Overcoming Statistical Shortcuts for Open-ended Visual Counting

207

17 Jun 2020

Estimating semantic structure for the VQA answer space

180

10 Jun 2020

Roses Are Red, Violets Are Blue... but Should Vqa Expect Them To?

280

09 Jun 2020

Counterfactual VQA: A Cause-Effect Look at Language Bias

537

479

08 Jun 2020

Hierarchical Class-Based Curriculum Loss

Palash Goyal

Shalini Ghosh

124

05 Jun 2020

On the Value of Out-of-Distribution Testing: An Example of Goodhart's Law

240

153

19 May 2020

Learning What Makes a Difference from Counterfactual Examples and Gradient SupervisionEuropean Conference on Computer Vision (ECCV), 2020

223

125

20 Apr 2020

Visual Grounding Methods for VQA are Working for the Wrong Reasons!

306

12 Apr 2020

Egoshots, an ego-vision life-logging dataset and semantic fidelity metric to evaluate diversity in image captioning modelsInternational Conference on Learning Representations (ICLR), 2020

Pranav Agarwal

Alejandro Betancourt

V. Panagiotou

Natalia Díaz Rodríguez

EGVM

223

26 Mar 2020

Counterfactual Samples Synthesizing for Robust Visual Question AnsweringComputer Vision and Pattern Recognition (CVPR), 2020

386

319

14 Mar 2020

Explainable Deep Classification Models for Domain Generalization

168

13 Mar 2020

Cross-modal Learning for Multi-modal Video Categorization

275

07 Mar 2020

Exploiting Temporal Coherence for Multi-modal Video Categorization

132

07 Feb 2020

SQuINTing at VQA Models: Introspecting VQA Models with Sub-Questions

Ramprasaath R. Selvaraju

Devi Parikh

160

20 Jan 2020

Making deep neural networks right for the right scientific reasons by interacting with their explanationsNature Machine Intelligence (NMI), 2020

609

240

15 Jan 2020

Explain and Improve: LRP-Inference Fine-Tuning for Image Captioning ModelsInformation Fusion (Inf. Fusion), 2020

631

04 Jan 2020

Connecting Vision and Language with Localized NarrativesEuropean Conference on Computer Vision (ECCV), 2019

498

287

06 Dec 2019

Bilinear Graph Networks for Visual Question AnsweringIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2019

199

23 Jul 2019

Learning to Generate Grounded Visual Captions without Localization Supervision

394

01 Jun 2019

Self-Critical Reasoning for Robust Visual Question AnsweringNeural Information Processing Systems (NeurIPS), 2019

Jialin Wu

Raymond J. Mooney

OOD NAI

238

170

24 May 2019

VQA with no questions-answers trainingComputer Vision and Pattern Recognition (CVPR), 2018

B. Vatashsky

S. Ullman

238

20 Nov 2018

Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering

Devi Parikh

1.2K

3,820

02 Dec 2016

Grad-CAM: Visual Explanations from Deep Networks via Gradient-based LocalizationInternational Journal of Computer Vision (IJCV), 2016

Ramprasaath R. Selvaraju

Devi Parikh

922

24,286

07 Oct 2016