1902.03751
Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded
IEEE International Conference on Computer Vision (ICCV), 2019
11 February 2019
Ramprasaath R. Selvaraju
Stefan Lee
Yilin Shen
Hongxia Jin
Shalini Ghosh
Larry Heck
Dhruv Batra
Devi Parikh
FAtt
VLM
Papers citing
"Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded"
50 / 149 papers shown
Title
OAD-Promoter: Enhancing Zero-shot VQA using Large Language Models with Object Attribute Description
Quanxing Xu
Ling Zhou
Feifei Zhang
Jinyu Tian
Rubing Huang
VLM
120
0
0
15 Nov 2025
Onto-Epistemological Analysis of AI Explanations
Martina Mattioli
Eike Petersen
Aasa Feragen
Marcello Pelillo
Siavash Bigdeli
176
0
0
03 Oct 2025
Resolving Ambiguity in Gaze-Facilitated Visual Assistant Interaction Paradigm
Zeyu Wang
Baiyu Chen
Kun Yan
Hongjing Piao
Hao Xue
Flora D. Salim
Yuanchun Shi
Yuntao Wang
76
0
0
26 Sep 2025
Integrating Object Interaction Self-Attention and GAN-Based Debiasing for Visual Question Answering
Zhifei Li
Feng Qiu
Yiran Wang
Yujing Xia
Kui Xiao
Miao Zhang
Yan Zhang
120
0
0
25 Sep 2025
Towards trustworthy AI in materials mechanics through domain-guided attention
Jesco Talies
Eric Breitbarth
D. Melching
OOD
79
0
0
28 Jul 2025
Model Guidance via Robust Feature Attribution
Mihnea Ghitu
Vihari Piratla
Matthew Wicker
AAML
165
0
0
24 Jun 2025
GLIMPSE: Holistic Cross-Modal Explainability for Large Vision-Language Models
Guanxi Shen
132
0
0
23 Jun 2025
QIRL: Boosting Visual Question Answering via Optimized Question-Image Relation Learning
Quanxing Xu
Ling Zhou
Zhuo Zhou
Feifei Zhang
Rubing Huang
Chia-Wen Lin
134
0
0
04 Apr 2025
Language Guided Concept Bottleneck Models for Interpretable Continual Learning
Computer Vision and Pattern Recognition (CVPR), 2025
Lu Yu
Haoyu Han
Zhe Tao
Hantao Yao
Changsheng Xu
CLL
219
9
0
30 Mar 2025
Overcoming Language Priors for Visual Question Answering Based on Knowledge Distillation
IEEE International Conference on Multimedia and Expo (ICME), 2024
Daowan Peng
Wei Wei
871
2
0
10 Jan 2025
Task Progressive Curriculum Learning for Robust Visual Question Answering
Ahmed Akl
Abdelwahed Khamis
Zhe Wang
Ali Cheraghian
Sara Khalifa
Kewen Wang
OOD
230
0
0
26 Nov 2024
Improving Medical Diagnostics with Vision-Language Models: Convex Hull-Based Uncertainty Analysis
Ferhat Ozgur Catak
Murat Kuzlu
Taylor Patrick
252
2
0
24 Nov 2024
LibraGrad: Balancing Gradient Flow for Universally Better Vision Transformer Attributions
Computer Vision and Pattern Recognition (CVPR), 2024
Faridoun Mehri
Mahdieh Soleymani Baghshah
Mohammad Taher Pilehvar
276
1
0
24 Nov 2024
The Master-Slave Encoder Model for Improving Patent Text Summarization: A New Approach to Combining Specifications and Claims
Shu Zhou
Xin Wang
Zhengda Zhou
Haohan Yi
Xuhui Zheng
Hao Wan
229
2
0
21 Nov 2024
A Comprehensive Survey on Visual Question Answering Datasets and Algorithms
Raihan Kabir
Naznin Haque
Md. Saiful Islam
Marium-E. Jannat
CoGe
229
8
0
17 Nov 2024
Model Debiasing by Learnable Data Augmentation
Pietro Morerio
R. Ragonesi
Vittorio Murino
177
1
0
09 Aug 2024
Unveiling and Mitigating Bias in Audio Visual Segmentation
Peiwen Sun
Honggang Zhang
Di Hu
164
10
0
23 Jul 2024
Benchmarking the Attribution Quality of Vision Models
Robin Hesse
Simone Schaub-Meyer
Stefan Roth
FAtt
308
4
0
16 Jul 2024
A look under the hood of the Interactive Deep Learning Enterprise (No-IDLE)
Daniel Sonntag
Michael Barz
Thiago S. Gouvêa
VLM
216
6
0
27 Jun 2024
On the Role of Visual Grounding in VQA
Daniel Reich
Tanja Schultz
173
2
0
26 Jun 2024
FocusCLIP: Multimodal Subject-Level Guidance for Zero-Shot Transfer in Human-Centric Tasks
Muhammad Gul Zain Ali Khan
Muhammad Ferjad Naeem
F. Tombari
Luc Van Gool
Didier Stricker
Muhammad Zeshan Afzal
VLM
CLIP
161
0
0
11 Mar 2024
Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training
David Wan
Jaemin Cho
Elias Stengel-Eskin
Mohit Bansal
VLM
ObjD
259
48
0
04 Mar 2024
Right on Time: Revising Time Series Models by Constraining their Explanations
Maurice Kraus
David Steinmann
Antonia Wüst
Andre Kokozinski
Kristian Kersting
AI4TS
351
6
0
20 Feb 2024
Uncovering the Full Potential of Visual Grounding Methods in VQA
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Daniel Reich
Tanja Schultz
229
7
0
15 Jan 2024
Object Attribute Matters in Visual Question Answering
Peize Li
Q. Si
Peng Fu
Zheng Lin
Yan Wang
205
0
0
20 Dec 2023
Debiasing Multimodal Models via Causal Information Minimization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Vaidehi Patil
A. Maharana
Mohit Bansal
CML
199
3
0
28 Nov 2023
Learning by Self-Explaining
Wolfgang Stammer
Felix Friedrich
David Steinmann
Manuel Brack
Hikaru Shindo
Kristian Kersting
375
15
0
15 Sep 2023
Distance-Aware eXplanation Based Learning
IEEE International Conference on Tools with Artificial Intelligence (ICTAI), 2023
Misgina Tsighe Hagos
Niamh Belton
Kathleen M. Curran
Brian Mac Namee
FAtt
149
1
0
11 Sep 2023
Interpretable Visual Question Answering via Reasoning Supervision
IEEE International Conference on Image Processing (ICIP), 2023
Maria Parelli
Dimitrios Mallis
Markos Diomataris
Vassilis Pitsikalis
LRM
239
5
0
07 Sep 2023
A Joint Study of Phrase Grounding and Task Performance in Vision and Language Models
Noriyuki Kojima
Hadar Averbuch-Elor
Yoav Artzi
235
2
0
06 Sep 2023
Interpretability Benchmark for Evaluating Spatial Misalignment of Prototypical Parts Explanations
AAAI Conference on Artificial Intelligence (AAAI), 2023
Mikolaj Sacha
Bartosz Jura
Dawid Rymarczyk
Lukasz Struski
Jacek Tabor
Bartosz Zieliński
138
23
0
16 Aug 2023
Making the V in Text-VQA Matter
Shamanthak Hegde
Soumya Jahagirdar
Shankar Gangisetty
CoGe
161
4
0
01 Aug 2023
Robust Visual Question Answering: Datasets, Methods, and Future Challenges
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jie Ma
Pinghui Wang
Dechen Kong
Zewei Wang
Jun Liu
Hongbin Pei
Junzhou Zhao
OOD
271
42
0
21 Jul 2023
Learning from Exemplary Explanations
Misgina Tsighe Hagos
Kathleen M. Curran
Brian Mac Namee
FAtt
185
1
0
12 Jul 2023
Multimodal Explainable Artificial Intelligence: A Comprehensive Review of Methodological Advances and Future Research Directions
IEEE Access (IEEE Access), 2023
N. Rodis
Christos Sardianos
Panagiotis I. Radoglou-Grammatikis
Panagiotis G. Sarigiannidis
Iraklis Varlamis
Georgios Th. Papadopoulos
249
36
0
09 Jun 2023
Unveiling Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA
Ali Vosoughi
Shijian Deng
Songyang Zhang
Yapeng Tian
Chenliang Xu
Jiebo Luo
CML
168
3
0
31 May 2023
Measuring Faithful and Plausible Visual Grounding in VQA
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Daniel Reich
F. Putze
Tanja Schultz
190
6
0
24 May 2023
An Empirical Study on the Language Modal in Visual Question Answering
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Daowan Peng
Wei Wei
Xian-Ling Mao
Yuanyuan Fu
Dangyang Chen
192
5
0
17 May 2023
Adaptive loose optimization for robust question answering
Jie Ma
Pinghui Wang
Ze-you Wang
Dechen Kong
Min Hu
Tingxu Han
Jun Liu
OOD
345
4
0
06 May 2023
Human Attention-Guided Explainable Artificial Intelligence for Computer Vision Models
Neural Networks (Neural Netw.), 2023
Guoyang Liu
Jindi Zhang
Antoni B. Chan
J. H. Hsiao
205
32
0
05 May 2023
One Explanation Does Not Fit XIL
Felix Friedrich
David Steinmann
Kristian Kersting
LRM
194
3
0
14 Apr 2023
Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasoning
Shi Chen
Qi Zhao
148
8
0
18 Mar 2023
ICICLE: Interpretable Class Incremental Continual Learning
IEEE International Conference on Computer Vision (ICCV), 2023
Dawid Rymarczyk
Joost van de Weijer
Bartosz Zieliński
Bartlomiej Twardowski
CLL
211
30
0
14 Mar 2023
Use Perturbations when Learning from Explanations
Neural Information Processing Systems (NeurIPS), 2023
Juyeon Heo
Vihari Piratla
Matthew Wicker
Adrian Weller
AAML
168
2
0
11 Mar 2023
IFAN: An Explainability-Focused Interaction Framework for Humans and NLP Models
International Joint Conference on Natural Language Processing (IJCNLP), 2023
Edoardo Mosca
Daryna Dementieva
Tohid Ebrahim Ajdari
Maximilian Kummeth
Kirill Gringauz
Yutong Zhou
Georg Groh
212
12
0
06 Mar 2023
Learning to Agree on Vision Attention for Visual Commonsense Reasoning
IEEE Transactions on Multimedia (IEEE TMM), 2023
Zhenyang Li
Yangyang Guo
Ke-Jyun Wang
Fan Liu
Liqiang Nie
Mohan S. Kankanhalli
222
12
0
04 Feb 2023
ProtoSeg: Interpretable Semantic Segmentation with Prototypical Parts
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Mikolaj Sacha
Dawid Rymarczyk
Lukasz Struski
Jacek Tabor
Bartosz Zieliński
VLM
263
41
0
28 Jan 2023
Explaining Cross-Domain Recognition with Interpretable Deep Classifier
Yiheng Zhang
Ting Yao
Zhaofan Qiu
Tao Mei
OOD
170
3
0
15 Nov 2022
Visually Grounded VQA by Lattice-based Retrieval
Daniel Reich
F. Putze
Tanja Schultz
151
2
0
15 Nov 2022
Prophet Attention: Predicting Attention with Future Attention for Image Captioning
Neural Information Processing Systems (NeurIPS), 2022
Fenglin Liu
Xuancheng Ren
Xian Wu
Wei Fan
Yuexian Zou
Xu Sun
192
50
0
19 Oct 2022