2204.11073
Grad-SAM: Explaining Transformers via Gradient Self-Attention Maps
23 April 2022
Oren Barkan
Edan Hauon
Avi Caciularu
Ori Katz
Itzik Malkiel
Omri Armstrong
Noam Koenigstein
Papers citing
"Grad-SAM: Explaining Transformers via Gradient Self-Attention Maps"
24 / 24 papers shown
Reasoning-Grounded Natural Language Explanations for Language Models
Vojtech Cahlik
Rodrigo Alves
Pavel Kordík
LRM
53
1
0
14 Mar 2025
Generalized Attention Flow: Feature Attribution for Transformer Models via Maximum Flow
Behrooz Azarkhalili
Maxwell Libbrecht
39
0
0
14 Feb 2025
Evaluating the Effectiveness of XAI Techniques for Encoder-Based Language Models
Melkamu Mersha
Mesay Gemeda Yigezu
Jugal Kalita
ELM
51
3
0
26 Jan 2025
Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers
Lam Nguyen Tung
Steven Cho
Xiaoning Du
Neelofar Neelofar
Valerio Terragni
Stefano Ruberto
Aldeida Aleti
148
2
0
30 Oct 2024
Enhancing Conditional Image Generation with Explainable Latent Space Manipulation
Kshitij Pathania
DiffM
25
0
0
29 Aug 2024
Explainable Generative AI (GenXAI): A Survey, Conceptualization, and Research Agenda
Johannes Schneider
83
26
0
15 Apr 2024
ASAP: Interpretable Analysis and Summarization of AI-generated Image Patterns at Scale
Jinbin Huang
Cheng Chen
Aditi Mishra
Bum Chul Kwon
Zhicheng Liu
Chris Bryan
47
4
0
03 Apr 2024
Token Transformation Matters: Towards Faithful Post-hoc Explanation for Vision Transformer
Junyi Wu
Bin Duan
Weitai Kang
Hao Tang
Yan Yan
36
6
0
21 Mar 2024
From Understanding to Utilization: A Survey on Explainability for Large Language Models
Haoyan Luo
Lucia Specia
56
20
0
23 Jan 2024
Better Explain Transformers by Illuminating Important Information
Linxin Song
Yan Cui
Ao Luo
Freddy Lecue
Irene Z Li
FAtt
28
1
0
18 Jan 2024
Explainability of Vision Transformers: A Comprehensive Review and New Perspectives
Rojina Kashefi
Leili Barekatain
Mohammad Sabokrou
Fatemeh Aghaeipoor
ViT
37
9
0
12 Nov 2023
Visual Explanations via Iterated Integrated Attributions
Oren Barkan
Yehonatan Elisha
Yuval Asher
Amit Eshel
Noam Koenigstein
FAtt
XAI
28
18
0
28 Oct 2023
Learning to Explain: A Model-Agnostic Framework for Explaining Black Box Models
Oren Barkan
Yuval Asher
Amit Eshel
Yehonatan Elisha
Noam Koenigstein
35
5
0
25 Oct 2023
Deep Integrated Explanations
Oren Barkan
Yehonatan Elisha
Jonathan Weill
Yuval Asher
Amit Eshel
Noam Koenigstein
FAtt
41
7
0
23 Oct 2023
From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning
Xuansheng Wu
Wenlin Yao
Jianshu Chen
Xiaoman Pan
Xiaoyang Wang
Ninghao Liu
Dong Yu
LRM
20
26
0
30 Sep 2023
Interpretability-Aware Vision Transformer
Yao Qiang
Chengyin Li
Prashant Khanduri
D. Zhu
ViT
82
7
0
14 Sep 2023
Explainability for Large Language Models: A Survey
Haiyan Zhao
Hanjie Chen
Fan Yang
Ninghao Liu
Huiqi Deng
Hengyi Cai
Shuaiqiang Wang
Dawei Yin
Jundong Li
LRM
26
409
0
02 Sep 2023
B-cos Alignment for Inherently Interpretable CNNs and Vision Transformers
Moritz D Boehle
Navdeeppal Singh
Mario Fritz
Bernt Schiele
56
27
0
19 Jun 2023
Weakly Supervised Intracranial Hemorrhage Segmentation using Head-Wise Gradient-Infused Self-Attention Maps from a Swin Transformer in Categorical Learning
Amir Rasoulian
Soorena Salari
Yiming Xiao
14
8
0
11 Apr 2023
Holistically Explainable Vision Transformers
Moritz D Boehle
Mario Fritz
Bernt Schiele
ViT
35
9
0
20 Jan 2023
Interpreting BERT-based Text Similarity via Activation and Saliency Maps
Itzik Malkiel
Dvir Ginzburg
Oren Barkan
Avi Caciularu
Jonathan Weill
Noam Koenigstein
30
20
0
13 Aug 2022
GAM: Explainable Visual Similarity and Classification via Gradient Activation Maps
Oren Barkan
Omri Armstrong
Amir Hertz
Avi Caciularu
Ori Katz
Itzik Malkiel
Noam Koenigstein
GAN
FAtt
24
11
0
02 Sep 2021
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
297
6,959
0
20 Apr 2018
Bayesian Neural Word Embedding
Oren Barkan
BDL
131
87
0
21 Mar 2016