Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1908.04626
Cited By
v1
v2 (latest)
Attention is not not Explanation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
13 August 2019
Sarah Wiegreffe
Yuval Pinter
XAI
AAML
FAtt
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Attention is not not Explanation"
50 / 559 papers shown
Malicious Image Analysis via Vision-Language Segmentation Fusion: Detection, Element, and Location in One-shot
Sheng Hang
Chaoxiang He
Hongsheng Hu
Hanqing Hu
B. Zhu
Shi-Feng Sun
Dawu Gu
Shuo Wang
189
0
0
04 Dec 2025
A Self-explainable Model of Long Time Series by Extracting Informative Structured Causal Patterns
Ziqian Wang
Yuxiao Cheng
J. Suo
AI4TS
CML
BDL
213
0
0
01 Dec 2025
Graphing the Truth: Structured Visualizations for Automated Hallucination Detection in LLMs
Tanmay Agrawal
HILM
358
0
0
29 Nov 2025
Quantifying Modality Contributions via Disentangling Multimodal Representations
Padegal Amit
Omkar Mahesh Kashyap
Namitha Rayasam
Nidhi Shekhar
Surabhi Narayan
149
0
0
22 Nov 2025
CID: Measuring Feature Importance Through Counterfactual Distributions
Eddie Conti
Álvaro Parafita
Axel Brando
FAtt
CML
513
0
0
19 Nov 2025
Order-Level Attention Similarity Across Language Models: A Latent Commonality
Jinglin Liang
Jin Zhong
Shuangping Huang
Yunqing Hu
Huiyuan Zhang
Huifang Li
Lixin Fan
Hanlin Gu
166
1
0
07 Nov 2025
A Dual-Use Framework for Clinical Gait Analysis: Attention-Based Sensor Optimization and Automated Dataset Auditing
Hamidreza Sadeghsalehi
123
0
0
03 Nov 2025
A Video Is Not Worth a Thousand Words
Sam Pollard
Michael Wray
139
0
0
27 Oct 2025
When LRP Diverges from Leave-One-Out in Transformers
Weiqiu You
Siqi Zeng
Yao-Hung Hubert Tsai
Makoto Yamada
Han Zhao
184
0
0
21 Oct 2025
EEGChaT: A Transformer-Based Modular Channel Selector for SEEG Analysis
Chen Wang
Y. Wang
Dongqi Han
Zilong Wang
Dongsheng Li
115
0
0
15 Oct 2025
Discursive Circuits: How Do Language Models Understand Discourse Relations?
Yisong Miao
Min-Yen Kan
177
4
0
13 Oct 2025
Everyone prefers human writers, including AI
Wouter Haverals
Meredith Martin
145
2
0
09 Oct 2025
Introspection in Learned Semantic Scene Graph Localisation
Manshika Charvi Bissessur
Efimia Panagiotaki
Daniele De Martini
SSL
252
0
0
08 Oct 2025
DecEx-RAG: Boosting Agentic Retrieval-Augmented Generation with Decision and Execution Optimization via Process Supervision
Yongqi Leng
Yikun Lei
Xikai Liu
M. Zhong
Bojian Xiong
Y. Zhang
Yan Gao
Yi-Chen Wu
Yao Hu
Deyi Xiong
125
0
0
07 Oct 2025
There is More to Attention: Statistical Filtering Enhances Explanations in Vision Transformers
Meghna P. Ayyar
Jenny Benois-Pineau
A. Zemmari
211
1
0
07 Oct 2025
Evaluation Framework for Highlight Explanations of Context Utilisation in Language Models
Jingyi Sun
Pepa Atanasova
Sagnik Ray Choudhury
Sekh Mainul Islam
Isabelle Augenstein
222
0
0
03 Oct 2025
AttentionDep: Domain-Aware Attention for Explainable Depression Severity Assessment
Yusif Ibrahimov
Tarique Anwar
Tommy Yuan
Turan Mutallimov
Elgun Hasanov
102
0
0
01 Oct 2025
Analyzing Latent Concepts in Code Language Models
Arushi Sharma
Vedant Pungliya
Christopher Quinn
Ali Jannesari
345
1
0
01 Oct 2025
DeepProv: Behavioral Characterization and Repair of Neural Networks via Inference Provenance Graph Analysis
Firas Ben Hmida
Abderrahmen Amich
Ata Kaboudi
Birhanu Eshete
AAML
GNN
229
0
0
30 Sep 2025
Sparse Autoencoders Make Audio Foundation Models more Explainable
Théo Mariotte
Martin Lebourdais
Antonio Almudévar
Marie Tahon
Alfonso Ortega
Nicolas Dugué
152
1
0
29 Sep 2025
TDHook: A Lightweight Framework for Interpretability
Yoann Poupart
AI4CE
190
0
0
29 Sep 2025
Explaining Fine Tuned LLMs via Counterfactuals A Knowledge Graph Driven Framework
Y Samuel Wang
Ziyang Chen
Md Faisal Kabir
OffRL
229
1
0
25 Sep 2025
AIBA: Attention-based Instrument Band Alignment for Text-to-Audio Diffusion
Junyoung Koh
Soo Yong Kim
Gyu Hyeong Choi
Yongwon Choi
262
0
0
25 Sep 2025
When Long Helps Short: How Context Length in Supervised Fine-tuning Affects Behavior of Large Language Models
Yingming Zheng
Hanqi Li
Kai Yu
Lu Chen
302
0
0
23 Sep 2025
Cross-Attention is Half Explanation in Speech-to-Text Models
Sara Papi
Dennis Fucci
Marco Gaido
Matteo Negri
L. Bentivogli
LRM
226
1
0
22 Sep 2025
ConceptViz: A Visual Analytics Approach for Exploring Concepts in Large Language Models
Xue Yang
Zhen Wen
Qiqi Jiang
Chenxiao Li
Yuwei Wu
Y. Yang
Yiyao Wang
Xiuqi Huang
Minfeng Zhu
Wei Chen
192
1
0
20 Sep 2025
Subject Matter Expertise vs Professional Management in Collective Sequential Decision Making
David Shoresh
Yonatan Loewenstein
65
0
0
18 Sep 2025
Copycat vs. Original: Multi-modal Pretraining and Variable Importance in Box-office Prediction
Qin Chao
Eunsoo Kim
Boyang Albert Li
172
0
0
18 Sep 2025
ORACLE: Explaining Feature Interactions in Neural Networks with ANOVA
Dongseok Kim
Wonjun Jeong
Mohamed Jismy Aashik Rasool
Gisung Oh
276
0
0
13 Sep 2025
Whisper Has an Internal Word Aligner
Sung-Lin Yeh
Yen Meng
Hao Tang
166
1
0
12 Sep 2025
An Autoencoder and Vision Transformer-based Interpretability Analysis of the Differences in Automated Staging of Second and Third Molars
Barkin Buyukcakir
Jannick De Tobel
Patrick Thevissen
Dirk Vandermeulen
P. Claes
MedIm
207
0
0
12 Sep 2025
Do LLMs Adhere to Label Definitions? Examining Their Receptivity to External Label Definitions
Seyedali Mohammadi
Bhaskara Hanuma Vedula
Hemank Lamba
Edward Raff
Ponnurangam Kumaraguru
Francis Ferraro
Manas Gaur
221
1
0
02 Sep 2025
MindGuard: Intrinsic Decision Inspection for Securing LLM Agents Against Metadata Poisoning
Zhiqiang Wang
Junyang Zhang
Guanquan Shi
Haoran Cheng
Yunhao Yao
Kaiwen Guo
Haohua Du
Xiang-Yang Li
400
5
0
28 Aug 2025
MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs
Sixun Dong
Juhua Hu
Mian Zhang
Ming Yin
Yanjie Fu
Qi Qian
177
6
0
25 Aug 2025
On the effectiveness of multimodal privileged knowledge distillation in two vision transformer based diagnostic applications
Simon Baur
Alexandra Benova
Emilio Dolgener Cantú
Jackie Ma
MedIm
122
1
0
06 Aug 2025
User Perception of Attention Visualizations: Effects on Interpretability Across Evidence-Based Medical Documents
Andrés Carvallo
Denis Parra
Peter Brusilovsky
Hernan Valdivieso
G. Rada
Ivania Donoso
Vladimir Araujo
166
1
0
05 Aug 2025
AttnTrace: Attention-based Context Traceback for Long-Context LLMs
Yanting Wang
Runpeng Geng
Ying Chen
Jinyuan Jia
LLMAG
258
2
1
05 Aug 2025
Distilling Knowledge from Large Language Models: A Concept Bottleneck Model for Hate and Counter Speech Recognition
Roberto Labadie-Tamayo
D. Slijepcevic
Xihui Chen
Adrian Jaques Böck
Andreas Babic
Liz Freimann
Christiane Atzmüller Matthias Zeppelzauer
156
2
0
30 Jul 2025
Contrast-CAT: Contrasting Activations for Enhanced Interpretability in Transformer-based Text Classifiers
Conference on Uncertainty in Artificial Intelligence (UAI), 2025
Sungmin Han
Jeonghyun Lee
Sangkyun Lee
301
2
0
27 Jul 2025
Interpretable Open-Vocabulary Referring Object Detection with Reverse Contrast Attention
Drandreb Earl O. Juanico
Rowel O. Atienza
Jeffrey Kenneth Go
ObjD
316
0
0
26 Jul 2025
SFT-GO: Supervised Fine-Tuning with Group Optimization for Large Language Models
Gyuhak Kim
Sumiran Thakur
Su Min Park
Wei Wei
Yujia Bao
196
3
0
17 Jun 2025
Rethinking Explainability in the Era of Multimodal AI
Chirag Agarwal
304
3
0
16 Jun 2025
Towards Large Language Models with Self-Consistent Natural Language Explanations
Sahar Admoni
Ofra Amir
Assaf Hallak
Yftah Ziser
LRM
225
2
0
09 Jun 2025
Interpretation Meets Safety: A Survey on Interpretation Methods and Tools for Improving LLM Safety
Seongmin Lee
Aeree Cho
Grace C. Kim
ShengYun Peng
Mansi Phute
Duen Horng Chau
LM&MA
AI4CE
385
6
0
05 Jun 2025
Interpretable phenotyping of Heart Failure patients with Dutch discharge letters
Vittorio Torri
Machteld J. Boonstra
Marielle C. van de Veerdonk
Deborah N. Kalkman
Alicia Uijl
Francesca Ieva
Ameen Abu-Hanna
Folkert W. Asselbergs
Iacer Calixto
187
0
0
30 May 2025
From Directions to Cones: Exploring Multidimensional Representations of Propositional Facts in LLMs
Stanley Yu
Vaidehi Bulusu
Oscar Yasunaga
Clayton Lau
Cole Blondin
Sean O'Brien
Kevin Zhu
Sean O Brien
247
2
0
27 May 2025
LayerIF: Estimating Layer Quality for Large Language Models using Influence Functions
Hadi Askari
Shivanshu Gupta
Fei Wang
Anshuman Chhabra
Muhao Chen
TDI
471
8
0
27 May 2025
SCAR: Shapley Credit Assignment for More Efficient RLHF
Meng Cao
Shuyuan Zhang
Xiao-Wen Chang
Doina Precup
453
9
0
26 May 2025
ExPLAIND: Unifying Model, Data, and Training Attribution to Study Model Behavior
Florian Eichin
Yupei Du
Philipp Mondorf
Maria Matveev
Barbara Plank
Michael A. Hedderich
FAtt
534
0
0
26 May 2025
Advancing the Scientific Method with Large Language Models: From Hypothesis to Discovery
Yanbo Zhang
S. Khan
Adnan Mahmud
Huck Yang
Alexander Lavin
...
James A. Evans
Alan R. Bundy
Jannis Brugger
Jesper Tegner
Hector Zenil
LM&MA
453
7
0
22 May 2025
1
2
3
4
...
10
11
12
Next
Page 1 of 12
Page
of 12
Go