On the Robustness of Interpretability Methods

21 June 2018

David Alvarez-Melis

Tommi Jaakkola

ArXiv (abs)PDF HTML

Papers citing "On the Robustness of Interpretability Methods"

50 / 302 papers shown

SX-GeoTree: Self-eXplaining Geospatial Regression Tree Incorporating the Spatial Similarity of Feature Attributions

136

25 Nov 2025

Correlation-Aware Feature Attribution Based Explainable AI

123

20 Nov 2025

CID: Measuring Feature Importance Through Counterfactual Distributions

512

19 Nov 2025

Fair and Explainable Credit-Scoring under Concept Drift: Adaptive Explanation Frameworks for Evolving Populations

Shivogo John

FAtt

572

05 Nov 2025

Before the Clinic: Transparent and Operable Design Principles for Healthcare AI

Alexander Bakumenko

Aaron J. Masino

Janine Hoelscher

195

31 Oct 2025

Embedding Explainable AI in NHS Clinical Safety: The Explainability-Enabled Clinical Safety Framework (ECSF)

Robert Gigiu

148

24 Oct 2025

ProtoSiTex: Learning Semi-Interpretable Prototypes for Multi-label Text Classification

207

14 Oct 2025

Beyond single-model XAI: aggregating multi-model explanations for enhanced trustworthiness

13 Oct 2025

o-MEGA: Optimized Methods for Explanation Generation and Analysis

224

30 Sep 2025

On The Variability of Concept Activation Vectors

Julia Wenkmann

Damien Garreau

AAML

153

28 Sep 2025

Evaluating the stability of model explanations in instance-dependent cost-sensitive credit scoringEuropean Journal of Operational Research (EJOR), 2025

215

01 Sep 2025

How can we trust opaque systems? Criteria for robust explanations in XAI

Florian J. Boge

Annika Schuster

AAML

164

18 Aug 2025

On Spectral Properties of Gradient-based Explanation MethodsEuropean Conference on Computer Vision (ECCV), 2025

191

14 Aug 2025

Beyond Technocratic XAI: The Who, What & How in Explanation Design

213

12 Aug 2025

OrdShap: Feature Position Importance for Sequential Black-Box Models

362

16 Jul 2025

TriGuard: Testing Model Safety with Attribution Entropy, Verification, and Drift

230

17 Jun 2025

Rethinking Explainability in the Era of Multimodal AI

Chirag Agarwal

304

16 Jun 2025

Local MDI+: Local Feature Importances for Tree-Based Models

Zhongyuan Liang

Zachary T. Rewolinski

Abhineet Agarwal

Tiffany M. Tang

Bin Yu

195

10 Jun 2025

XAI-Units: Benchmarking Explainability Methods with Unit TestsConference on Fairness, Accountability and Transparency (FAccT), 2025

Jun Rui Lee

Sadegh Emami

Michael David Hollins

Timothy C. H. Wong

Carlos Ignacio Villalobos Sánchez

Francesca Toni

Dekai Zhang

Adam Dejl

235

01 Jun 2025

A Necessary Step toward Faithfulness: Measuring and Improving Consistency in Free-Text Explanations

Lingjun Zhao

Hal Daumé III

479

25 May 2025

Fixed Point Explainability

475

18 May 2025

Enhanced Photonic Chip Design via Interpretable Machine Learning Techniques

397

14 May 2025

Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc MethodsConference on Fairness, Accountability and Transparency (FAccT), 2025

400

02 May 2025

Explanations Go Linear: Post-hoc Explainability for Tabular Data with Interpretable Meta-Encoding

1.2K

29 Apr 2025

Are We Merely Justifying Results ex Post Facto? Quantifying Explanatory Inversion in Post-Hoc Model Explanations

384

11 Apr 2025

Axiomatic Explainer Globalness via Optimal TransportInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024

555

13 Mar 2025

Counterfactual Explanations for Model Ensembles Using Entropic Risk MeasuresAdaptive Agents and Multi-Agent Systems (AAMAS), 2025

300

11 Mar 2025

Show Me the Work: Fact-Checkers' Requirements for Explainable Automated Fact-CheckingInternational Conference on Human Factors in Computing Systems (CHI), 2025

876

13 Feb 2025

Feature Importance Depends on Properties of the Data: Towards Choosing the Correct Explanations for Your Data and Decision Trees based Models

451

11 Feb 2025

The Effect of Similarity Measures on Accurate Stability Estimates for Local Surrogate Models in Text-based Explainable AI

421

20 Jan 2025

Towards Robust and Accurate Stability Estimation of Local Surrogate Models in Text-based Explainable AI

308

03 Jan 2025

Q-LIME

π

: A Quantum-Inspired Extension to LIME

Nelson Colón Vargas

FAtt

248

23 Dec 2024

Advancing Attribution-Based Neural Network Explainability through Relative Absolute Magnitude Layer-Wise Relevance Propagation and Multi-Component EvaluationACM Transactions on Intelligent Systems and Technology (ACM TIST), 2024

303

12 Dec 2024

A Unified Framework for Evaluating the Effectiveness and Enhancing the Transparency of Explainable AI Methods in Real-World Applications

285

05 Dec 2024

Establishing and Evaluating Trustworthy AI: Overview and Research Challenges

Dominik Kowald

S. Scher

Viktoria Pammer-Schindler

...

293

15 Nov 2024

Benchmarking XAI Explanations with Human-Aligned Evaluations

...

511

04 Nov 2024

Transparent Trade-offs between Properties of ExplanationsConference on Uncertainty in Artificial Intelligence (UAI), 2024

448

31 Oct 2024

Prototype-Based Methods in Explainable AI and Emerging Opportunities in the Geosciences

Anushka Narayanan

Karianne J. Bergen

356

22 Oct 2024

A mechanistically interpretable neural network for regulatory genomics

173

08 Oct 2024

Faithfulness and the Notion of Adversarial Sensitivity in NLP ExplanationsBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2024

Supriya Manna

Niladri Sett

AAML

398

26 Sep 2024

A Fuzzy-based Approach to Predict Human Interaction by Functional Near-Infrared SpectroscopyIEEE transactions on fuzzy systems (IEEE Trans. Fuzzy Syst.), 2024

Xiaowei Jiang

Chin-Teng Lin

326

26 Sep 2024

The FIX Benchmark: Extracting Features Interpretable to eXperts

...

445

20 Sep 2024

Aligning Judgment Using Task Context and Explanations to Improve Human-Recommender System Performance

Divya K. Srivastava

Karen Feigh

161

16 Sep 2024

Beyond Model Interpretability: Socio-Structural Explanations in Machine LearningAi & Society (AS), 2024

Andrew Smart

Atoosa Kasirzadeh

318

05 Sep 2024

Evaluating Explainable AI Methods in Deep Learning Models for Early Detection of Cerebral PalsyIEEE Access (IEEE Access), 2024

Espen Alexander F. Ihlen

198

14 Aug 2024

More Questions than Answers? Lessons from Integrating Explainable AI into a Cyber-AI Tool

201

08 Aug 2024

BEExAI: Benchmark to Evaluate Explainable AI

Samuel Sithakoul

Sara Meftah

Clément Feutry

432

29 Jul 2024

Revisiting the robustness of post-hoc interpretability methods

477

29 Jul 2024

Auditing Local Explanations is Hard

Robi Bhattacharjee

U. V. Luxburg

LRM MLAU FAtt

318

18 Jul 2024

Robustness of Explainable Artificial Intelligence in Industrial Process Modelling

255

12 Jul 2024