Interpretation of Neural Networks is Fragile

AAAI Conference on Artificial Intelligence (AAAI), 2017

29 October 2017

Papers citing "Interpretation of Neural Networks is Fragile"

50 / 489 papers shown

DeepAID: Interpreting and Improving Deep Learning-based Anomaly Detection in Security Applications

Conference on Computer and Communications Security (CCS), 2021

Dongqi Han

166

105

23 Sep 2021

Ranking Feature-Block Importance in Artificial Multiblock Neural Networks

133

21 Sep 2021

FUTURE-AI: Guiding Principles and Consensus Recommendations for Trustworthy Artificial Intelligence in Medical Imaging

...

Nickolas Papanikolaou

358

20 Sep 2021

Self-learn to Explain Siamese Networks Robustly

163

15 Sep 2021

Rationales for Sequential Predictions

218

14 Sep 2021

Logic Traps in Evaluating Attribution Scores

Yuanzhe Zhang

Jun Zhao

276

12 Sep 2021

EG-Booster: Explanation-Guided Booster of ML Evasion Attacks

Conference on Data and Application Security and Privacy (CODASPY), 2021

Abderrahmen Amich

Birhanu Eshete

AAML

146

31 Aug 2021

Enjoy the Salience: Towards Better Transformer-based Faithful Explanations with Word Salience

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

G. Chrysostomou

Nikolaos Aletras

197

31 Aug 2021

Finding Representative Interpretations on Convolutional Neural Networks

IEEE International Conference on Computer Vision (ICCV), 2021

Yong Zhang

175

13 Aug 2021

Jujutsu: A Two-stage Defense against Adversarial Patch Attacks on Deep Neural Networks

ACM Asia Conference on Computer and Communications Security (AsiaCCS), 2021

339

11 Aug 2021

Perturbing Inputs for Fragile Interpretations in Deep Natural Language Processing

BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2021

249

11 Aug 2021

Harnessing value from data science in business: ensuring explainability and fairness of solutions

Krzysztof Chomiak

Michal Miktus

10 Aug 2021

Explainable AI and susceptibility to adversarial attacks: a case study in classification of breast ultrasound images

IUS (IUS), 2021

Hamza Rasaee

H. Rivaz

AAML

09 Aug 2021

Jointly Attacking Graph Neural Network and its Explanations

IEEE International Conference on Data Engineering (ICDE), 2021

246

07 Aug 2021

Resisting Out-of-Distribution Data Problem in Perturbation of XAI

235

27 Jul 2021

Robust Explainability: A Tutorial on Gradient-Based Attribution Methods for Deep Neural Networks

IEEE Signal Processing Magazine (IEEE SPM), 2021

359

101

23 Jul 2021

Trustworthy AI: A Computational Perspective

Xiaorui Liu

399

256

12 Jul 2021

Robust Counterfactual Explanations on Graph Neural Networks

Neural Information Processing Systems (NeurIPS), 2021

Yong Zhang

406

114

08 Jul 2021

When and How to Fool Explainable Models (and Humans) with Adversarial Examples

261

05 Jul 2021

Certifiably Robust Interpretation via Renyi Differential Privacy

Chuang Gan

143

04 Jul 2021

Explanation-Guided Diagnosis of Machine Learning Evasion Attacks

Security and Privacy in Communication Networks (SecureComm), 2021

Abderrahmen Amich

Birhanu Eshete

AAML

115

30 Jun 2021

On Locality of Local Explanation Models

Sahra Ghalebikesabi

Lucile Ter-Minassian

Karla Diaz-Ordaz

Chris Holmes

FedML FAtt

155

24 Jun 2021

Guided Integrated Gradients: An Adaptive Path Method for Removing Noise

Computer Vision and Pattern Recognition (CVPR), 2021

A. Kapishnikov

Subhashini Venugopalan

263

123

17 Jun 2021

Best of both worlds: local and global explanations with human-understandable concepts

Alan Karthikesalingam

Been Kim

FAtt

226

16 Jun 2021

S-LIME: Stabilized-LIME for Model Explanation

Knowledge Discovery and Data Mining (KDD), 2021

253

113

15 Jun 2021

On the Lack of Robust Interpretability of Neural Text Classifiers

Findings (Findings), 2021

119

08 Jun 2021

3DB: A Framework for Debugging Computer Vision Models

Neural Information Processing Systems (NeurIPS), 2021

...

240

07 Jun 2021

Evaluating Local Explanations using White-box Models

Amir Hossein Akhavan Rahnama

202

04 Jun 2021

DISSECT: Disentangled Simultaneous Explanations via Concept Traversals

International Conference on Learning Representations (ICLR), 2021

Chun-Liang Li

328

31 May 2021

The effectiveness of feature attribution methods and its correlation with automatic evaluation scores

Neural Information Processing Systems (NeurIPS), 2021

482

105

31 May 2021

Drop Clause: Enhancing Performance, Interpretability and Robustness of the Tsetlin Machine

Jivitesh Sharma

Rohan Kumar Yadav

Ole-Christoffer Granmo

Lei Jiao

VLM

206

30 May 2021

EDDA: Explanation-driven Data Augmentation to Improve Explanation Faithfulness

212

29 May 2021

Fooling Partial Dependence via Data Poisoning

296

26 May 2021

Information-theoretic Evolution of Model Agnostic Global Explanations

173

14 May 2021

XAI Handbook: Towards a Unified Framework for Explainable AI

133

14 May 2021

Leveraging Sparse Linear Layers for Debuggable Deep Networks

International Conference on Machine Learning (ICML), 2021

208

11 May 2021

Interpretable Semantic Photo Geolocation

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021

Jonas Theiner

Eric Müller-Budack

Ralph Ewerth

181

30 Apr 2021

Towards Adversarial Patch Analysis and Certified Defense against Crowd Counting

ACM Multimedia (ACM MM), 2021

Xiaoqing Ye

246

22 Apr 2021

On the Sensitivity and Stability of Model Interpretations in NLP

Annual Meeting of the Association for Computational Linguistics (ACL), 2021

248

18 Apr 2021

Evaluating Saliency Methods for Neural Language Models

North American Chapter of the Association for Computational Linguistics (NAACL), 2021

Shuoyang Ding

Philipp Koehn

FAtt XAI

122

12 Apr 2021

A-FMI: Learning Attributions from Deep Networks via Feature Map Importance

12 Apr 2021

Sparse Oblique Decision Trees: A Tool to Understand and Manipulate Neural Net Features

Data Mining and Knowledge Discovery (DMKD), 2021

Suryabhan Singh Hada

Miguel Á. Carreira-Perpiñán

Arman Zharmagambetov

199

07 Apr 2021

Neural Response Interpretation through the Lens of Critical Pathways

Computer Vision and Pattern Recognition (CVPR), 2021

Christian Rupprecht

Nassir Navab

123

31 Mar 2021

Building Reliable Explanations of Unreliable Neural Networks: Locally Smoothing Perspective of Model Interpretation

Computer Vision and Pattern Recognition (CVPR), 2021

160

26 Mar 2021

ExAD: An Ensemble Approach for Explanation-based Adversarial Detection

R. Vardhan

Ninghao Liu

Phakpoom Chinprutthiwong

193

22 Mar 2021

CACTUS: Detecting and Resolving Conflicts in Objective Functions

Subhajit Das

Alex Endert

13 Mar 2021

Human-Understandable Decision Making for Visual Recognition

Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2021

Ivor Tsang

122

05 Mar 2021

Detecting Spurious Correlations with Sanity Tests for Artificial Intelligence Guided Radiology Systems

Frontiers in Digital Health (FDH), 2021

183

04 Mar 2021

Do Input Gradients Highlight Discriminative Features?

Neural Information Processing Systems (NeurIPS), 2021

Harshay Shah

Prateek Jain

Praneeth Netrapalli

AAML FAtt

362

25 Feb 2021

Resilience of Bayesian Layer-Wise Explanations under Adversarial Attacks

IEEE International Joint Conference on Neural Network (IJCNN), 2021

279

22 Feb 2021