Stability Guarantees for Feature Attributions with Multiplicative Smoothing

12 July 2023

Papers citing "Stability Guarantees for Feature Attributions with Multiplicative Smoothing"

10 / 10 papers shown

Title
Probabilistic Stability Guarantees for Feature Attributions Helen Jin Anton Xue Weiqiu You Surbhi Goel Eric Wong 19 0 0 18 Apr 2025
One Wave to Explain Them All: A Unifying Perspective on Post-hoc Explainability Gabriel Kasmi Amandine Brunetto Thomas Fel Jayneel Parekh AAML FAtt 22 0 0 02 Oct 2024
Enhancing Model Interpretability with Local Attribution over Global Exploration Zhiyu Zhu Zhibo Jin Jiayu Zhang Huaming Chen FAtt 21 4 0 14 Aug 2024
SmoothLLM: Defending Large Language Models Against Jailbreaking Attacks Alexander Robey Eric Wong Hamed Hassani George J. Pappas AAML 38 216 0 05 Oct 2023
Towards Faithful Model Explanation in NLP: A Survey Qing Lyu Marianna Apidianaki Chris Callison-Burch XAI 104 107 0 22 Sep 2022
The Solvability of Interpretability Evaluation Metrics Yilun Zhou J. Shah 62 8 0 18 May 2022
"Will You Find These Shortcuts?" A Protocol for Evaluating the Faithfulness of Input Salience Methods for Text Classification Jasmijn Bastings Sebastian Ebert Polina Zablotskaia Anders Sandholm Katja Filippova 107 75 0 14 Nov 2021
Certified Patch Robustness via Smoothed Vision Transformers Hadi Salman Saachi Jain Eric Wong Aleksander Mkadry AAML 57 58 0 11 Oct 2021
Adversarial Machine Learning at Scale Alexey Kurakin Ian Goodfellow Samy Bengio AAML 256 3,108 0 04 Nov 2016
ImageNet Large Scale Visual Recognition Challenge Olga Russakovsky Jia Deng Hao Su J. Krause S. Satheesh ... A. Karpathy A. Khosla Michael S. Bernstein Alexander C. Berg Li Fei-Fei VLM ObjD 282 39,170 0 01 Sep 2014