2307.05902
Stability Guarantees for Feature Attributions with Multiplicative Smoothing
12 July 2023
Anton Xue
Rajeev Alur
Eric Wong
Papers citing
"Stability Guarantees for Feature Attributions with Multiplicative Smoothing"
10 / 10 papers shown
Title
Probabilistic Stability Guarantees for Feature Attributions
Helen Jin
Anton Xue
Weiqiu You
Surbhi Goel
Eric Wong
19
0
0
18 Apr 2025
One Wave to Explain Them All: A Unifying Perspective on Post-hoc Explainability
Gabriel Kasmi
Amandine Brunetto
Thomas Fel
Jayneel Parekh
AAML
FAtt
22
0
0
02 Oct 2024
Enhancing Model Interpretability with Local Attribution over Global Exploration
Zhiyu Zhu
Zhibo Jin
Jiayu Zhang
Huaming Chen
FAtt
19
4
0
14 Aug 2024
SmoothLLM: Defending Large Language Models Against Jailbreaking Attacks
Alexander Robey
Eric Wong
Hamed Hassani
George J. Pappas
AAML
38
216
0
05 Oct 2023
Towards Faithful Model Explanation in NLP: A Survey
Qing Lyu
Marianna Apidianaki
Chris Callison-Burch
XAI
104
107
0
22 Sep 2022
The Solvability of Interpretability Evaluation Metrics
Yilun Zhou
J. Shah
62
8
0
18 May 2022
"Will You Find These Shortcuts?" A Protocol for Evaluating the Faithfulness of Input Salience Methods for Text Classification
Jasmijn Bastings
Sebastian Ebert
Polina Zablotskaia
Anders Sandholm
Katja Filippova
107
75
0
14 Nov 2021
Certified Patch Robustness via Smoothed Vision Transformers
Hadi Salman
Saachi Jain
Eric Wong
Aleksander Mądry
AAML
57
58
0
11 Oct 2021
Adversarial Machine Learning at Scale
Alexey Kurakin
Ian Goodfellow
Samy Bengio
AAML
256
3,108
0
04 Nov 2016
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
282
39,170
0
01 Sep 2014