2002.09815
Cited By
Neuron Shapley: Discovering the Responsible Neurons
Neural Information Processing Systems (NeurIPS), 2020
23 February 2020
Amirata Ghorbani
James Zou
FAtt
TDI
Papers citing
"Neuron Shapley: Discovering the Responsible Neurons"
50 / 86 papers shown
Enforcing Orderedness to Improve Feature Consistency
Sophie L. Wang
Alex Quach
Nithin Parsan
John J. Yang
77
0
0
01 Dec 2025
Rethinking Saliency Maps: A Cognitive Human Aligned Taxonomy and Evaluation Framework for Explanations
Yehonatan Elisha
Seffi Cohen
Oren Barkan
Noam Koenigstein
FAtt
385
3
0
17 Nov 2025
MIN-Merging: Merge the Important Neurons for Model Merging
Yunfei Liang
MoMe
605
0
0
18 Oct 2025
CoopQ: Cooperative Game Inspired Layerwise Mixed Precision Quantization for LLMs
Junchen Zhao
Ali Derakhshan
Dushyant Bharadwaj
Jayden Kana Hyman
Junhao Dong
Sangeetha Abdu Jyothi
MQ
181
0
0
18 Sep 2025
ORACLE: Explaining Feature Interactions in Neural Networks with ANOVA
Dongseok Kim
Wonjun Jeong
Mohamed Jismy Aashik Rasool
Gisung Oh
280
0
0
13 Sep 2025
DFAMS: Dynamic-flow guided Federated Alignment based Multi-prototype Search
Zhibang Yang
Xinke Jiang
Rihong Qiu
Ruiqing Li
Yihang Zhang
...
Yongxin Xu
Hongxin Ding
Xu Chu
Junfeng Zhao
Yasha Wang
217
1
0
28 Aug 2025
From Indirect Object Identification to Syllogisms: Exploring Binary Mechanisms in Transformer Circuits
Jiaqi W. Ma
Shichang Zhang
163
0
0
22 Aug 2025
Where and How to Enhance: Discovering Bit-Width Contribution for Mixed Precision Quantization
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Haidong Kang
Lianbo Ma
Guo-Ding Yu
Shangce Gao
MQ
314
1
0
05 Aug 2025
Improving Multimodal Learning Balance and Sufficiency through Data Remixing
Xiaoyu Ma
Hao Chen
Yongjian Deng
358
6
0
13 Jun 2025
Tensorization is a powerful but underexplored tool for compression and interpretability of neural networks
Safa Hamreras
Sukhbinder Singh
Roman Orus
448
1
0
26 May 2025
Antithetic Sampling for Top-k Shapley Identification
Patrick Kolpaczki
Tim Nielen
Eyke Hüllermeier
FAtt
TDI
493
0
0
02 Apr 2025
Discovering Influential Neuron Path in Vision Transformers
International Conference on Learning Representations (ICLR), 2025
Yifan Wang
Yifei Liu
Yingdong Shi
Chong Li
Anqi Pang
Sibei Yang
Jingyi Yu
Kan Ren
ViT
648
5
0
12 Mar 2025
FW-Shapley: Real-time Estimation of Weighted Shapley Values
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Pranoy Panda
Siddharth Tandon
V. Balasubramanian
TDI
454
3
0
09 Mar 2025
Nonlinear Sparse Generalized Canonical Correlation Analysis for Multi-view High-dimensional Data
Rong Wu
Ziqi Chen
Gen Li
Hai Shu
CML
300
0
0
26 Feb 2025
NeurFlow: Interpreting Neural Networks through Neuron Groups and Functional Interactions
International Conference on Learning Representations (ICLR), 2025
Tue Cao
Nhat X. Hoang
Hieu H. Pham
P. Nguyen
My T. Thai
665
3
0
22 Feb 2025
Faithful Counterfactual Visual Explanations (FCVE)
Knowledge-Based Systems (KBS), 2024
Bismillah Khan
Syed Ali Tariq
Tehseen Zia
Muhammad Ahsan
David Windridge
280
1
0
12 Jan 2025
Linguistically Grounded Analysis of Language Models using Shapley Head Values
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Marcell Richard Fekete
Johannes Bjerva
519
1
0
17 Oct 2024
On-the-fly Modulation for Balanced Multimodal Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Yake Wei
D. Hu
Henghui Du
Ji-Rong Wen
308
37
0
15 Oct 2024
PCEvE: Part Contribution Evaluation Based Model Explanation for Human Figure Drawing Assessment and Beyond
Jongseo Lee
Geo Ahn
Seong Tae Kim
Jinwoo Choi
386
0
0
26 Sep 2024
Linking in Style: Understanding learned features in deep learning models
European Conference on Computer Vision (ECCV), 2024
Maren H. Wehrheim
Pamela Osuna-Vargas
Matthias Kaschube
GAN
260
0
0
25 Sep 2024
Investigating Layer Importance in Large Language Models
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2024
Yang Zhang
Yanfei Dong
Kenji Kawaguchi
FAtt
300
33
0
22 Sep 2024
Optimal ablation for interpretability
Neural Information Processing Systems (NeurIPS), 2024
Maximilian Li
Lucas Janson
FAtt
449
15
0
16 Sep 2024
FAST: Boosting Uncertainty-based Test Prioritization Methods for Neural Networks via Feature Selection
International Conference on Automated Software Engineering (ASE), 2024
Jialuo Chen
Jingyi Wang
Xiyue Zhang
Youcheng Sun
Marta Kwiatkowska
Jiming Chen
Peng Cheng
279
3
0
13 Sep 2024
Interpretable Triplet Importance for Personalized Ranking
International Conference on Information and Knowledge Management (CIKM), 2024
Bowei He
Chen Ma
337
3
0
28 Jul 2024
On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs
Nitay Calderon
Roi Reichart
391
31
0
27 Jul 2024
How and where does CLIP process negation?
Vincent Quantmeyer
Pablo Mosteiro
Albert Gatt
CoGe
305
14
0
15 Jul 2024
Neural Dynamic Data Valuation: A Stochastic Optimal Control Approach
Zhangyong Liang
Ji Zhang
Ji Zhang
Pengfei Zhang
Zhao Li
TDI
499
1
0
30 Apr 2024
Mechanistic Interpretability for AI Safety -- A Review
Leonard Bereska
E. Gavves
AI4CE
519
364
0
22 Apr 2024
Decomposing and Editing Predictions by Modeling Model Computation
Harshay Shah
Andrew Ilyas
Aleksander Madry
KELM
330
25
0
17 Apr 2024
Incremental Residual Concept Bottleneck Models
Chenming Shang
Shiji Zhou
Hengyuan Zhang
Xinzhe Ni
Yujiu Yang
Yuwang Wang
409
47
0
13 Apr 2024
Understanding Multimodal Deep Neural Networks: A Concept Selection View
Chenming Shang
Hengyuan Zhang
Hao Wen
Yujiu Yang
325
10
0
13 Apr 2024
Red-Teaming Segment Anything Model
K. Jankowski
Bartlomiej Sobieski
Mateusz Kwiatkowski
J. Szulc
Michael F. Janik
Hubert Baniecki
P. Biecek
VLM
AAML
269
4
0
02 Apr 2024
Explaining Probabilistic Models with Distributional Values
Luca Franceschi
Michele Donini
Cédric Archambeau
Matthias Seeger
FAtt
288
3
0
15 Feb 2024
EcoVal: An Efficient Data Valuation Framework for Machine Learning
Ayush K Tarun
Vikram S Chundawat
Murari Mandal
Hong Ming Tan
Bowei Chen
Mohan Kankanhalli
TDI
537
3
0
14 Feb 2024
Thresholding Data Shapley for Data Cleansing Using Multi-Armed Bandits
Hiroyuki Namba
Shota Horiguchi
Masaki Hamamoto
Masashi Egi
TDI
241
1
0
13 Feb 2024
Black-Box Access is Insufficient for Rigorous AI Audits
Conference on Fairness, Accountability and Transparency (FAccT), 2024
Stephen Casper
Carson Ezell
Charlotte Siegmann
Noam Kolt
Taylor Lynn Curtis
...
Michael Gerovitch
David Bau
Max Tegmark
David M. Krueger
Dylan Hadfield-Menell
AAML
710
149
0
25 Jan 2024
Stabilizing Estimates of Shapley Values with Control Variates
Jeremy Goldwasser
Giles Hooker
FAtt
408
9
0
11 Oct 2023
Enhancing multimodal cooperation via sample-level modality valuation
Computer Vision and Pattern Recognition (CVPR), 2023
Yake Wei
Ruoxuan Feng
Zihe Wang
Di Hu
582
60
0
12 Sep 2023
Circuit Breaking: Removing Model Behaviors with Targeted Ablation
Maximilian Li
Xander Davies
Max Nadeau
KELM
MU
401
35
0
12 Sep 2023
Test-Time Backdoor Defense via Detecting and Repairing
Jiyang Guan
Jian Liang
Ran He
AAML
241
0
0
11 Aug 2023
Efficient Multiuser AI Downloading via Reusable Knowledge Broadcasting
IEEE Transactions on Wireless Communications (IEEE TWC), 2023
Hai Wu
Qunsong Zeng
Kaibin Huang
392
16
0
28 Jul 2023
Causal Analysis for Robust Interpretability of Neural Networks
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Ola Ahmad
Nicolas Béreux
Loïc Baret
V. Hashemi
Freddy Lecue
CML
409
13
0
15 May 2023
Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value
International Conference on Machine Learning (ICML), 2023
Yongchan Kwon
James Zou
TDI
FedML
503
54
0
16 Apr 2023
LINe: Out-of-Distribution Detection by Leveraging Important Neurons
Computer Vision and Pattern Recognition (CVPR), 2023
Yong Hyun Ahn
Gyeong-Moon Park
Seong Tae Kim
OODD
404
48
0
24 Mar 2023
Improving Fairness in Adaptive Social Exergames via Shapley Bandits
International Conference on Intelligent User Interfaces (IUI), 2023
Robert C. Gray
Jennifer Villareale
T. Fox
Diane H Dallal
Santiago Ontañón
D. Arigo
S. Jabbari
Jichen Zhu
159
6
0
18 Feb 2023
Approximating the Shapley Value without Marginal Contributions
AAAI Conference on Artificial Intelligence (AAAI), 2023
Patrick Kolpaczki
Viktor Bengs
Maximilian Muschalik
Eyke Hüllermeier
FAtt
TDI
675
42
0
01 Feb 2023
A Survey of Explainable AI in Deep Visual Modeling: Methods and Metrics
Naveed Akhtar
XAI
VLM
269
9
0
31 Jan 2023
Training Data Influence Analysis and Estimation: A Survey
Machine-mediated learning (ML), 2022
Zayd Hammoudeh
Daniel Lowd
TDI
608
165
0
09 Dec 2022
Refiner: Data Refining against Gradient Leakage Attacks in Federated Learning
Mingyuan Fan
Cen Chen
Chengyu Wang
Ximeng Liu
Wenmeng Zhou
AAML
FedML
457
2
0
05 Dec 2022
Diagnostics for Deep Neural Networks with Automated Copy/Paste Attacks
Stephen Casper
K. Hariharan
Dylan Hadfield-Menell
AAML
458
11
0
18 Nov 2022