arXiv:2202.12451
Human-Centered Concept Explanations for Neural Networks
25 February 2022
Chih-Kuan Yeh
Been Kim
Pradeep Ravikumar
FAtt
Papers citing "Human-Centered Concept Explanations for Neural Networks"
20 / 20 papers shown
Title
Mechanistically Interpreting a Transformer-based 2-SAT Solver: An Axiomatic Approach
Nils Palumbo
Ravi Mangal
Zifan Wang
Saranya Vijayakumar
Corina S. Pasareanu
Somesh Jha
41
1
0
18 Jul 2024
Provably Better Explanations with Optimized Aggregation of Feature Attributions
Thomas Decker
Ananta R. Bhattarai
Jindong Gu
Volker Tresp
Florian Buettner
20
2
0
07 Jun 2024
Concept-based Analysis of Neural Networks via Vision-Language Models
Ravi Mangal
Nina Narodytska
Divya Gopinath
Boyue Caroline Hu
Anirban Roy
Susmit Jha
Corina S. Pasareanu
CoGe
20
3
0
28 Mar 2024
A survey on Concept-based Approaches For Model Improvement
Avani Gupta
P. J. Narayanan
LRM
24
5
0
21 Mar 2024
Towards Modeling Uncertainties of Self-explaining Neural Networks via Conformal Prediction
Wei Qian
Chenxu Zhao
Yangyi Li
Fenglong Ma
Chao Zhang
Mengdi Huai
UQCV
40
2
0
03 Jan 2024
Estimation of Concept Explanations Should be Uncertainty Aware
Vihari Piratla
Juyeon Heo
Katherine M. Collins
Sukriti Singh
Adrian Weller
11
1
0
13 Dec 2023
Concept Distillation: Leveraging Human-Centered Explanations for Model Improvement
Avani Gupta
Saurabh Saini
P. J. Narayanan
23
6
0
26 Nov 2023
Explaining Deep Neural Networks for Bearing Fault Detection with Vibration Concepts
Thomas Decker
Michael Lebacher
Volker Tresp
FAtt
13
2
0
17 Oct 2023
Concept-Based Explanations to Test for False Causal Relationships Learned by Abusive Language Classifiers
I. Nejadgholi
S. Kiritchenko
Kathleen C. Fraser
Esma Balkir
21
0
0
04 Jul 2023
On the Role of Emergent Communication for Social Learning in Multi-Agent Reinforcement Learning
Seth Karten
Siva Kailas
Huao Li
Katia P. Sycara
OffRL
28
4
0
28 Feb 2023
Towards Procedural Fairness: Uncovering Biases in How a Toxic Language Classifier Uses Sentiment Information
I. Nejadgholi
Esma Balkir
Kathleen C. Fraser
S. Kiritchenko
23
3
0
19 Oct 2022
A.I. Robustness: a Human-Centered Perspective on Technological Challenges and Opportunities
Andrea Tocchetti
Lorenzo Corti
Agathe Balayn
Mireia Yurrita
Philip Lippmann
Marco Brambilla
Jie-jin Yang
19
10
0
17 Oct 2022
Unpacking Large Language Models with Conceptual Consistency
Pritish Sahu
Michael Cogswell
Yunye Gong
Ajay Divakaran
LRM
79
16
0
29 Sep 2022
Leveraging Explanations in Interactive Machine Learning: An Overview
Stefano Teso
Öznur Alkan
Wolfgang Stammer
Elizabeth M. Daly
XAI
FAtt
LRM
24
62
0
29 Jul 2022
Learning Unsupervised Hierarchies of Audio Concepts
Darius Afchar
Romain Hennequin
Vincent Guigue
29
2
0
21 Jul 2022
Neural Activation Patterns (NAPs): Visual Explainability of Learned Concepts
Alex Bauerle
Daniel Jonsson
Timo Ropinski
FAtt
22
11
0
20 Jun 2022
Xplique: A Deep Learning Explainability Toolbox
Thomas Fel
Lucas Hervier
David Vigouroux
Antonin Poché
Justin Plakoo
...
Agustin Picard
C. Nicodeme
Laurent Gardes
G. Flandin
Thomas Serre
11
30
0
09 Jun 2022
HEX: Human-in-the-loop Explainability via Deep Reinforcement Learning
Michael T. Lash
13
0
0
02 Jun 2022
On Completeness-aware Concept-Based Explanations in Deep Neural Networks
Chih-Kuan Yeh
Been Kim
Sercan Ö. Arik
Chun-Liang Li
Tomas Pfister
Pradeep Ravikumar
FAtt
122
297
0
17 Oct 2019
Efficient Estimation of Word Representations in Vector Space
Tomáš Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
228
31,244
0
16 Jan 2013