ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1806.00069
  4. Cited By
Explaining Explanations: An Overview of Interpretability of Machine
  Learning

Explaining Explanations: An Overview of Interpretability of Machine Learning

31 May 2018
Leilani H. Gilpin
David Bau
Ben Z. Yuan
Ayesha Bajwa
Michael A. Specter
Lalana Kagal
    XAI
ArXivPDFHTML

Papers citing "Explaining Explanations: An Overview of Interpretability of Machine Learning"

50 / 160 papers shown
Title
Towards a Unified Representation Evaluation Framework Beyond Downstream Tasks
Towards a Unified Representation Evaluation Framework Beyond Downstream Tasks
Christos Plachouras
Julien Guinot
George Fazekas
Elio Quinton
Emmanouil Benetos
Johan Pauwels
110
1
0
09 May 2025
Reasoning Models Don't Always Say What They Think
Reasoning Models Don't Always Say What They Think
Yanda Chen
Joe Benton
Ansh Radhakrishnan
Jonathan Uesato
Carson E. Denison
...
Vlad Mikulik
Samuel R. Bowman
Jan Leike
Jared Kaplan
E. Perez
ReLM
LRM
67
12
1
08 May 2025
A Mathematical Philosophy of Explanations in Mechanistic Interpretability -- The Strange Science Part I.i
A Mathematical Philosophy of Explanations in Mechanistic Interpretability -- The Strange Science Part I.i
Kola Ayonrinde
Louis Jaburi
MILM
84
1
0
01 May 2025
Deriving Equivalent Symbol-Based Decision Models from Feedforward Neural Networks
Deriving Equivalent Symbol-Based Decision Models from Feedforward Neural Networks
Sebastian Seidel
Uwe M. Borghoff
28
0
0
16 Apr 2025
Explainable AI-Based Interface System for Weather Forecasting Model
Explainable AI-Based Interface System for Weather Forecasting Model
Soyeon Kim
Junho Choi
Yeji Choi
Subeen Lee
Artyom Stitsyuk
Minkyoung Park
Seongyeop Jeong
Youhyun Baek
Jaesik Choi
XAI
48
2
0
01 Apr 2025
Rubrik's Cube: Testing a New Rubric for Evaluating Explanations on the CUBE dataset
Rubrik's Cube: Testing a New Rubric for Evaluating Explanations on the CUBE dataset
Diana Galván-Sosa
Gabrielle Gaudeau
Pride Kavumba
Yunmeng Li
Hongyi gu
Zheng Yuan
Keisuke Sakaguchi
P. Buttery
LRM
35
0
0
31 Mar 2025
Investigating the Duality of Interpretability and Explainability in Machine Learning
Investigating the Duality of Interpretability and Explainability in Machine Learning
Moncef Garouani
Josiane Mothe
Ayah Barhrhouj
Julien Aligon
AAML
36
2
0
27 Mar 2025
Model Lakes
Model Lakes
Koyena Pal
David Bau
Renée J. Miller
63
0
0
24 Feb 2025
Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards
Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards
Xinyi Yang
Liang Zeng
Heng Dong
C. Yu
X. Wu
H. Yang
Yu Wang
Milind Tambe
Tonghan Wang
68
2
0
18 Feb 2025
A Study of the Plausibility of Attention between RNN Encoders in Natural Language Inference
A Study of the Plausibility of Attention between RNN Encoders in Natural Language Inference
Duc Hau Nguyen
Duc Hau Nguyen
Pascale Sébillot
42
5
0
23 Jan 2025
Attention Mechanisms Don't Learn Additive Models: Rethinking Feature Importance for Transformers
Attention Mechanisms Don't Learn Additive Models: Rethinking Feature Importance for Transformers
Tobias Leemann
Alina Fastowski
Felix Pfeiffer
Gjergji Kasneci
51
4
0
10 Jan 2025
GIFT: A Framework for Global Interpretable Faithful Textual Explanations of Vision Classifiers
GIFT: A Framework for Global Interpretable Faithful Textual Explanations of Vision Classifiers
Éloi Zablocki
Valentin Gerard
Amaia Cardiel
Eric Gaussier
Matthieu Cord
Eduardo Valle
69
0
0
23 Nov 2024
FLARE: Faithful Logic-Aided Reasoning and Exploration
FLARE: Faithful Logic-Aided Reasoning and Exploration
Erik Arakelyan
Pasquale Minervini
Pat Verga
Patrick Lewis
Isabelle Augenstein
ReLM
LRM
61
2
0
14 Oct 2024
Bridging Today and the Future of Humanity: AI Safety in 2024 and Beyond
Bridging Today and the Future of Humanity: AI Safety in 2024 and Beyond
Shanshan Han
73
1
0
09 Oct 2024
$\texttt{dattri}$: A Library for Efficient Data Attribution
dattri\texttt{dattri}dattri: A Library for Efficient Data Attribution
Junwei Deng
Ting-Wei Li
Shiyuan Zhang
Shixuan Liu
Yijun Pan
Hao Huang
Xinhe Wang
Pingbang Hu
Xingjian Zhang
Jiaqi W. Ma
TDI
34
3
0
06 Oct 2024
Interactive Example-based Explanations to Improve Health Professionals'
  Onboarding with AI for Human-AI Collaborative Decision Making
Interactive Example-based Explanations to Improve Health Professionals' Onboarding with AI for Human-AI Collaborative Decision Making
Min Hun Lee
Renee Bao Xuan Ng
Silvana Xin Yi Choo
S. Thilarajah
26
0
0
24 Sep 2024
What is the Role of Small Models in the LLM Era: A Survey
What is the Role of Small Models in the LLM Era: A Survey
Lihu Chen
Gaël Varoquaux
ALM
58
23
0
10 Sep 2024
Interpretable Clustering: A Survey
Interpretable Clustering: A Survey
Lianyu Hu
Mudi Jiang
Junjie Dong
Xinying Liu
Zengyou He
26
1
0
01 Sep 2024
Explainable Artificial Intelligence: A Survey of Needs, Techniques, Applications, and Future Direction
Explainable Artificial Intelligence: A Survey of Needs, Techniques, Applications, and Future Direction
Melkamu Mersha
Khang Lam
Joseph Wood
Ali AlShami
Jugal Kalita
XAI
AI4TS
64
28
0
30 Aug 2024
Misfitting With AI: How Blind People Verify and Contest AI Errors
Misfitting With AI: How Blind People Verify and Contest AI Errors
Rahaf Alharbi
P. Lor
Jaylin Herskovitz
S. Schoenebeck
Robin Brewer
31
10
0
13 Aug 2024
ISR: Invertible Symbolic Regression
ISR: Invertible Symbolic Regression
Tony Tohme
M. J. Khojasteh
Mohsen Sadr
Florian Meyer
Kamal Youcef-Toumi
43
0
0
10 May 2024
Differential contributions of machine learning and statistical analysis
  to language and cognitive sciences
Differential contributions of machine learning and statistical analysis to language and cognitive sciences
Kun Sun
Rong Wang
33
1
0
22 Apr 2024
Crafting Knowledge: Exploring the Creative Mechanisms of Chat-Based
  Search Engines
Crafting Knowledge: Exploring the Creative Mechanisms of Chat-Based Search Engines
Lijia Ma
Xingchen Xu
Yong-Ming Tan
32
7
0
29 Feb 2024
Explaining Probabilistic Models with Distributional Values
Explaining Probabilistic Models with Distributional Values
Luca Franceschi
Michele Donini
Cédric Archambeau
Matthias Seeger
FAtt
21
2
0
15 Feb 2024
Large Language Model Agent for Hyper-Parameter Optimization
Large Language Model Agent for Hyper-Parameter Optimization
Siyi Liu
Chen Gao
Yong Li
37
19
0
02 Feb 2024
Black-Box Access is Insufficient for Rigorous AI Audits
Black-Box Access is Insufficient for Rigorous AI Audits
Stephen Casper
Carson Ezell
Charlotte Siegmann
Noam Kolt
Taylor Lynn Curtis
...
Michael Gerovitch
David Bau
Max Tegmark
David M. Krueger
Dylan Hadfield-Menell
AAML
17
76
0
25 Jan 2024
Mathematical Algorithm Design for Deep Learning under Societal and
  Judicial Constraints: The Algorithmic Transparency Requirement
Mathematical Algorithm Design for Deep Learning under Societal and Judicial Constraints: The Algorithmic Transparency Requirement
Holger Boche
Adalbert Fono
Gitta Kutyniok
FaML
23
4
0
18 Jan 2024
B-Cos Aligned Transformers Learn Human-Interpretable Features
B-Cos Aligned Transformers Learn Human-Interpretable Features
Manuel Tran
Amal Lahiani
Yashin Dicente Cid
Melanie Boxberg
Peter Lienemann
C. Matek
S. J. Wagner
Fabian J. Theis
Eldad Klaiman
Tingying Peng
MedIm
ViT
13
2
0
16 Jan 2024
Labeling Neural Representations with Inverse Recognition
Labeling Neural Representations with Inverse Recognition
Kirill Bykov
Laura Kopf
Shinichi Nakajima
Marius Kloft
Marina M.-C. Höhne
BDL
19
15
0
22 Nov 2023
On the Relationship Between Interpretability and Explainability in
  Machine Learning
On the Relationship Between Interpretability and Explainability in Machine Learning
Benjamin Leblanc
Pascal Germain
FaML
24
0
0
20 Nov 2023
A novel post-hoc explanation comparison metric and applications
A novel post-hoc explanation comparison metric and applications
Shreyan Mitra
Leilani H. Gilpin
FAtt
26
0
0
17 Nov 2023
Deep Natural Language Feature Learning for Interpretable Prediction
Deep Natural Language Feature Learning for Interpretable Prediction
Felipe Urrutia
Cristian Buc
Valentin Barriere
23
1
0
09 Nov 2023
Notion of Explainable Artificial Intelligence -- An Empirical
  Investigation from A Users Perspective
Notion of Explainable Artificial Intelligence -- An Empirical Investigation from A Users Perspective
A. Haque
A. Najmul Islam
Patrick Mikalef
21
1
0
01 Nov 2023
Scene Text Recognition Models Explainability Using Local Features
Scene Text Recognition Models Explainability Using Local Features
M. Ty
Rowel Atienza
26
1
0
14 Oct 2023
Interpretability is not Explainability: New Quantitative XAI Approach
  with a focus on Recommender Systems in Education
Interpretability is not Explainability: New Quantitative XAI Approach with a focus on Recommender Systems in Education
Riccardo Porcedda
XAI
15
0
0
18 Sep 2023
WSAM: Visual Explanations from Style Augmentation as Adversarial
  Attacker and Their Influence in Image Classification
WSAM: Visual Explanations from Style Augmentation as Adversarial Attacker and Their Influence in Image Classification
Felipe Moreno-Vera
E. Medina
Jorge Poco
19
2
0
29 Aug 2023
RecRec: Algorithmic Recourse for Recommender Systems
RecRec: Algorithmic Recourse for Recommender Systems
Sahil Verma
Ashudeep Singh
Varich Boonsanong
John P. Dickerson
Chirag Shah
25
1
0
28 Aug 2023
ASCAPE: An open AI ecosystem to support the quality of life of cancer
  patients
ASCAPE: An open AI ecosystem to support the quality of life of cancer patients
Konstantinos Lampropoulos
T. Kosmidis
Serge Autexier
Miloš Savić
Manos Athanatos
Miltiadis Kokkonidis
Tzortzia Koutsouri
A. Vizitiu
A. Valachis
Miriam Quintero Padron
11
4
0
28 Aug 2023
Do Models Explain Themselves? Counterfactual Simulatability of Natural
  Language Explanations
Do Models Explain Themselves? Counterfactual Simulatability of Natural Language Explanations
Yanda Chen
Ruiqi Zhong
Narutatsu Ri
Chen Zhao
He He
Jacob Steinhardt
Zhou Yu
Kathleen McKeown
LRM
24
47
0
17 Jul 2023
Reliable AI: Does the Next Generation Require Quantum Computing?
Reliable AI: Does the Next Generation Require Quantum Computing?
Aras Bacho
Holger Boche
Gitta Kutyniok
24
2
0
03 Jul 2023
Explainable Predictive Maintenance
Explainable Predictive Maintenance
Sepideh Pashami
Sławomir Nowaczyk
Yuantao Fan
Jakub Jakubowski
Nuno Paiva
...
Bruno Veloso
M. Sayed-Mouchaweh
L. Rajaoarisoa
Grzegorz J. Nalepa
João Gama
27
8
0
08 Jun 2023
A Review on Explainable Artificial Intelligence for Healthcare: Why,
  How, and When?
A Review on Explainable Artificial Intelligence for Healthcare: Why, How, and When?
M. Rubaiyat
Hossain Mondal
Prajoy Podder
13
56
0
10 Apr 2023
Posthoc Interpretation via Quantization
Posthoc Interpretation via Quantization
Francesco Paissan
Cem Subakan
Mirco Ravanelli
MQ
11
6
0
22 Mar 2023
Intelligent diagnostic scheme for lung cancer screening with Raman
  spectra data by tensor network machine learning
Intelligent diagnostic scheme for lung cancer screening with Raman spectra data by tensor network machine learning
Yujia An
Shengxing Bai
Lin Cheng
Xiao‐Guang Li
Cheng Wang
Xiao Han
Gang Su
Shi-Ju Ran
Cong Wang
6
1
0
11 Mar 2023
A System's Approach Taxonomy for User-Centred XAI: A Survey
A System's Approach Taxonomy for User-Centred XAI: A Survey
Ehsan Emamirad
Pouya Ghiasnezhad Omran
A. Haller
S. Gregor
21
1
0
06 Mar 2023
Less is More: The Influence of Pruning on the Explainability of CNNs
Less is More: The Influence of Pruning on the Explainability of CNNs
David Weber
F. Merkle
Pascal Schöttle
Stephan Schlögl
Martin Nocker
FAtt
29
1
0
17 Feb 2023
Understanding User Preferences in Explainable Artificial Intelligence: A
  Survey and a Mapping Function Proposal
Understanding User Preferences in Explainable Artificial Intelligence: A Survey and a Mapping Function Proposal
M. Hashemi
Ali Darejeh
Francisco Cruz
37
3
0
07 Feb 2023
Fixed-kinetic Neural Hamiltonian Flows for enhanced interpretability and
  reduced complexity
Fixed-kinetic Neural Hamiltonian Flows for enhanced interpretability and reduced complexity
Vincent Souveton
Arnaud Guillin
J. Jasche
G. Lavaux
Manon Michel
16
3
0
03 Feb 2023
SoK: A Systematic Evaluation of Backdoor Trigger Characteristics in
  Image Classification
SoK: A Systematic Evaluation of Backdoor Trigger Characteristics in Image Classification
Gorka Abad
Jing Xu
Stefanos Koffas
Behrad Tajalli
S. Picek
Mauro Conti
AAML
54
5
0
03 Feb 2023
Faithful Chain-of-Thought Reasoning
Faithful Chain-of-Thought Reasoning
Qing Lyu
Shreya Havaldar
Adam Stein
Li Zhang
D. Rao
Eric Wong
Marianna Apidianaki
Chris Callison-Burch
ReLM
LRM
19
207
0
31 Jan 2023
1234
Next