Explaining Explanations: An Overview of Interpretability of Machine Learning

31 May 2018

Papers citing "Explaining Explanations: An Overview of Interpretability of Machine Learning"

50 / 160 papers shown

Title
Towards a Unified Representation Evaluation Framework Beyond Downstream Tasks Christos Plachouras Julien Guinot George Fazekas Elio Quinton Emmanouil Benetos Johan Pauwels 110 1 0 09 May 2025
Reasoning Models Don't Always Say What They Think Yanda Chen Joe Benton Ansh Radhakrishnan Jonathan Uesato Carson E. Denison ... Vlad Mikulik Samuel R. Bowman Jan Leike Jared Kaplan E. Perez ReLM LRM 67 12 1 08 May 2025
A Mathematical Philosophy of Explanations in Mechanistic Interpretability -- The Strange Science Part I.i Kola Ayonrinde Louis Jaburi MILM 84 1 0 01 May 2025
Deriving Equivalent Symbol-Based Decision Models from Feedforward Neural Networks Sebastian Seidel Uwe M. Borghoff 28 0 0 16 Apr 2025
Explainable AI-Based Interface System for Weather Forecasting Model Soyeon Kim Junho Choi Yeji Choi Subeen Lee Artyom Stitsyuk Minkyoung Park Seongyeop Jeong Youhyun Baek Jaesik Choi XAI 48 2 0 01 Apr 2025
Rubrik's Cube: Testing a New Rubric for Evaluating Explanations on the CUBE dataset Diana Galván-Sosa Gabrielle Gaudeau Pride Kavumba Yunmeng Li Hongyi gu Zheng Yuan Keisuke Sakaguchi P. Buttery LRM 35 0 0 31 Mar 2025
Investigating the Duality of Interpretability and Explainability in Machine Learning Moncef Garouani Josiane Mothe Ayah Barhrhouj Julien Aligon AAML 36 2 0 27 Mar 2025
Model Lakes Koyena Pal David Bau Renée J. Miller 63 0 0 24 Feb 2025
Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards Xinyi Yang Liang Zeng Heng Dong C. Yu X. Wu H. Yang Yu Wang Milind Tambe Tonghan Wang 68 2 0 18 Feb 2025
A Study of the Plausibility of Attention between RNN Encoders in Natural Language Inference Duc Hau Nguyen Duc Hau Nguyen Pascale Sébillot 42 5 0 23 Jan 2025
Attention Mechanisms Don't Learn Additive Models: Rethinking Feature Importance for Transformers Tobias Leemann Alina Fastowski Felix Pfeiffer Gjergji Kasneci 51 4 0 10 Jan 2025
GIFT: A Framework for Global Interpretable Faithful Textual Explanations of Vision Classifiers Éloi Zablocki Valentin Gerard Amaia Cardiel Eric Gaussier Matthieu Cord Eduardo Valle 69 0 0 23 Nov 2024
FLARE: Faithful Logic-Aided Reasoning and Exploration Erik Arakelyan Pasquale Minervini Pat Verga Patrick Lewis Isabelle Augenstein ReLM LRM 61 2 0 14 Oct 2024
Bridging Today and the Future of Humanity: AI Safety in 2024 and Beyond Shanshan Han 73 1 0 09 Oct 2024
$$\texttt{dattri}$: A Library for Efficient Data Attribution$ $\texttt{dattri}$ : A Library for Efficient Data Attribution Junwei Deng Ting-Wei Li Shiyuan Zhang Shixuan Liu Yijun Pan Hao Huang Xinhe Wang Pingbang Hu Xingjian Zhang Jiaqi W. Ma TDI 34 3 0 06 Oct 2024
Interactive Example-based Explanations to Improve Health Professionals' Onboarding with AI for Human-AI Collaborative Decision Making Min Hun Lee Renee Bao Xuan Ng Silvana Xin Yi Choo S. Thilarajah 26 0 0 24 Sep 2024
What is the Role of Small Models in the LLM Era: A Survey Lihu Chen Gaël Varoquaux ALM 58 23 0 10 Sep 2024
Interpretable Clustering: A Survey Lianyu Hu Mudi Jiang Junjie Dong Xinying Liu Zengyou He 26 1 0 01 Sep 2024
Explainable Artificial Intelligence: A Survey of Needs, Techniques, Applications, and Future Direction Melkamu Mersha Khang Lam Joseph Wood Ali AlShami Jugal Kalita XAI AI4TS 64 28 0 30 Aug 2024
Misfitting With AI: How Blind People Verify and Contest AI Errors Rahaf Alharbi P. Lor Jaylin Herskovitz S. Schoenebeck Robin Brewer 31 10 0 13 Aug 2024
ISR: Invertible Symbolic Regression Tony Tohme M. J. Khojasteh Mohsen Sadr Florian Meyer Kamal Youcef-Toumi 43 0 0 10 May 2024
Differential contributions of machine learning and statistical analysis to language and cognitive sciences Kun Sun Rong Wang 33 1 0 22 Apr 2024
Crafting Knowledge: Exploring the Creative Mechanisms of Chat-Based Search Engines Lijia Ma Xingchen Xu Yong-Ming Tan 32 7 0 29 Feb 2024
Explaining Probabilistic Models with Distributional Values Luca Franceschi Michele Donini Cédric Archambeau Matthias Seeger FAtt 21 2 0 15 Feb 2024
Large Language Model Agent for Hyper-Parameter Optimization Siyi Liu Chen Gao Yong Li 37 19 0 02 Feb 2024
Black-Box Access is Insufficient for Rigorous AI Audits Stephen Casper Carson Ezell Charlotte Siegmann Noam Kolt Taylor Lynn Curtis ... Michael Gerovitch David Bau Max Tegmark David M. Krueger Dylan Hadfield-Menell AAML 17 76 0 25 Jan 2024
Mathematical Algorithm Design for Deep Learning under Societal and Judicial Constraints: The Algorithmic Transparency Requirement Holger Boche Adalbert Fono Gitta Kutyniok FaML 23 4 0 18 Jan 2024
B-Cos Aligned Transformers Learn Human-Interpretable Features Manuel Tran Amal Lahiani Yashin Dicente Cid Melanie Boxberg Peter Lienemann C. Matek S. J. Wagner Fabian J. Theis Eldad Klaiman Tingying Peng MedIm ViT 13 2 0 16 Jan 2024
Labeling Neural Representations with Inverse Recognition Kirill Bykov Laura Kopf Shinichi Nakajima Marius Kloft Marina M.-C. Höhne BDL 19 15 0 22 Nov 2023
On the Relationship Between Interpretability and Explainability in Machine Learning Benjamin Leblanc Pascal Germain FaML 24 0 0 20 Nov 2023
A novel post-hoc explanation comparison metric and applications Shreyan Mitra Leilani H. Gilpin FAtt 26 0 0 17 Nov 2023
Deep Natural Language Feature Learning for Interpretable Prediction Felipe Urrutia Cristian Buc Valentin Barriere 23 1 0 09 Nov 2023
Notion of Explainable Artificial Intelligence -- An Empirical Investigation from A Users Perspective A. Haque A. Najmul Islam Patrick Mikalef 21 1 0 01 Nov 2023
Scene Text Recognition Models Explainability Using Local Features M. Ty Rowel Atienza 26 1 0 14 Oct 2023
Interpretability is not Explainability: New Quantitative XAI Approach with a focus on Recommender Systems in Education Riccardo Porcedda XAI 15 0 0 18 Sep 2023
WSAM: Visual Explanations from Style Augmentation as Adversarial Attacker and Their Influence in Image Classification Felipe Moreno-Vera E. Medina Jorge Poco 19 2 0 29 Aug 2023
RecRec: Algorithmic Recourse for Recommender Systems Sahil Verma Ashudeep Singh Varich Boonsanong John P. Dickerson Chirag Shah 25 1 0 28 Aug 2023
ASCAPE: An open AI ecosystem to support the quality of life of cancer patients Konstantinos Lampropoulos T. Kosmidis Serge Autexier Miloš Savić Manos Athanatos Miltiadis Kokkonidis Tzortzia Koutsouri A. Vizitiu A. Valachis Miriam Quintero Padron 11 4 0 28 Aug 2023
Do Models Explain Themselves? Counterfactual Simulatability of Natural Language Explanations Yanda Chen Ruiqi Zhong Narutatsu Ri Chen Zhao He He Jacob Steinhardt Zhou Yu Kathleen McKeown LRM 24 47 0 17 Jul 2023
Reliable AI: Does the Next Generation Require Quantum Computing? Aras Bacho Holger Boche Gitta Kutyniok 24 2 0 03 Jul 2023
Explainable Predictive Maintenance Sepideh Pashami Sławomir Nowaczyk Yuantao Fan Jakub Jakubowski Nuno Paiva ... Bruno Veloso M. Sayed-Mouchaweh L. Rajaoarisoa Grzegorz J. Nalepa João Gama 27 8 0 08 Jun 2023
A Review on Explainable Artificial Intelligence for Healthcare: Why, How, and When? M. Rubaiyat Hossain Mondal Prajoy Podder 13 56 0 10 Apr 2023
Posthoc Interpretation via Quantization Francesco Paissan Cem Subakan Mirco Ravanelli MQ 11 6 0 22 Mar 2023
Intelligent diagnostic scheme for lung cancer screening with Raman spectra data by tensor network machine learning Yujia An Shengxing Bai Lin Cheng Xiao‐Guang Li Cheng Wang Xiao Han Gang Su Shi-Ju Ran Cong Wang 6 1 0 11 Mar 2023
A System's Approach Taxonomy for User-Centred XAI: A Survey Ehsan Emamirad Pouya Ghiasnezhad Omran A. Haller S. Gregor 21 1 0 06 Mar 2023
Less is More: The Influence of Pruning on the Explainability of CNNs David Weber F. Merkle Pascal Schöttle Stephan Schlögl Martin Nocker FAtt 29 1 0 17 Feb 2023
Understanding User Preferences in Explainable Artificial Intelligence: A Survey and a Mapping Function Proposal M. Hashemi Ali Darejeh Francisco Cruz 37 3 0 07 Feb 2023
Fixed-kinetic Neural Hamiltonian Flows for enhanced interpretability and reduced complexity Vincent Souveton Arnaud Guillin J. Jasche G. Lavaux Manon Michel 16 3 0 03 Feb 2023
SoK: A Systematic Evaluation of Backdoor Trigger Characteristics in Image Classification Gorka Abad Jing Xu Stefanos Koffas Behrad Tajalli S. Picek Mauro Conti AAML 54 5 0 03 Feb 2023
Faithful Chain-of-Thought Reasoning Qing Lyu Shreya Havaldar Adam Stein Li Zhang D. Rao Eric Wong Marianna Apidianaki Chris Callison-Burch ReLM LRM 19 207 0 31 Jan 2023