v1v2v3 (latest)

Explanation-Based Human Debugging of NLP Models: A Survey

Transactions of the Association for Computational Linguistics (TACL), 2021

30 April 2021

Piyawat Lertvittayakumjorn

Francesca Toni

LRM

ArXiv (abs)PDF HTML

Papers citing "Explanation-Based Human Debugging of NLP Models: A Survey"

50 / 55 papers shown

Title
Bridging Fairness and Explainability: Can Input-Based Explanations Promote Fairness in Hate Speech Detection? Yifan Wang Mayank Jobanputra Ji-Ung Lee Soyoung Oh Isabel Valera Vera Demberg 106 1 0 26 Sep 2025
Attribution Explanations for Deep Neural Networks: A Theoretical Perspective Huiqi Deng Hongbin Pei Quanshi Zhang Mengnan Du FAtt 146 1 0 11 Aug 2025
Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers Lam Nguyen Tung Steven Cho Xiaoning Du Neelofar Neelofar Valerio Terragni Stefano Ruberto Aldeida Aleti 1.1K 2 0 30 Oct 2024
To Err Is AI! Debugging as an Intervention to Facilitate Appropriate Reliance on AI SystemsACM Conference on Hypertext & Social Media (HT), 2024 Gaole He Abri Bharos U. Gadiraju 194 5 0 22 Sep 2024
Joint Universal Adversarial Perturbations with Interpretations Liang-bo Ning Zeyu Dai Wenqi Fan Jingran Su Chao Pan Luning Wang Qing Li AAML 254 3 0 03 Aug 2024
On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs Nitay Calderon Roi Reichart 302 23 0 27 Jul 2024
A look under the hood of the Interactive Deep Learning Enterprise (No-IDLE) Daniel Sonntag Michael Barz Thiago S. Gouvêa VLM 224 6 0 27 Jun 2024
CoXQL: A Dataset for Parsing Explanation Requests in Conversational XAI Systems Qianli Wang Tatiana Anikina Nils Feldhus Simon Ostermann Sebastian Möller 254 4 0 12 Jun 2024
Helpful or Harmful Data? Fine-tuning-free Shapley Attribution for Explaining Language Model PredictionsInternational Conference on Machine Learning (ICML), 2024 Jingtan Wang Xiaoqiang Lin Rui Qiao Chuan-Sheng Foo Bryan Kian Hsiang Low TDI 164 8 0 07 Jun 2024
Contestable AI needs Computational ArgumentationInternational Conference on Principles of Knowledge Representation and Reasoning (KR), 2024 Francesco Leofante Hamed Ayoobi Adam Dejl Gabriel Freedman Deniz Gorur ... Anna Rapberger Fabrizio Russo Xiang Yin Dekai Zhang Francesca Toni 204 12 0 17 May 2024
Facilitating Opinion Diversity through Hybrid NLP ApproachesNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 Michiel van der Meer 265 3 0 15 May 2024
Properties and Challenges of LLM-Generated Explanations Jenny Kunz Marco Kuhlmann 189 29 0 16 Feb 2024
ALMANACS: A Simulatability Benchmark for Language Model Explainability Edmund Mills Shiye Su Stuart J. Russell Scott Emmons 461 9 0 20 Dec 2023
What if you said that differently?: How Explanation Formats Affect Human Feedback Efficacy and User Perception Chaitanya Malaviya Subin Lee Dan Roth Mark Yatskar 223 2 0 16 Nov 2023
Interpretable by Design: Wrapper Boxes Combine Neural Performance with Faithful Attribution of Model Decisions to Training DataBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2023 Yiheng Su Junyi Jessy Li Matthew Lease AAML FAtt 148 1 0 15 Nov 2023
QualEval: Qualitative Evaluation for Model ImprovementNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023 Vishvak Murahari Ameet Deshpande Peter Clark Tanmay Rajpurohit Ashish Sabharwal Karthik Narasimhan Ashwin Kalyan 190 8 0 06 Nov 2023
Explanation-based Training with Differentiable Insertion/Deletion Metric-aware RegularizersInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023 Yuya Yoshikawa Tomoharu Iwata 207 1 0 19 Oct 2023
Graph of Thoughts: Solving Elaborate Problems with Large Language ModelsAAAI Conference on Artificial Intelligence (AAAI), 2023 Maciej Besta Nils Blach Aleš Kubíček Robert Gerstenberger Michal Podstawski ... Joanna Gajda Tomasz Lehmann H. Niewiadomski Piotr Nyczyk Torsten Hoefler LRM AI4CE LM&Ro 485 1,004 0 18 Aug 2023
Human-centered NLP Fact-checking: Co-Designing with Fact-checkers using Matchmaking for AI Houjiang Liu Anubrata Das Alexander Boltz Didi Zhou Daisy Pinaroc Matthew Lease Min Kyung Lee HAI 232 26 0 14 Aug 2023
Towards Explainable Evaluation Metrics for Machine TranslationJournal of machine learning research (JMLR), 2023 Christoph Leiter Piyawat Lertvittayakumjorn M. Fomicheva Wei Zhao Yang Gao Steffen Eger ELM 264 23 0 22 Jun 2023
Disentanglement via Latent QuantizationNeural Information Processing Systems (NeurIPS), 2023 Kyle Hsu W. Dorrell James C. R. Whittington Jiajun Wu Chelsea Finn DRL 293 34 0 28 May 2023
Interpretation of Time-Series Deep Models: A Survey Ziqi Zhao Yucheng Shi Shushan Wu Fan Yang Wenzhan Song Ninghao Liu AI4TS 257 13 0 23 May 2023
Are Your Explanations Reliable? Investigating the Stability of LIME in Explaining Text Classifiers by Marrying XAI and Adversarial AttackConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 Christopher Burger Lingwei Chen Thai Le FAtt AAML 192 16 0 21 May 2023
ConvXAI: Delivering Heterogeneous AI Explanations via Conversations to Support Human-AI Scientific Writing Hua Shen Huang Chieh-Yang Tongshuang Wu Ting-Hao 'Kenneth' Huang 329 43 0 16 May 2023
Multi-resolution Interpretation and Diagnostics Tool for Natural Language Classifiers P. Jalali Nengfeng Zhou Yufei Yu AAML 125 0 0 06 Mar 2023
IFAN: An Explainability-Focused Interaction Framework for Humans and NLP ModelsInternational Joint Conference on Natural Language Processing (IJCNLP), 2023 Edoardo Mosca Daryna Dementieva Tohid Ebrahim Ajdari Maximilian Kummeth Kirill Gringauz Yutong Zhou Georg Groh 220 12 0 06 Mar 2023
Cross-lingual German Biomedical Information Extraction: from Zero-shot to Human-in-the-Loop Siting Liang Mareike Hartmann Daniel Sonntag 146 3 0 24 Jan 2023
Explainability of Text Processing and Retrieval Methods: A Survey Sourav Saha Debapriyo Majumdar Mandar Mitra 258 5 0 14 Dec 2022
Going Beyond XAI: A Systematic Survey for Explanation-Guided LearningACM Computing Surveys (ACM CSUR), 2022 Yuyang Gao Siyi Gu Junji Jiang S. Hong Dazhou Yu Bo Pan 251 54 0 07 Dec 2022
GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization PerspectiveAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 Linyi Yang Shuibai Zhang Libo Qin Yafu Li Yidong Wang Hanmeng Liu Yongfeng Zhang Xingxu Xie Yue Zhang ELM 584 96 0 15 Nov 2022
Understanding Text Classification Data and Models Using Aggregated Input Salience Sebastian Ebert Alice Shoshana Jakobovits Katja Filippova FAtt 271 3 0 10 Nov 2022
XMD: An End-to-End Framework for Interactive Explanation-Based Debugging of NLP ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 Dong-Ho Lee Akshen Kadakia Brihi Joshi Aaron Chan Ziyi Liu ... Takashi Shibuya Ryosuke Mitani Toshiyuki Sekiya Jay Pujara Xiang Ren LRM 168 11 0 30 Oct 2022
Cascading Biases: Investigating the Effect of Heuristic Annotation Strategies on Data and ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022 Chaitanya Malaviya Sudeep Bhatia Mark Yatskar 168 4 0 24 Oct 2022
On the Explainability of Natural Language Processing Deep ModelsACM Computing Surveys (ACM CSUR), 2022 Julia El Zini M. Awad 216 105 0 13 Oct 2022
Leveraging Explanations in Interactive Machine Learning: An OverviewFrontiers in Artificial Intelligence (FAI), 2022 Stefano Teso Öznur Alkan Wolfgang Stammer Elizabeth M. Daly XAI FAtt LRM 485 75 0 29 Jul 2022
Human-Centric Research for NLP: Towards a Definition and Guiding Questions Bhushan Kotnis Kiril Gashteovski J. Gastinger G. Serra Francesco Alesiani T. Sztyler Ammar Shaker Na Gong Carolin (Haas) Lawrence Zhao Xu 138 11 0 10 Jul 2022
Challenges in Applying Explainability Methods to Improve the Fairness of NLP Models Esma Balkir S. Kiritchenko I. Nejadgholi Kathleen C. Fraser 257 40 0 08 Jun 2022
Concept-level Debugging of Part-Prototype NetworksInternational Conference on Learning Representations (ICLR), 2022 A. Bontempelli Stefano Teso Katya Tentori Fausto Giunchiglia Baptiste Caramiaux 313 59 0 31 May 2022
Argumentative Explanations for Pattern-Based Text Classifiers Piyawat Lertvittayakumjorn Francesca Toni 244 5 0 22 May 2022
Causal Discovery and Knowledge Injection for Contestable Neural Networks (with Appendices)European Conference on Artificial Intelligence (ECAI), 2022 Fabrizio Russo Francesca Toni CML 218 8 0 19 May 2022
SparCAssist: A Model Risk Assessment Assistant Based on Sparse Generated CounterfactualsAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2022 Zijian Zhang Vinay Setty Avishek Anand 145 7 0 03 May 2022
A survey on improving NLP models with human explanations Mareike Hartmann Daniel Sonntag LRM 191 26 0 19 Apr 2022
Can language models learn from explanations in context?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022 Andrew Kyle Lampinen Ishita Dasgupta Stephanie C. Y. Chan Kory Matthewson Michael Henry Tessler Antonia Creswell James L. McClelland Jane X. Wang Felix Hill LRM ReLM 512 347 0 05 Apr 2022
A Rationale-Centric Framework for Human-in-the-loop Machine LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 Jinghui Lu Linyi Yang Brian Mac Namee Yue Zhang 161 43 0 24 Mar 2022
Towards Explainable Evaluation Metrics for Natural Language Generation Christoph Leiter Piyawat Lertvittayakumjorn M. Fomicheva Wei Zhao Yang Gao Steffen Eger AAML ELM 214 21 0 21 Mar 2022
A Survey of Adversarial Defences and Robustness in NLP Shreyansh Goyal Sumanth Doddapaneni Mitesh M.Khapra B. Ravindran AAML 363 35 0 12 Mar 2022
Towards a Science of Human-AI Decision Making: A Survey of Empirical Studies Vivian Lai Chacha Chen Q. V. Liao Alison Smith-Renner Chenhao Tan 249 208 0 21 Dec 2021
Tell me why! Explanations support learning relational and causal structure Andrew Kyle Lampinen Nicholas A. Roy Ishita Dasgupta Stephanie C. Y. Chan Allison C. Tam ... Chen Yan Adam Santoro Neil C. Rabinowitz Jane X. Wang Felix Hill 305 49 0 07 Dec 2021
What to Learn, and How: Toward Effective Learning from Rationales Samuel Carton Surya Kanoria Chenhao Tan 375 28 0 30 Nov 2021
Interpreting Deep Learning Models in Natural Language Processing: A Review Xiaofei Sun Diyi Yang Xiaoya Li Tianwei Zhang Yuxian Meng Han Qiu Guoyin Wang Eduard H. Hovy Jiwei Li 179 51 0 20 Oct 2021