Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2104.15135
Cited By
v1
v2
v3 (latest)
Explanation-Based Human Debugging of NLP Models: A Survey
Transactions of the Association for Computational Linguistics (TACL), 2021
30 April 2021
Piyawat Lertvittayakumjorn
Francesca Toni
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Explanation-Based Human Debugging of NLP Models: A Survey"
50 / 55 papers shown
Title
Bridging Fairness and Explainability: Can Input-Based Explanations Promote Fairness in Hate Speech Detection?
Yifan Wang
Mayank Jobanputra
Ji-Ung Lee
Soyoung Oh
Isabel Valera
Vera Demberg
106
1
0
26 Sep 2025
Attribution Explanations for Deep Neural Networks: A Theoretical Perspective
Huiqi Deng
Hongbin Pei
Quanshi Zhang
Mengnan Du
FAtt
146
1
0
11 Aug 2025
Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers
Lam Nguyen Tung
Steven Cho
Xiaoning Du
Neelofar Neelofar
Valerio Terragni
Stefano Ruberto
Aldeida Aleti
1.1K
2
0
30 Oct 2024
To Err Is AI! Debugging as an Intervention to Facilitate Appropriate Reliance on AI Systems
ACM Conference on Hypertext & Social Media (HT), 2024
Gaole He
Abri Bharos
U. Gadiraju
194
5
0
22 Sep 2024
Joint Universal Adversarial Perturbations with Interpretations
Liang-bo Ning
Zeyu Dai
Wenqi Fan
Jingran Su
Chao Pan
Luning Wang
Qing Li
AAML
254
3
0
03 Aug 2024
On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs
Nitay Calderon
Roi Reichart
302
23
0
27 Jul 2024
A look under the hood of the Interactive Deep Learning Enterprise (No-IDLE)
Daniel Sonntag
Michael Barz
Thiago S. Gouvêa
VLM
224
6
0
27 Jun 2024
CoXQL: A Dataset for Parsing Explanation Requests in Conversational XAI Systems
Qianli Wang
Tatiana Anikina
Nils Feldhus
Simon Ostermann
Sebastian Möller
254
4
0
12 Jun 2024
Helpful or Harmful Data? Fine-tuning-free Shapley Attribution for Explaining Language Model Predictions
International Conference on Machine Learning (ICML), 2024
Jingtan Wang
Xiaoqiang Lin
Rui Qiao
Chuan-Sheng Foo
Bryan Kian Hsiang Low
TDI
164
8
0
07 Jun 2024
Contestable AI needs Computational Argumentation
International Conference on Principles of Knowledge Representation and Reasoning (KR), 2024
Francesco Leofante
Hamed Ayoobi
Adam Dejl
Gabriel Freedman
Deniz Gorur
...
Anna Rapberger
Fabrizio Russo
Xiang Yin
Dekai Zhang
Francesca Toni
204
12
0
17 May 2024
Facilitating Opinion Diversity through Hybrid NLP Approaches
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Michiel van der Meer
265
3
0
15 May 2024
Properties and Challenges of LLM-Generated Explanations
Jenny Kunz
Marco Kuhlmann
189
29
0
16 Feb 2024
ALMANACS: A Simulatability Benchmark for Language Model Explainability
Edmund Mills
Shiye Su
Stuart J. Russell
Scott Emmons
461
9
0
20 Dec 2023
What if you said that differently?: How Explanation Formats Affect Human Feedback Efficacy and User Perception
Chaitanya Malaviya
Subin Lee
Dan Roth
Mark Yatskar
223
2
0
16 Nov 2023
Interpretable by Design: Wrapper Boxes Combine Neural Performance with Faithful Attribution of Model Decisions to Training Data
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2023
Yiheng Su
Junyi Jessy Li
Matthew Lease
AAML
FAtt
148
1
0
15 Nov 2023
QualEval: Qualitative Evaluation for Model Improvement
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Vishvak Murahari
Ameet Deshpande
Peter Clark
Tanmay Rajpurohit
Ashish Sabharwal
Karthik Narasimhan
Ashwin Kalyan
190
8
0
06 Nov 2023
Explanation-based Training with Differentiable Insertion/Deletion Metric-aware Regularizers
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Yuya Yoshikawa
Tomoharu Iwata
207
1
0
19 Oct 2023
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
AAAI Conference on Artificial Intelligence (AAAI), 2023
Maciej Besta
Nils Blach
Aleš Kubíček
Robert Gerstenberger
Michal Podstawski
...
Joanna Gajda
Tomasz Lehmann
H. Niewiadomski
Piotr Nyczyk
Torsten Hoefler
LRM
AI4CE
LM&Ro
485
1,004
0
18 Aug 2023
Human-centered NLP Fact-checking: Co-Designing with Fact-checkers using Matchmaking for AI
Houjiang Liu
Anubrata Das
Alexander Boltz
Didi Zhou
Daisy Pinaroc
Matthew Lease
Min Kyung Lee
HAI
232
26
0
14 Aug 2023
Towards Explainable Evaluation Metrics for Machine Translation
Journal of machine learning research (JMLR), 2023
Christoph Leiter
Piyawat Lertvittayakumjorn
M. Fomicheva
Wei Zhao
Yang Gao
Steffen Eger
ELM
264
23
0
22 Jun 2023
Disentanglement via Latent Quantization
Neural Information Processing Systems (NeurIPS), 2023
Kyle Hsu
W. Dorrell
James C. R. Whittington
Jiajun Wu
Chelsea Finn
DRL
293
34
0
28 May 2023
Interpretation of Time-Series Deep Models: A Survey
Ziqi Zhao
Yucheng Shi
Shushan Wu
Fan Yang
Wenzhan Song
Ninghao Liu
AI4TS
257
13
0
23 May 2023
Are Your Explanations Reliable? Investigating the Stability of LIME in Explaining Text Classifiers by Marrying XAI and Adversarial Attack
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Christopher Burger
Lingwei Chen
Thai Le
FAtt
AAML
192
16
0
21 May 2023
ConvXAI: Delivering Heterogeneous AI Explanations via Conversations to Support Human-AI Scientific Writing
Hua Shen
Huang Chieh-Yang
Tongshuang Wu
Ting-Hao 'Kenneth' Huang
329
43
0
16 May 2023
Multi-resolution Interpretation and Diagnostics Tool for Natural Language Classifiers
P. Jalali
Nengfeng Zhou
Yufei Yu
AAML
125
0
0
06 Mar 2023
IFAN: An Explainability-Focused Interaction Framework for Humans and NLP Models
International Joint Conference on Natural Language Processing (IJCNLP), 2023
Edoardo Mosca
Daryna Dementieva
Tohid Ebrahim Ajdari
Maximilian Kummeth
Kirill Gringauz
Yutong Zhou
Georg Groh
220
12
0
06 Mar 2023
Cross-lingual German Biomedical Information Extraction: from Zero-shot to Human-in-the-Loop
Siting Liang
Mareike Hartmann
Daniel Sonntag
146
3
0
24 Jan 2023
Explainability of Text Processing and Retrieval Methods: A Survey
Sourav Saha
Debapriyo Majumdar
Mandar Mitra
258
5
0
14 Dec 2022
Going Beyond XAI: A Systematic Survey for Explanation-Guided Learning
ACM Computing Surveys (ACM CSUR), 2022
Yuyang Gao
Siyi Gu
Junji Jiang
S. Hong
Dazhou Yu
Bo Pan
251
54
0
07 Dec 2022
GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Linyi Yang
Shuibai Zhang
Libo Qin
Yafu Li
Yidong Wang
Hanmeng Liu
Yongfeng Zhang
Xingxu Xie
Yue Zhang
ELM
584
96
0
15 Nov 2022
Understanding Text Classification Data and Models Using Aggregated Input Salience
Sebastian Ebert
Alice Shoshana Jakobovits
Katja Filippova
FAtt
271
3
0
10 Nov 2022
XMD: An End-to-End Framework for Interactive Explanation-Based Debugging of NLP Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Dong-Ho Lee
Akshen Kadakia
Brihi Joshi
Aaron Chan
Ziyi Liu
...
Takashi Shibuya
Ryosuke Mitani
Toshiyuki Sekiya
Jay Pujara
Xiang Ren
LRM
168
11
0
30 Oct 2022
Cascading Biases: Investigating the Effect of Heuristic Annotation Strategies on Data and Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Chaitanya Malaviya
Sudeep Bhatia
Mark Yatskar
168
4
0
24 Oct 2022
On the Explainability of Natural Language Processing Deep Models
ACM Computing Surveys (ACM CSUR), 2022
Julia El Zini
M. Awad
216
105
0
13 Oct 2022
Leveraging Explanations in Interactive Machine Learning: An Overview
Frontiers in Artificial Intelligence (FAI), 2022
Stefano Teso
Öznur Alkan
Wolfgang Stammer
Elizabeth M. Daly
XAI
FAtt
LRM
485
75
0
29 Jul 2022
Human-Centric Research for NLP: Towards a Definition and Guiding Questions
Bhushan Kotnis
Kiril Gashteovski
J. Gastinger
G. Serra
Francesco Alesiani
T. Sztyler
Ammar Shaker
Na Gong
Carolin (Haas) Lawrence
Zhao Xu
138
11
0
10 Jul 2022
Challenges in Applying Explainability Methods to Improve the Fairness of NLP Models
Esma Balkir
S. Kiritchenko
I. Nejadgholi
Kathleen C. Fraser
257
40
0
08 Jun 2022
Concept-level Debugging of Part-Prototype Networks
International Conference on Learning Representations (ICLR), 2022
A. Bontempelli
Stefano Teso
Katya Tentori
Fausto Giunchiglia
Baptiste Caramiaux
313
59
0
31 May 2022
Argumentative Explanations for Pattern-Based Text Classifiers
Piyawat Lertvittayakumjorn
Francesca Toni
244
5
0
22 May 2022
Causal Discovery and Knowledge Injection for Contestable Neural Networks (with Appendices)
European Conference on Artificial Intelligence (ECAI), 2022
Fabrizio Russo
Francesca Toni
CML
218
8
0
19 May 2022
SparCAssist: A Model Risk Assessment Assistant Based on Sparse Generated Counterfactuals
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2022
Zijian Zhang
Vinay Setty
Avishek Anand
145
7
0
03 May 2022
A survey on improving NLP models with human explanations
Mareike Hartmann
Daniel Sonntag
LRM
191
26
0
19 Apr 2022
Can language models learn from explanations in context?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Andrew Kyle Lampinen
Ishita Dasgupta
Stephanie C. Y. Chan
Kory Matthewson
Michael Henry Tessler
Antonia Creswell
James L. McClelland
Jane X. Wang
Felix Hill
LRM
ReLM
512
347
0
05 Apr 2022
A Rationale-Centric Framework for Human-in-the-loop Machine Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jinghui Lu
Linyi Yang
Brian Mac Namee
Yue Zhang
161
43
0
24 Mar 2022
Towards Explainable Evaluation Metrics for Natural Language Generation
Christoph Leiter
Piyawat Lertvittayakumjorn
M. Fomicheva
Wei Zhao
Yang Gao
Steffen Eger
AAML
ELM
214
21
0
21 Mar 2022
A Survey of Adversarial Defences and Robustness in NLP
Shreyansh Goyal
Sumanth Doddapaneni
Mitesh M.Khapra
B. Ravindran
AAML
363
35
0
12 Mar 2022
Towards a Science of Human-AI Decision Making: A Survey of Empirical Studies
Vivian Lai
Chacha Chen
Q. V. Liao
Alison Smith-Renner
Chenhao Tan
249
208
0
21 Dec 2021
Tell me why! Explanations support learning relational and causal structure
Andrew Kyle Lampinen
Nicholas A. Roy
Ishita Dasgupta
Stephanie C. Y. Chan
Allison C. Tam
...
Chen Yan
Adam Santoro
Neil C. Rabinowitz
Jane X. Wang
Felix Hill
305
49
0
07 Dec 2021
What to Learn, and How: Toward Effective Learning from Rationales
Samuel Carton
Surya Kanoria
Chenhao Tan
375
28
0
30 Nov 2021
Interpreting Deep Learning Models in Natural Language Processing: A Review
Xiaofei Sun
Diyi Yang
Xiaoya Li
Tianwei Zhang
Yuxian Meng
Han Qiu
Guoyin Wang
Eduard H. Hovy
Jiwei Li
179
51
0
20 Oct 2021
1
2
Next