ResearchTrend.AI
© 2025 ResearchTrend.AI, All rights reserved.

arXiv:2004.03685
Towards Faithfully Interpretable NLP Systems: How should we define and evaluate faithfulness?
Alon Jacovi, Yoav Goldberg · 7 April 2020 · XAI

Papers citing "Towards Faithfully Interpretable NLP Systems: How should we define and evaluate faithfulness?"

50 / 130 papers shown
Trustworthy Social Bias Measurement · Rishi Bommasani, Percy Liang · 20 Dec 2022
Evaluating Human-Language Model Interaction · Mina Lee, Megha Srivastava, Amelia Hardy, John Thickstun, Esin Durmus, ..., Hancheng Cao, Tony Lee, Rishi Bommasani, Michael S. Bernstein, Percy Liang · LM&MA, ALM · 19 Dec 2022
SEAT: Stable and Explainable Attention · Lijie Hu, Yixin Liu, Ninghao Liu, Mengdi Huai, Lichao Sun, Di Wang · OOD · 23 Nov 2022
Easy to Decide, Hard to Agree: Reducing Disagreements Between Saliency Methods · Josip Jukić, Martin Tutek, Jan Snajder · FAtt · 15 Nov 2022
GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective · Linyi Yang, Shuibai Zhang, Libo Qin, Yafu Li, Yidong Wang, Hanmeng Liu, Jindong Wang, Xingxu Xie, Yue Zhang · ELM · 15 Nov 2022
Effective Cross-Task Transfer Learning for Explainable Natural Language Inference with T5 · Irina Bigoulaeva, Rachneet Sachdeva, Harish Tayyar Madabushi, Aline Villavicencio, Iryna Gurevych · LRM · 31 Oct 2022
ExPUNations: Augmenting Puns with Keywords and Explanations · Jiao Sun, Anjali Narayan-Chen, Shereen Oraby, Alessandra Cervone, Tagyoung Chung, Jing Huang, Yang Liu, Nanyun Peng · 24 Oct 2022
Generating Hierarchical Explanations on Text Classification Without Connecting Rules · Yiming Ju, Yuanzhe Zhang, Kang Liu, Jun Zhao · FAtt · 24 Oct 2022
Explainable Slot Type Attentions to Improve Joint Intent Detection and Slot Filling · Kalpa Gunaratna, Vijay Srinivasan, Akhila Yerukola, Hongxia Jin · 19 Oct 2022
Mitigating Covertly Unsafe Text within Natural Language Systems · Alex Mei, Anisha Kabir, Sharon Levy, Melanie Subbiah, Emily Allaway, J. Judge, D. Patton, Bruce Bimber, Kathleen McKeown, William Yang Wang · 17 Oct 2022
StyLEx: Explaining Style Using Human Lexical Annotations · Shirley Anugrah Hayati, Kyumin Park, Dheeraj Rajagopal, Lyle Ungar, Dongyeop Kang · 14 Oct 2022
Honest Students from Untrusted Teachers: Learning an Interpretable Question-Answering Pipeline from a Pretrained Language Model · Jacob Eisenstein, D. Andor, Bernd Bohnet, Michael Collins, David M. Mimno · LRM · 05 Oct 2022
Causal Proxy Models for Concept-Based Model Explanations · Zhengxuan Wu, Karel D'Oosterlinck, Atticus Geiger, Amir Zur, Christopher Potts · MILM · 28 Sep 2022
WildQA: In-the-Wild Video Question Answering · Santiago Castro, Naihao Deng, Pingxuan Huang, Mihai Burzo, Rada Mihalcea · 14 Sep 2022
ferret: a Framework for Benchmarking Explainers on Transformers · Giuseppe Attanasio, Eliana Pastor, C. Bonaventura, Debora Nozza · 02 Aug 2022
An Interpretability Evaluation Benchmark for Pre-trained Language Models · Ya-Ming Shen, Lijie Wang, Ying Chen, Xinyan Xiao, Jing Liu, Hua-Hong Wu · 28 Jul 2022
Beware the Rationalization Trap! When Language Model Explainability Diverges from our Mental Models of Language · R. Sevastjanova, Mennatallah El-Assady · LRM · 14 Jul 2022
FRAME: Evaluating Rationale-Label Consistency Metrics for Free-Text Rationales · Aaron Chan, Shaoliang Nie, Liang Tan, Xiaochang Peng, Hamed Firooz, Maziar Sanjabi, Xiang Ren · 02 Jul 2022
Mediators: Conversational Agents Explaining NLP Model Behavior · Nils Feldhus, A. Ravichandran, Sebastian Möller · 13 Jun 2022
Fooling Explanations in Text Classifiers · Adam Ivankay, Ivan Girardi, Chiara Marchiori, P. Frossard · AAML · 07 Jun 2022
Attribution-based Explanations that Provide Recourse Cannot be Robust · H. Fokkema, R. D. Heide, T. Erven · FAtt · 31 May 2022
Argumentative Explanations for Pattern-Based Text Classifiers · Piyawat Lertvittayakumjorn, Francesca Toni · 22 May 2022
ExSum: From Local Explanations to Model Understanding · Yilun Zhou, Marco Tulio Ribeiro, J. Shah · FAtt, LRM · 30 Apr 2022
Can Rationalization Improve Robustness? · Howard Chen, Jacqueline He, Karthik Narasimhan, Danqi Chen · AAML · 25 Apr 2022
Learning to Scaffold: Optimizing Model Explanations for Teaching · Patrick Fernandes, Marcos Vinícius Treviso, Danish Pruthi, André F. T. Martins, Graham Neubig · FAtt · 22 Apr 2022
Calibrating Trust of Multi-Hop Question Answering Systems with Decompositional Probes · Kaige Xie, Sarah Wiegreffe, Mark O. Riedl · ReLM · 16 Apr 2022
ProtoTEx: Explaining Model Decisions with Prototype Tensors · Anubrata Das, Chitrank Gupta, Venelin Kovatchev, Matthew Lease, J. Li · 11 Apr 2022
Using Interactive Feedback to Improve the Accuracy and Explainability of Question Answering Systems Post-Deployment · Zichao Li, Prakhar Sharma, Xing Han Lù, Jackie C.K. Cheung, Siva Reddy · HAI · 06 Apr 2022
Interpretation of Black Box NLP Models: A Survey · Shivani Choudhary, N. Chatterjee, S. K. Saha · FAtt · 31 Mar 2022
Towards Explainable Evaluation Metrics for Natural Language Generation · Christoph Leiter, Piyawat Lertvittayakumjorn, M. Fomicheva, Wei-Ye Zhao, Yang Gao, Steffen Eger · AAML, ELM · 21 Mar 2022
FaiRR: Faithful and Robust Deductive Reasoning over Natural Language · Soumya Sanyal, Harman Singh, Xiang Ren · ReLM, LRM · 19 Mar 2022
Explainability in Graph Neural Networks: An Experimental Survey · Peibo Li, Yixing Yang, M. Pagnucco, Yang Song · 17 Mar 2022
A Novel Perspective to Look At Attention: Bi-level Attention-based Explainable Topic Modeling for News Classification · Dairui Liu, Derek Greene, Ruihai Dong · 14 Mar 2022
Don't Lie to Me! Robust and Efficient Explainability with Verified Perturbation Analysis · Thomas Fel, Mélanie Ducoffe, David Vigouroux, Rémi Cadène, Mikael Capelle, C. Nicodeme, Thomas Serre · AAML · 15 Feb 2022
DermX: an end-to-end framework for explainable automated dermatological diagnosis · Raluca Jalaboi, F. Faye, Mauricio Orbes-Arteaga, D. Jørgensen, Ole Winther, A. Galimzianova · MedIm · 14 Feb 2022
Exploring Transformer Backbones for Heterogeneous Treatment Effect Estimation · Yi-Fan Zhang, Hanlin Zhang, Zachary Chase Lipton, Li Erran Li, Eric P. Xing · OODD · 02 Feb 2022
Diagnosing AI Explanation Methods with Folk Concepts of Behavior · Alon Jacovi, Jasmijn Bastings, Sebastian Gehrmann, Yoav Goldberg, Katja Filippova · 27 Jan 2022
Natural Language Deduction through Search over Statement Compositions · Kaj Bostrom, Zayne Sprague, Swarat Chaudhuri, Greg Durrett · ReLM, LRM · 16 Jan 2022
UNIREX: A Unified Learning Framework for Language Model Rationale Extraction · Aaron Chan, Maziar Sanjabi, Lambert Mathias, L Tan, Shaoliang Nie, Xiaochang Peng, Xiang Ren, Hamed Firooz · 16 Dec 2021
Sparse Interventions in Language Models with Differentiable Masking · Nicola De Cao, Leon Schmid, Dieuwke Hupkes, Ivan Titov · 13 Dec 2021
Explainable Deep Learning in Healthcare: A Methodological Survey from an Attribution View · Di Jin, Elena Sergeeva, W. Weng, Geeticka Chauhan, Peter Szolovits · OOD · 05 Dec 2021
Inducing Causal Structure for Interpretable Neural Networks · Atticus Geiger, Zhengxuan Wu, Hanson Lu, J. Rozner, Elisa Kreiss, Thomas F. Icard, Noah D. Goodman, Christopher Potts · CML, OOD · 01 Dec 2021
A Survey on the Robustness of Feature Importance and Counterfactual Explanations · Saumitra Mishra, Sanghamitra Dutta, Jason Long, Daniele Magazzeni · AAML · 30 Oct 2021
Interpreting Deep Learning Models in Natural Language Processing: A Review · Xiaofei Sun, Diyi Yang, Xiaoya Li, Tianwei Zhang, Yuxian Meng, Han Qiu, Guoyin Wang, Eduard H. Hovy, Jiwei Li · 20 Oct 2021
Evaluating the Faithfulness of Importance Measures in NLP by Recursively Masking Allegedly Important Tokens and Retraining · Andreas Madsen, Nicholas Meade, Vaibhav Adlakha, Siva Reddy · 15 Oct 2021
Distantly-Supervised Evidence Retrieval Enables Question Answering without Evidence Annotation · Chen Zhao, Chenyan Xiong, Jordan L. Boyd-Graber, Hal Daumé · RALM · 10 Oct 2021
Decision-Focused Summarization · Chao-Chun Hsu, Chenhao Tan · 14 Sep 2021
Summarize-then-Answer: Generating Concise Explanations for Multi-hop Reading Comprehension · Naoya Inoue, H. Trivedi, Steven K. Sinha, Niranjan Balasubramanian, Kentaro Inui · 14 Sep 2021
Diagnostics-Guided Explanation Generation · Pepa Atanasova, J. Simonsen, Christina Lioma, Isabelle Augenstein · LRM, FAtt · 08 Sep 2021
Counterfactual Evaluation for Explainable AI · Yingqiang Ge, Shuchang Liu, Zelong Li, Shuyuan Xu, Shijie Geng, Yunqi Li, Juntao Tan, Fei Sun, Yongfeng Zhang · CML · 05 Sep 2021