Aligning Faithful Interpretations with their Social Attribution

1 June 2020

Papers citing "Aligning Faithful Interpretations with their Social Attribution"

39 / 39 papers shown

Title
Adversarial Cooperative Rationalization: The Risk of Spurious Correlations in Even Clean Datasets W. Liu Zhongyu Niu Lang Gao Zhiying Deng Jun Wang H. Wang Ruixuan Li 134 1 0 04 May 2025
Rubrik's Cube: Testing a New Rubric for Evaluating Explanations on the CUBE dataset Diana Galván-Sosa Gabrielle Gaudeau Pride Kavumba Yunmeng Li Hongyi gu Zheng Yuan Keisuke Sakaguchi P. Buttery LRM 35 0 0 31 Mar 2025
A Tale of Two Imperatives: Privacy and Explainability Supriya Manna Niladri Sett 94 0 0 30 Dec 2024
TabVer: Tabular Fact Verification with Natural Logic Rami Aly Andreas Vlachos LMTD 28 0 0 02 Nov 2024
Counterfactuals As a Means for Evaluating Faithfulness of Attribution Methods in Autoregressive Language Models Sepehr Kamahi Yadollah Yaghoobzadeh 48 0 0 21 Aug 2024
Explanation Regularisation through the Lens of Attributions Pedro Ferreira Wilker Aziz Ivan Titov 43 1 0 23 Jul 2024
Exploring the Trade-off Between Model Performance and Explanation Plausibility of Text Classifiers Using Human Rationales Lucas Resck Marcos M. Raimundo Jorge Poco 44 1 0 03 Apr 2024
Situated Natural Language Explanations Zining Zhu Hao Jiang Jingfeng Yang Sreyashi Nag Chao Zhang Jie Huang Yifan Gao Frank Rudzicz Bing Yin LRM 41 1 0 27 Aug 2023
DARE: Towards Robust Text Explanations in Biomedical and Healthcare Applications Adam Ivankay Mattia Rigotti P. Frossard OOD MedIm 21 1 0 05 Jul 2023
In Search of Verifiability: Explanations Rarely Enable Complementary Performance in AI-Advised Decision Making Raymond Fok Daniel S. Weld 24 61 0 12 May 2023
The State of Human-centered NLP Technology for Fact-checking Anubrata Das Houjiang Liu Venelin Kovatchev Matthew Lease HILM 19 61 0 08 Jan 2023
ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning O. Yu. Golovneva Moya Chen Spencer Poff Martin Corredor Luke Zettlemoyer Maryam Fazel-Zarandi Asli Celikyilmaz ReLM LRM 20 137 0 15 Dec 2022
RES: A Robust Framework for Guiding Visual Explanation Yuyang Gao Tong Sun Guangji Bai Siyi Gu S. Hong Liang Zhao FAtt AAML XAI 21 32 0 27 Jun 2022
Explanation-based Counterfactual Retraining(XCR): A Calibration Method for Black-box Models Liu Zhendong Wenyu Jiang Yan Zhang Chongjun Wang CML 6 0 0 22 Jun 2022
How explainable are adversarially-robust CNNs? Mehdi Nourelahi Lars Kotthoff Peijie Chen Anh Totti Nguyen AAML FAtt 22 8 0 25 May 2022
Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection Esma Balkir I. Nejadgholi Kathleen C. Fraser S. Kiritchenko FAtt 33 27 0 06 May 2022
Learning to Scaffold: Optimizing Model Explanations for Teaching Patrick Fernandes Marcos Vinícius Treviso Danish Pruthi André F. T. Martins Graham Neubig FAtt 19 22 0 22 Apr 2022
Towards Explainable Evaluation Metrics for Natural Language Generation Christoph Leiter Piyawat Lertvittayakumjorn M. Fomicheva Wei-Ye Zhao Yang Gao Steffen Eger AAML ELM 22 20 0 21 Mar 2022
Aligning Eyes between Humans and Deep Neural Network through Interactive Attention Alignment Yuyang Gao Tong Sun Liang Zhao Sungsoo Ray Hong HAI 21 37 0 06 Feb 2022
Diagnosing AI Explanation Methods with Folk Concepts of Behavior Alon Jacovi Jasmijn Bastings Sebastian Gehrmann Yoav Goldberg Katja Filippova 36 15 0 27 Jan 2022
Making a (Counterfactual) Difference One Rationale at a Time Michael J. Plyler Michal Green Min Chi 21 11 0 13 Jan 2022
Explain, Edit, and Understand: Rethinking User Study Design for Evaluating Model Explanations Siddhant Arora Danish Pruthi Norman M. Sadeh William W. Cohen Zachary Chase Lipton Graham Neubig FAtt 35 38 0 17 Dec 2021
What to Learn, and How: Toward Effective Learning from Rationales Samuel Carton Surya Kanoria Chenhao Tan 30 22 0 30 Nov 2021
Understanding Interlocking Dynamics of Cooperative Rationalization Mo Yu Yang Zhang Shiyu Chang Tommi Jaakkola 18 41 0 26 Oct 2021
Interpreting Deep Learning Models in Natural Language Processing: A Review Xiaofei Sun Diyi Yang Xiaoya Li Tianwei Zhang Yuxian Meng Han Qiu Guoyin Wang Eduard H. Hovy Jiwei Li 17 44 0 20 Oct 2021
The Irrationality of Neural Rationale Models Yiming Zheng Serena Booth J. Shah Yilun Zhou 32 16 0 14 Oct 2021
Diagnostics-Guided Explanation Generation Pepa Atanasova J. Simonsen Christina Lioma Isabelle Augenstein LRM FAtt 36 6 0 08 Sep 2021
ProoFVer: Natural Logic Theorem Proving for Fact Verification Amrith Krishna Sebastian Riedel Andreas Vlachos 21 61 0 25 Aug 2021
Knowledge-Grounded Self-Rationalization via Extractive and Natural Language Explanations Bodhisattwa Prasad Majumder Oana-Maria Camburu Thomas Lukasiewicz Julian McAuley 23 35 0 25 Jun 2021
On Sample Based Explanation Methods for NLP:Efficiency, Faithfulness, and Semantic Evaluation Wei Zhang Ziming Huang Yada Zhu Guangnan Ye Xiaodong Cui Fan Zhang 23 17 0 09 Jun 2021
The Out-of-Distribution Problem in Explainability and Search Methods for Feature Importance Explanations Peter Hase Harry Xie Mohit Bansal OODD LRM FAtt 18 91 0 01 Jun 2021
Zorro: Valid, Sparse, and Stable Explanations in Graph Neural Networks Thorben Funke Megha Khosla Mandeep Rathee Avishek Anand FAtt 23 38 0 18 May 2021
Do Feature Attribution Methods Correctly Attribute Features? Yilun Zhou Serena Booth Marco Tulio Ribeiro J. Shah FAtt XAI 22 132 0 27 Apr 2021
Contrastive Explanations for Model Interpretability Alon Jacovi Swabha Swayamdipta Shauli Ravfogel Yanai Elazar Yejin Choi Yoav Goldberg 35 95 0 02 Mar 2021
Explainability of deep vision-based autonomous driving systems: Review and challenges Éloi Zablocki H. Ben-younes P. Pérez Matthieu Cord XAI 37 169 0 13 Jan 2021
Invariant Rationalization Shiyu Chang Yang Zhang Mo Yu Tommi Jaakkola 179 201 0 22 Mar 2020
Towards Transparent Robotic Planning via Contrastive Explanations Shenghui Chen Kayla Boggess Lu Feng 17 9 0 16 Mar 2020
Towards A Rigorous Science of Interpretable Machine Learning Finale Doshi-Velez Been Kim XAI FaML 251 3,683 0 28 Feb 2017
Learning Attitudes and Attributes from Multi-Aspect Reviews Julian McAuley J. Leskovec Dan Jurafsky 200 296 0 15 Oct 2012