Towards Debiasing NLU Models from Unknown Biases

25 September 2020

Papers citing "Towards Debiasing NLU Models from Unknown Biases"

29 / 29 papers shown

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models Samuel Marks Can Rager Eric J. Michaud Yonatan Belinkov David Bau Aaron Mueller 44 110 0 28 Mar 2024
Take Care of Your Prompt Bias! Investigating and Mitigating Prompt Bias in Factual Knowledge Extraction Ziyang Xu Keqin Peng Liang Ding Dacheng Tao Xiliang Lu 32 10 0 15 Mar 2024
Complexity Matters: Dynamics of Feature Learning in the Presence of Spurious Correlations GuanWen Qiu Da Kuang Surbhi Goel 25 8 0 05 Mar 2024
GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models' Over-Reliance on Superficial Clue Yanrui Du Sendong Zhao Yuhan Chen Rai Bai Jing Liu Huaqin Wu Haifeng Wang Bing Qin 25 2 0 08 Sep 2023
A Survey on Fairness in Large Language Models Yingji Li Mengnan Du Rui Song Xin Wang Ying Wang ALM 37 59 0 20 Aug 2023
Modeling the Q-Diversity in a Min-max Play Game for Robust Optimization Ting Wu Rui Zheng Tao Gui Qi Zhang Xuanjing Huang 25 2 0 20 May 2023
Think Twice: Measuring the Efficiency of Eliminating Prediction Shortcuts of Question Answering Models Lukáš Mikula Michal Štefánik Marek Petrovič Petr Sojka 28 3 0 11 May 2023
Diffusion Theory as a Scalpel: Detecting and Purifying Poisonous Dimensions in Pre-trained Language Models Caused by Backdoor or Bias Zhiyuan Zhang Deli Chen Hao Zhou Fandong Meng Jie Zhou Xu Sun 26 5 0 08 May 2023
Delving into Identify-Emphasize Paradigm for Combating Unknown Bias Bowen Zhao Chen Chen Qian-Wei Wang Anfeng He Shutao Xia 13 1 0 22 Feb 2023
Guide the Learner: Controlling Product of Experts Debiasing Method Based on Token Attribution Similarities Ali Modarressi Hossein Amirkhani Mohammad Taher Pilehvar 6 2 0 06 Feb 2023
Feature-Level Debiased Natural Language Understanding Yougang Lyu Piji Li Yechang Yang Maarten de Rijke Pengjie Ren Yukun Zhao Dawei Yin Z. Ren 23 10 0 11 Dec 2022
Looking at the Overlooked: An Analysis on the Word-Overlap Bias in Natural Language Inference S. Rajaee Yadollah Yaghoobzadeh Mohammad Taher Pilehvar 23 5 0 07 Nov 2022
GAPX: Generalized Autoregressive Paraphrase-Identification X Yi Zhou Renyu Li Hayden Housen Ser-Nam Lim BDL 25 0 0 05 Oct 2022
Shortcut Learning of Large Language Models in Natural Language Understanding Mengnan Du Fengxiang He Na Zou Dacheng Tao Xia Hu KELM OffRL 19 82 0 25 Aug 2022
Distilling Model Failures as Directions in Latent Space Saachi Jain Hannah Lawrence Ankur Moitra A. Madry 16 89 0 29 Jun 2022
Last Layer Re-Training is Sufficient for Robustness to Spurious Correlations Polina Kirichenko Pavel Izmailov A. Wilson OOD 29 314 0 06 Apr 2022
OccamNets: Mitigating Dataset Bias by Favoring Simpler Hypotheses Robik Shrestha Kushal Kafle Christopher Kanan CML 21 13 0 05 Apr 2022
Adaptor: Objective-Centric Adaptation Framework for Language Models Michal Štefánik Vít Novotný Nikola Groverová Petr Sojka 20 10 0 08 Mar 2022
Saving Dense Retriever from Shortcut Dependency in Conversational Search Sungdong Kim Gangwoo Kim 17 26 0 15 Feb 2022
Measure and Improve Robustness in NLP Models: A Survey Xuezhi Wang Haohan Wang Diyi Yang 139 130 0 15 Dec 2021
Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning Prasetya Ajie Utama N. Moosavi Victor Sanh Iryna Gurevych AAML 56 35 0 09 Sep 2021
End-to-End Self-Debiasing Framework for Robust NLU Training Abbas Ghaddar Philippe Langlais Mehdi Rezagholizadeh Ahmad Rashid UQCV 16 36 0 05 Sep 2021
A Survey on Automated Fact-Checking Zhijiang Guo M. Schlichtkrull Andreas Vlachos 25 454 0 26 Aug 2021
Context-aware Adversarial Training for Name Regularity Bias in Named Entity Recognition Abbas Ghaddar Philippe Langlais Ahmad Rashid Mehdi Rezagholizadeh 30 42 0 24 Jul 2021
An Investigation of the (In)effectiveness of Counterfactually Augmented Data Nitish Joshi He He OODD 19 46 0 01 Jul 2021
Evading the Simplicity Bias: Training a Diverse Set of Models Discovers Solutions with Superior OOD Generalization Damien Teney Ehsan Abbasnejad Simon Lucey A. Hengel 20 86 0 12 May 2021
Improving Robustness by Augmenting Training Sentences with Predicate-Argument Structures N. Moosavi M. Boer Prasetya Ajie Utama Iryna Gurevych 6 13 0 23 Oct 2020
Hypothesis Only Baselines in Natural Language Inference Adam Poliak Jason Naradowsky Aparajita Haldar Rachel Rudinger Benjamin Van Durme 187 576 0 02 May 2018
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding Alex Wang Amanpreet Singh Julian Michael Felix Hill Omer Levy Samuel R. Bowman ELM 294 6,943 0 20 Apr 2018