Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1809.10610
Cited By
Counterfactual Fairness in Text Classification through Robustness
27 September 2018
Sahaj Garg
Vincent Perot
Nicole Limtiaco
Ankur Taly
Ed H. Chi
Alex Beutel
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Counterfactual Fairness in Text Classification through Robustness"
35 / 35 papers shown
Title
SAGE
\texttt{SAGE}
SAGE
: A Generic Framework for LLM Safety Evaluation
Madhur Jindal
Hari Shrawgi
Parag Agrawal
Sandipan Dandapat
ELM
47
0
0
28 Apr 2025
Collapsed Language Models Promote Fairness
Jingxuan Xu
Wuyang Chen
Linyi Li
Yao Zhao
Yunchao Wei
39
0
0
06 Oct 2024
Counterfactual Fairness by Combining Factual and Counterfactual Predictions
Zeyu Zhou
Tianci Liu
Ruqi Bai
Jing Gao
Murat Kocaoglu
David I. Inouye
36
2
0
03 Sep 2024
Uncovering Bias in Large Vision-Language Models at Scale with Counterfactuals
Phillip Howard
Kathleen C. Fraser
Anahita Bhiwandiwalla
S. Kiritchenko
48
9
0
30 May 2024
How does promoting the minority fraction affect generalization? A theoretical study of the one-hidden-layer neural network on group imbalance
Hongkang Li
Shuai Zhang
Yihua Zhang
Meng Wang
Sijia Liu
Pin-Yu Chen
33
4
0
12 Mar 2024
Measuring Bias in a Ranked List using Term-based Representations
Amin Abolghasemi
Leif Azzopardi
Arian Askari
Maarten de Rijke
Suzan Verberne
34
6
0
09 Mar 2024
A Survey on Fairness in Large Language Models
Yingji Li
Mengnan Du
Rui Song
Xin Wang
Ying Wang
ALM
37
59
0
20 Aug 2023
Should We Attend More or Less? Modulating Attention for Fairness
A. Zayed
Gonçalo Mordido
Samira Shabanian
Sarath Chandar
35
10
0
22 May 2023
Bias Beyond English: Counterfactual Tests for Bias in Sentiment Analysis in Four Languages
Seraphina Goldfarb-Tarrant
Adam Lopez
Roi Blanco
Diego Marcheggiani
22
13
0
19 May 2023
Implementing Responsible AI: Tensions and Trade-Offs Between Ethics Aspects
Conrad Sanderson
David M. Douglas
Qinghua Lu
32
11
0
17 Apr 2023
Deep Causal Learning: Representation, Discovery and Inference
Zizhen Deng
Xiaolong Zheng
Hu Tian
D. Zeng
CML
BDL
26
11
0
07 Nov 2022
Equal Experience in Recommender Systems
Jaewoong Cho
Moonseok Choi
Changho Suh
FaML
16
1
0
12 Oct 2022
Discovering Bugs in Vision Models using Off-the-shelf Image Generation and Captioning
Olivia Wiles
Isabela Albuquerque
Sven Gowal
VLM
30
46
0
18 Aug 2022
Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models
Paul Röttger
Haitham Seelawi
Debora Nozza
Zeerak Talat
Bertie Vidgen
14
65
0
20 Jun 2022
Challenges in Applying Explainability Methods to Improve the Fairness of NLP Models
Esma Balkir
S. Kiritchenko
I. Nejadgholi
Kathleen C. Fraser
16
36
0
08 Jun 2022
Accurate Fairness: Improving Individual Fairness without Trading Accuracy
Xuran Li
Peng Wu
Jing Su
FaML
23
17
0
18 May 2022
Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection
Esma Balkir
I. Nejadgholi
Kathleen C. Fraser
S. Kiritchenko
FAtt
15
27
0
06 May 2022
Easy Adaptation to Mitigate Gender Bias in Multilingual Text Classification
Xiaolei Huang
FaML
8
8
0
12 Apr 2022
Learning fair representation with a parametric integral probability metric
Dongha Kim
Kunwoong Kim
Insung Kong
Ilsang Ohn
Yongdai Kim
FaML
17
16
0
07 Feb 2022
CausalSim: A Causal Framework for Unbiased Trace-Driven Simulation
Abdullah Alomar
Pouya Hamadanian
Arash Nasr-Esfahany
Anish Agarwal
MohammadIman Alizadeh
Devavrat Shah
CML
10
22
0
05 Jan 2022
Modeling Techniques for Machine Learning Fairness: A Survey
Mingyang Wan
Daochen Zha
Ninghao Liu
Na Zou
SyDa
FaML
17
36
0
04 Nov 2021
Enhancing Model Robustness and Fairness with Causality: A Regularization Approach
Zhao Wang
Kai Shu
A. Culotta
OOD
11
14
0
03 Oct 2021
Improving Counterfactual Generation for Fair Hate Speech Detection
Aida Mostafazadeh Davani
Ali Omrani
Brendan Kennedy
M. Atari
Xiang Ren
Morteza Dehghani
14
9
0
03 Aug 2021
Synthesizing Adversarial Negative Responses for Robust Response Ranking and Evaluation
Prakhar Gupta
Yulia Tsvetkov
Jeffrey P. Bigham
26
22
0
10 Jun 2021
Counterfactual Invariance to Spurious Correlations: Why and How to Pass Stress Tests
Victor Veitch
Alexander DÁmour
Steve Yadlowsky
Jacob Eisenstein
OOD
9
91
0
31 May 2021
Re-imagining Algorithmic Fairness in India and Beyond
Nithya Sambasivan
Erin Arnesen
Ben Hutchinson
Tulsee Doshi
Vinodkumar Prabhakaran
FaML
11
174
0
25 Jan 2021
Optimism in the Face of Adversity: Understanding and Improving Deep Learning through Adversarial Robustness
Guillermo Ortiz-Jiménez
Apostolos Modas
Seyed-Mohsen Moosavi-Dezfooli
P. Frossard
AAML
19
48
0
19 Oct 2020
A Distributionally Robust Approach to Fair Classification
Bahar Taşkesen
Viet Anh Nguyen
Daniel Kuhn
Jose H. Blanchet
FaML
13
61
0
18 Jul 2020
Two Simple Ways to Learn Individual Fairness Metrics from Data
Debarghya Mukherjee
Mikhail Yurochkin
Moulinath Banerjee
Yuekai Sun
FaML
8
96
0
19 Jun 2020
Social Biases in NLP Models as Barriers for Persons with Disabilities
Ben Hutchinson
Vinodkumar Prabhakaran
Emily L. Denton
Kellie Webster
Yu Zhong
Stephen Denuyl
9
302
0
02 May 2020
Multilingual Twitter Corpus and Baselines for Evaluating Demographic Bias in Hate Speech Recognition
Xiaolei Huang
Linzi Xing
Franck Dernoncourt
Michael J. Paul
6
87
0
24 Feb 2020
Counterfactual Vision-and-Language Navigation via Adversarial Path Sampling
Tsu-jui Fu
X. Wang
Matthew F. Peterson
Scott T. Grafton
M. Eckstein
William Yang Wang
49
41
0
17 Nov 2019
Perturbation Sensitivity Analysis to Detect Unintended Model Biases
Vinodkumar Prabhakaran
Ben Hutchinson
Margaret Mitchell
6
117
0
09 Oct 2019
Incorporating Priors with Feature Attribution on Text Classification
Frederick Liu
Besim Avci
FAtt
FaML
17
120
0
19 Jun 2019
Putting Fairness Principles into Practice: Challenges, Metrics, and Improvements
Alex Beutel
Jilin Chen
Tulsee Doshi
Hai Qian
Allison Woodruff
Christine Luu
Pierre Kreitmann
Jonathan Bischof
Ed H. Chi
FaML
24
150
0
14 Jan 2019
1