Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2011.06485
Cited By
v1
v2 (latest)
Fairness and Robustness in Invariant Learning: A Case Study in Toxicity Classification
12 November 2020
Robert Adragna
Elliot Creager
David Madras
R. Zemel
OOD
FaML
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Fairness and Robustness in Invariant Learning: A Case Study in Toxicity Classification"
29 / 29 papers shown
AdvSumm: Adversarial Training for Bias Mitigation in Text Summarization
Mukur Gupta
Nikhil Reddy Varimalla
Nicholas Deas
Melanie Subbiah
Kathleen McKeown
386
2
0
06 Jun 2025
Toxicity of the Commons: Curating Open-Source Pre-Training Data
Catherine Arnett
Eliot Jones
Ivan P. Yamshchikov
Pierre-Carl Langlais
394
7
0
29 Oct 2024
A Comprehensive Survey and Classification of Evaluation Criteria for Trustworthy Artificial Intelligence
AI and Ethics (AI & Ethics), 2024
Louise McCormack
Malika Bendechache
XAI
354
10
0
10 Oct 2024
One Fits All: Learning Fair Graph Neural Networks for Various Sensitive Attributes
Knowledge Discovery and Data Mining (KDD), 2024
Yuchang Zhu
Jintang Li
Yatao Bian
Zibin Zheng
Liang Chen
378
11
0
19 Jun 2024
Improving Commonsense Bias Classification by Mitigating the Influence of Demographic Terms
JinKyu Lee
Jihie Kim
179
0
0
11 Jun 2024
Designing Long-term Group Fair Policies in Dynamical Systems
Conference on Fairness, Accountability and Transparency (FAccT), 2023
Miriam Rateike
Isabel Valera
Patrick Forré
377
9
0
21 Nov 2023
Causal Adversarial Perturbations for Individual Fairness and Robustness in Heterogeneous Data Spaces
AAAI Conference on Artificial Intelligence (AAAI), 2023
A. Ehyaei
Kiarash Mohammadi
Amir-Hossein Karimi
Samira Samadi
G. Farnadi
AAML
263
5
0
17 Aug 2023
Improving Identity-Robustness for Face Models
Q. Qi
Shervin Ardeshir
CVBM
OOD
272
3
0
07 Apr 2023
Function Composition in Trustworthy Machine Learning: Implementation Choices, Insights, and Questions
Manish Nagireddy
Moninder Singh
Samuel C. Hoffman
Evaline Ju
Karthikeyan N. Ramamurthy
Kush R. Varshney
350
2
0
17 Feb 2023
Learning Optimal Features via Partial Invariance
AAAI Conference on Artificial Intelligence (AAAI), 2023
Moulik Choraria
Ibtihal Ferwana
Ankur Mani
Lav Varshney
OOD
293
3
0
28 Jan 2023
A Survey on Preserving Fairness Guarantees in Changing Environments
Ainhize Barrainkua
Paula Gordaliza
Jose A. Lozano
Novi Quadrianto
FaML
445
3
0
14 Nov 2022
Fairness via Adversarial Attribute Neighbourhood Robust Learning
Q. Qi
Shervin Ardeshir
Yi Tian Xu
Tianbao Yang
302
0
0
12 Oct 2022
Fairness and robustness in anti-causal prediction
Maggie Makar
Alexander DÁmour
OOD
296
13
0
20 Sep 2022
ABCinML: Anticipatory Bias Correction in Machine Learning Applications
Conference on Fairness, Accountability and Transparency (FAccT), 2022
Abdulaziz A. Almuzaini
C. Bhatt
David M. Pennock
V. Singh
FaML
200
13
0
14 Jun 2022
Challenges in Applying Explainability Methods to Improve the Fairness of NLP Models
Esma Balkir
S. Kiritchenko
I. Nejadgholi
Kathleen C. Fraser
357
42
0
08 Jun 2022
De-biasing "bias" measurement
Conference on Fairness, Accountability and Transparency (FAccT), 2022
K. Lum
Yunfeng Zhang
Amanda Bower
315
33
0
11 May 2022
The Road to Explainability is Paved with Bias: Measuring the Fairness of Explanations
Conference on Fairness, Accountability and Transparency (FAccT), 2022
Aparna Balagopalan
Haoran Zhang
Kimia Hamidieh
Thomas Hartvigsen
Frank Rudzicz
Marzyeh Ghassemi
330
99
0
06 May 2022
Feature robustness and sex differences in medical imaging: a case study in MRI-based Alzheimer's disease detection
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022
Eike Petersen
Aasa Feragen
Maria Luise da Costa Zemsch
A. Henriksen
Oskar Eiler Wiese Christensen
M. Ganz
OOD
258
31
0
04 Apr 2022
On the Intrinsic and Extrinsic Fairness Evaluation Metrics for Contextualized Language Representations
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Yang Trista Cao
Yada Pruksachatkun
Kai-Wei Chang
Rahul Gupta
Varun Kumar
Jwala Dhamala
Aram Galstyan
244
108
0
25 Mar 2022
Diagnosing failures of fairness transfer across distribution shift in real-world medical settings
Neural Information Processing Systems (NeurIPS), 2022
Jessica Schrouff
Natalie Harris
Oluwasanmi Koyejo
Ibrahim Alabdulmohsin
Eva Schnider
...
Vivek Natarajan
Alan Karthikesalingam
Katherine A. Heller
Silvia Chiappa
Alexander DÁmour
OOD
499
74
0
02 Feb 2022
Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation
Neural Information Processing Systems (NeurIPS), 2022
Jixuan Wang
Kuan-Chieh Wang
Frank Rudzicz
M. Brudno
VLM
179
24
0
27 Jan 2022
Balancing Fairness and Robustness via Partial Invariance
Moulik Choraria
Ibtihal Ferwana
Ankur Mani
Lav Varshney
OOD
315
1
0
17 Dec 2021
Identifying and Benchmarking Natural Out-of-Context Prediction Problems
Neural Information Processing Systems (NeurIPS), 2021
David Madras
D. Psaltis
CML
OOD
295
5
0
25 Oct 2021
Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond
Amir Feder
Katherine A. Keith
Emaad A. Manzoor
Reid Pryzant
Dhanya Sridhar
...
Roi Reichart
Margaret E. Roberts
Brandon M Stewart
Victor Veitch
Diyi Yang
CML
483
307
0
02 Sep 2021
Responsible and Regulatory Conform Machine Learning for Medicine: A Survey of Challenges and Solutions
IEEE Access (IEEE Access), 2021
Eike Petersen
Yannik Potdevin
Esfandiar Mohammadi
Stephan Zidowitz
Sabrina Breyer
...
Sandra Henn
Ludwig Pechmann
M. Leucker
P. Rostalski
Christian Herzog
FaML
AILaw
OOD
315
38
0
20 Jul 2021
Measuring and Improving Model-Moderator Collaboration using Uncertainty Estimation
Ian D Kivlichan
Zi Lin
J. Liu
Lucy Vasserman
202
24
0
09 Jul 2021
Does Robustness Improve Fairness? Approaching Fairness with Word Substitution Robustness Methods for Text Classification
Findings (Findings), 2021
Yada Pruksachatkun
Satyapriya Krishna
Jwala Dhamala
Rahul Gupta
Kai-Wei Chang
210
33
0
21 Jun 2021
An Empirical Framework for Domain Generalization in Clinical Settings
ACM Conference on Health, Inference, and Learning (CHIL), 2021
Haoran Zhang
Natalie Dullerud
Laleh Seyyed-Kalantari
Q. Morris
Shalmali Joshi
Marzyeh Ghassemi
OOD
AI4CE
313
69
0
20 Mar 2021
WILDS: A Benchmark of in-the-Wild Distribution Shifts
International Conference on Machine Learning (ICML), 2020
Pang Wei Koh
Shiori Sagawa
Henrik Marklund
Sang Michael Xie
Marvin Zhang
...
A. Kundaje
Emma Pierson
Sergey Levine
Chelsea Finn
Abigail Z. Jacobs
OOD
779
1,744
0
14 Dec 2020
1
Page 1 of 1