ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1903.04561
  4. Cited By
Nuanced Metrics for Measuring Unintended Bias with Real Data for Text
  Classification

Nuanced Metrics for Measuring Unintended Bias with Real Data for Text Classification

11 March 2019
Daniel Borkan
Lucas Dixon
Jeffrey Scott Sorensen
Nithum Thain
Lucy Vasserman
ArXivPDFHTML

Papers citing "Nuanced Metrics for Measuring Unintended Bias with Real Data for Text Classification"

50 / 125 papers shown
Title
Detecting Unintended Social Bias in Toxic Language Datasets
Detecting Unintended Social Bias in Toxic Language Datasets
Nihar Ranjan Sahoo
Himanshu Gupta
P. Bhattacharyya
21
18
0
21 Oct 2022
Scaling Instruction-Finetuned Language Models
Scaling Instruction-Finetuned Language Models
Hyung Won Chung
Le Hou
Shayne Longpre
Barret Zoph
Yi Tay
...
Jacob Devlin
Adam Roberts
Denny Zhou
Quoc V. Le
Jason W. Wei
ReLM
LRM
103
3,019
0
20 Oct 2022
On Feature Learning in the Presence of Spurious Correlations
On Feature Learning in the Presence of Spurious Correlations
Pavel Izmailov
Polina Kirichenko
Nate Gruver
A. Wilson
43
118
0
20 Oct 2022
How Hate Speech Varies by Target Identity: A Computational Analysis
How Hate Speech Varies by Target Identity: A Computational Analysis
Michael Miller Yoder
Lynnette Hui Xian Ng
D. W. Brown
Kathleen M. Carley
35
20
0
19 Oct 2022
On Learning Fairness and Accuracy on Multiple Subgroups
On Learning Fairness and Accuracy on Multiple Subgroups
Changjian Shui
Gezheng Xu
Qi Chen
Jiaqi Li
Charles Ling
Tal Arbel
Boyu Wang
Christian Gagné
46
37
0
19 Oct 2022
Towards Procedural Fairness: Uncovering Biases in How a Toxic Language
  Classifier Uses Sentiment Information
Towards Procedural Fairness: Uncovering Biases in How a Toxic Language Classifier Uses Sentiment Information
I. Nejadgholi
Esma Balkir
Kathleen C. Fraser
S. Kiritchenko
40
3
0
19 Oct 2022
Towards Explaining Distribution Shifts
Towards Explaining Distribution Shifts
Sean Kulinski
David I. Inouye
OffRL
FAtt
44
24
0
19 Oct 2022
Analyzing Text Representations under Tight Annotation Budgets: Measuring
  Structural Alignment
Analyzing Text Representations under Tight Annotation Budgets: Measuring Structural Alignment
César González-Gutiérrez
Audi Primadhanty
Francesco Cazzaro
A. Quattoni
33
0
0
11 Oct 2022
A Keyword Based Approach to Understanding the Overpenalization of
  Marginalized Groups by English Marginal Abuse Models on Twitter
A Keyword Based Approach to Understanding the Overpenalization of Marginalized Groups by English Marginal Abuse Models on Twitter
Kyra Yee
Alice Schoenauer Sebag
Olivia Redfield
Emily Sheng
Matthias Eck
Luca Belli
28
2
0
07 Oct 2022
State-of-the-art generalisation research in NLP: A taxonomy and review
State-of-the-art generalisation research in NLP: A taxonomy and review
Dieuwke Hupkes
Mario Giulianelli
Verna Dankers
Mikel Artetxe
Yanai Elazar
...
Leila Khalatbari
Maria Ryskina
Rita Frieske
Ryan Cotterell
Zhijing Jin
129
95
0
06 Oct 2022
UMIX: Improving Importance Weighting for Subpopulation Shift via
  Uncertainty-Aware Mixup
UMIX: Improving Importance Weighting for Subpopulation Shift via Uncertainty-Aware Mixup
Zongbo Han
Zhipeng Liang
Fan Yang
Liu Liu
Lanqing Li
Yatao Bian
P. Zhao
Bing Wu
Changqing Zhang
Jianhua Yao
56
34
0
19 Sep 2022
Task Selection for AutoML System Evaluation
Task Selection for AutoML System Evaluation
Jon Lorraine
Nihesh Anderson
Chansoo Lee
Quentin de Laroussilhe
Mehadi Hassen
52
4
0
26 Aug 2022
Minimax AUC Fairness: Efficient Algorithm with Provable Convergence
Minimax AUC Fairness: Efficient Algorithm with Provable Convergence
Zhenhuan Yang
Yan Lok Ko
Kush R. Varshney
Yiming Ying
FaML
33
17
0
22 Aug 2022
Exploring Hate Speech Detection with HateXplain and BERT
Exploring Hate Speech Detection with HateXplain and BERT
Arvind Subramaniam
A. Mehra
Sayani Kundu
18
3
0
09 Aug 2022
Calibrated ensembles can mitigate accuracy tradeoffs under distribution
  shift
Calibrated ensembles can mitigate accuracy tradeoffs under distribution shift
Ananya Kumar
Tengyu Ma
Percy Liang
Aditi Raghunathan
UQCV
OODD
OOD
49
38
0
18 Jul 2022
Contrastive Adapters for Foundation Model Group Robustness
Contrastive Adapters for Foundation Model Group Robustness
Michael Zhang
Christopher Ré
VLM
23
62
0
14 Jul 2022
Characteristics of Harmful Text: Towards Rigorous Benchmarking of
  Language Models
Characteristics of Harmful Text: Towards Rigorous Benchmarking of Language Models
Maribeth Rauh
John F. J. Mellor
J. Uesato
Po-Sen Huang
Johannes Welbl
...
Amelia Glaese
G. Irving
Iason Gabriel
William S. Isaac
Lisa Anne Hendricks
38
49
0
16 Jun 2022
Modeling the Data-Generating Process is Necessary for
  Out-of-Distribution Generalization
Modeling the Data-Generating Process is Necessary for Out-of-Distribution Generalization
Jivat Neet Kaur
Emre Kıcıman
Amit Sharma
UQCV
OOD
30
25
0
15 Jun 2022
ABCinML: Anticipatory Bias Correction in Machine Learning Applications
ABCinML: Anticipatory Bias Correction in Machine Learning Applications
Abdulaziz A. Almuzaini
C. Bhatt
David M. Pennock
V. Singh
FaML
30
10
0
14 Jun 2022
Conditional Supervised Contrastive Learning for Fair Text Classification
Conditional Supervised Contrastive Learning for Fair Text Classification
Jianfeng Chi
Will Shand
Yaodong Yu
Kai-Wei Chang
Han Zhao
Yuan Tian
FaML
54
14
0
23 May 2022
Evaluation Gaps in Machine Learning Practice
Evaluation Gaps in Machine Learning Practice
Ben Hutchinson
Negar Rostamzadeh
Christina Greer
Katherine A. Heller
Vinodkumar Prabhakaran
ELM
36
56
0
11 May 2022
UL2: Unifying Language Learning Paradigms
UL2: Unifying Language Learning Paradigms
Yi Tay
Mostafa Dehghani
Vinh Q. Tran
Xavier Garcia
Jason W. Wei
...
Tal Schuster
H. Zheng
Denny Zhou
N. Houlsby
Donald Metzler
AI4CE
74
298
0
10 May 2022
Counterfactually Augmented Data and Unintended Bias: The Case of Sexism
  and Hate Speech Detection
Counterfactually Augmented Data and Unintended Bias: The Case of Sexism and Hate Speech Detection
Indira Sen
Mattia Samory
Claudia Wagner
Isabelle Augenstein
28
17
0
09 May 2022
Necessity and Sufficiency for Explaining Text Classifiers: A Case Study
  in Hate Speech Detection
Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection
Esma Balkir
I. Nejadgholi
Kathleen C. Fraser
S. Kiritchenko
FAtt
41
27
0
06 May 2022
Is Your Toxicity My Toxicity? Exploring the Impact of Rater Identity on
  Toxicity Annotation
Is Your Toxicity My Toxicity? Exploring the Impact of Rater Identity on Toxicity Annotation
Nitesh Goyal
Ian D Kivlichan
Rachel Rosen
Lucy Vasserman
41
90
0
01 May 2022
Easy Adaptation to Mitigate Gender Bias in Multilingual Text
  Classification
Easy Adaptation to Mitigate Gender Bias in Multilingual Text Classification
Xiaolei Huang
FaML
21
8
0
12 Apr 2022
Last Layer Re-Training is Sufficient for Robustness to Spurious
  Correlations
Last Layer Re-Training is Sufficient for Robustness to Spurious Correlations
Polina Kirichenko
Pavel Izmailov
A. Wilson
OOD
56
320
0
06 Apr 2022
AUC Maximization in the Era of Big Data and AI: A Survey
AUC Maximization in the Era of Big Data and AI: A Survey
Tianbao Yang
Yiming Ying
44
179
0
28 Mar 2022
A study on the distribution of social biases in self-supervised learning
  visual models
A study on the distribution of social biases in self-supervised learning visual models
Kirill Sirotkin
Pablo Carballeira
Marcos Escudero-Viñolo
22
18
0
03 Mar 2022
Exploring the Unfairness of DP-SGD Across Settings
Exploring the Unfairness of DP-SGD Across Settings
Frederik Noe
R. Herskind
Anders Søgaard
27
4
0
24 Feb 2022
A New Generation of Perspective API: Efficient Multilingual
  Character-level Transformers
A New Generation of Perspective API: Efficient Multilingual Character-level Transformers
Alyssa Lees
Vinh Q. Tran
Yi Tay
Jeffrey Scott Sorensen
Jai Gupta
Donald Metzler
Lucy Vasserman
39
176
0
22 Feb 2022
Reward Modeling for Mitigating Toxicity in Transformer-based Language
  Models
Reward Modeling for Mitigating Toxicity in Transformer-based Language Models
Farshid Faal
K. Schmitt
Jia Yuan Yu
13
24
0
19 Feb 2022
Discovering Distribution Shifts using Latent Space Representations
Discovering Distribution Shifts using Latent Space Representations
Leo Betthauser
Urszula Chajewska
M. Diesendruck
Rohith Pesala
OOD
41
5
0
04 Feb 2022
Fairness for Text Classification Tasks with Identity Information Data
  Augmentation Methods
Fairness for Text Classification Tasks with Identity Information Data Augmentation Methods
Mohit Wadhwa
Mohan Bhambhani
Ashvini Jindal
Uma Sawant
Ramanujam Madhavan
19
4
0
04 Feb 2022
Handling Bias in Toxic Speech Detection: A Survey
Handling Bias in Toxic Speech Detection: A Survey
Tanmay Garg
Sarah Masud
Tharun Suresh
Tanmoy Chakraborty
17
91
0
26 Jan 2022
Leveraging Transformers for Hate Speech Detection in Conversational
  Code-Mixed Tweets
Leveraging Transformers for Hate Speech Detection in Conversational Code-Mixed Tweets
Zaki Mustafa Farooqi
Sreyan Ghosh
R. Shah
32
29
0
18 Dec 2021
Balancing Fairness and Robustness via Partial Invariance
Balancing Fairness and Robustness via Partial Invariance
Moulik Choraria
Ibtihal Ferwana
Ankur Mani
Lav Varshney
OOD
36
1
0
17 Dec 2021
Simple Text Detoxification by Identifying a Linear Toxic Subspace in
  Language Model Embeddings
Simple Text Detoxification by Identifying a Linear Toxic Subspace in Language Model Embeddings
Andrew Wang
Mohit Sudhakar
Yangfeng Ji
20
2
0
15 Dec 2021
Deep AUC Maximization for Medical Image Classification: Challenges and
  Opportunities
Deep AUC Maximization for Medical Image Classification: Challenges and Opportunities
Tianbao Yang
30
3
0
01 Nov 2021
Simple data balancing achieves competitive worst-group-accuracy
Simple data balancing achieves competitive worst-group-accuracy
Badr Youbi Idrissi
Martín Arjovsky
Mohammad Pezeshki
David Lopez-Paz
56
173
0
27 Oct 2021
Sparse Distillation: Speeding Up Text Classification by Using Bigger
  Student Models
Sparse Distillation: Speeding Up Text Classification by Using Bigger Student Models
Qinyuan Ye
Madian Khabsa
M. Lewis
Sinong Wang
Xiang Ren
Aaron Jaech
39
5
0
16 Oct 2021
Focus on the Common Good: Group Distributional Robustness Follows
Focus on the Common Good: Group Distributional Robustness Follows
Vihari Piratla
Praneeth Netrapalli
Sunita Sarawagi
OOD
33
25
0
06 Oct 2021
Unpacking the Interdependent Systems of Discrimination: Ableist Bias in
  NLP Systems through an Intersectional Lens
Unpacking the Interdependent Systems of Discrimination: Ableist Bias in NLP Systems through an Intersectional Lens
Saad Hassan
Matt Huenerfauth
Cecilia Ovesdotter Alm
51
38
0
01 Oct 2021
Scale Efficiently: Insights from Pre-training and Fine-tuning
  Transformers
Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers
Yi Tay
Mostafa Dehghani
J. Rao
W. Fedus
Samira Abnar
Hyung Won Chung
Sharan Narang
Dani Yogatama
Ashish Vaswani
Donald Metzler
206
111
0
22 Sep 2021
Fairness-aware Class Imbalanced Learning
Fairness-aware Class Imbalanced Learning
Shivashankar Subramanian
Afshin Rahimi
Timothy Baldwin
Trevor Cohn
Lea Frermann
FaML
109
28
0
21 Sep 2021
Towards Out-Of-Distribution Generalization: A Survey
Towards Out-Of-Distribution Generalization: A Survey
Jiashuo Liu
Zheyan Shen
Yue He
Xingxuan Zhang
Renzhe Xu
Han Yu
Peng Cui
CML
OOD
69
519
0
31 Aug 2021
How Hateful are Movies? A Study and Prediction on Movie Subtitles
How Hateful are Movies? A Study and Prediction on Movie Subtitles
Niklas von Boguszewski
Sana Moin
Anirban Bhowmick
Seid Muhie Yimam
Christian Biemann
25
4
0
19 Aug 2021
On Measures of Biases and Harms in NLP
On Measures of Biases and Harms in NLP
Sunipa Dev
Emily Sheng
Jieyu Zhao
Aubrie Amstutz
Jiao Sun
...
M. Sanseverino
Jiin Kim
Akihiro Nishi
Nanyun Peng
Kai-Wei Chang
33
80
0
07 Aug 2021
Just Train Twice: Improving Group Robustness without Training Group
  Information
Just Train Twice: Improving Group Robustness without Training Group Information
E. Liu
Behzad Haghgoo
Annie S. Chen
Aditi Raghunathan
Pang Wei Koh
Shiori Sagawa
Percy Liang
Chelsea Finn
OOD
37
540
0
19 Jul 2021
Trustworthy AI: A Computational Perspective
Trustworthy AI: A Computational Perspective
Haochen Liu
Yiqi Wang
Wenqi Fan
Xiaorui Liu
Yaxin Li
Shaili Jain
Yunhao Liu
Anil K. Jain
Jiliang Tang
FaML
104
197
0
12 Jul 2021
Previous
123
Next