Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1903.10561
Cited By
On Measuring Social Biases in Sentence Encoders
25 March 2019
Chandler May
Alex Jinpeng Wang
Shikha Bordia
Samuel R. Bowman
Rachel Rudinger
Re-assign community
ArXiv
PDF
HTML
Papers citing
"On Measuring Social Biases in Sentence Encoders"
50 / 367 papers shown
Title
A Bayesian approach to uncertainty in word embedding bias estimation
Alicja Dobrzeniecka
R. Urbaniak
17
1
0
15 Jun 2023
Sociodemographic Bias in Language Models: A Survey and Forward Path
Vipul Gupta
Pranav Narayanan Venkit
Shomir Wilson
R. Passonneau
42
19
0
13 Jun 2023
Bias Against 93 Stigmatized Groups in Masked Language Models and Downstream Sentiment Classification Tasks
Katelyn Mei
Sonia Fereidooni
Aylin Caliskan
14
45
0
08 Jun 2023
An Empirical Analysis of Parameter-Efficient Methods for Debiasing Pre-Trained Language Models
Zhongbin Xie
Thomas Lukasiewicz
19
12
0
06 Jun 2023
T2IAT: Measuring Valence and Stereotypical Biases in Text-to-Image Generation
Jialu Wang
Xinyue Liu
Zonglin Di
Y. Liu
Xin Eric Wang
14
32
0
01 Jun 2023
An Invariant Learning Characterization of Controlled Text Generation
Carolina Zheng
Claudia Shi
Keyon Vafa
Amir Feder
David M. Blei
OOD
18
8
0
31 May 2023
Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models
Myra Cheng
Esin Durmus
Dan Jurafsky
25
174
0
29 May 2023
An Efficient Multilingual Language Model Compression through Vocabulary Trimming
Asahi Ushio
Yi Zhou
Jose Camacho-Collados
39
7
0
24 May 2023
Trade-Offs Between Fairness and Privacy in Language Modeling
Cleo Matzken
Steffen Eger
Ivan Habernal
SILM
31
6
0
24 May 2023
Detecting and Mitigating Indirect Stereotypes in Word Embeddings
Erin E. George
Joyce A. Chew
Deanna Needell
8
0
0
23 May 2023
Having Beer after Prayer? Measuring Cultural Bias in Large Language Models
Tarek Naous
Michael Joseph Ryan
Alan Ritter
Wei-ping Xu
24
84
0
23 May 2023
Out-of-Distribution Generalization in Text Classification: Past, Present, and Future
Linyi Yang
Y. Song
Xuan Ren
Chenyang Lyu
Yidong Wang
Lingqiao Liu
Jindong Wang
Jennifer Foster
Yue Zhang
OOD
20
2
0
23 May 2023
A Trip Towards Fairness: Bias and De-Biasing in Large Language Models
Leonardo Ranaldi
Elena Sofia Ruzzetti
Davide Venditti
Dario Onorati
Fabio Massimo Zanzotto
27
33
0
23 May 2023
Language-Agnostic Bias Detection in Language Models with Bias Probing
Abdullatif Köksal
Omer F. Yalcin
Ahmet Akbiyik
M. Kilavuz
Anna Korhonen
Hinrich Schütze
15
1
0
22 May 2023
Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil Demographic Biases in Languages at Scale
Marta R. Costa-jussá
Pierre Yves Andrews
Eric Michael Smith
Prangthip Hansanti
C. Ropers
Elahe Kalbassi
Cynthia Gao
Daniel Licht
Carleigh Wood
27
15
0
22 May 2023
On Bias and Fairness in NLP: Investigating the Impact of Bias and Debiasing in Language Models on the Fairness of Toxicity Detection
Fatma Elsafoury
Stamos Katsigiannis
22
1
0
22 May 2023
ToxBuster: In-game Chat Toxicity Buster with BERT
Zachary Yang
Yasmine Maricar
M. Davari
Nicolas Grenon-Godbout
Reihaneh Rabbany
14
3
0
21 May 2023
Measuring Intersectional Biases in Historical Documents
Nadav Borenstein
Karolina Stañczak
Thea Rolskov
N. Perez
N. Kafer
Isabelle Augenstein
6
4
0
21 May 2023
Solving NLP Problems through Human-System Collaboration: A Discussion-based Approach
Masahiro Kaneko
Graham Neubig
Naoaki Okazaki
25
6
0
19 May 2023
Solving Cosine Similarity Underestimation between High Frequency Words by L2 Norm Discounting
Saeth Wannasuphoprasit
Yi Zhou
Danushka Bollegala
19
4
0
17 May 2023
Language Model Tokenizers Introduce Unfairness Between Languages
Aleksandar Petrov
Emanuele La Malfa
Philip H. S. Torr
Adel Bibi
16
96
0
17 May 2023
On the Origins of Bias in NLP through the Lens of the Jim Code
Fatma Elsafoury
Gavin Abercrombie
28
4
0
16 May 2023
Constructing Holistic Measures for Social Biases in Masked Language Models
Y. Liu
Yuexian Hou
13
0
0
12 May 2023
A Survey on Intersectional Fairness in Machine Learning: Notions, Mitigation, and Challenges
Usman Gohar
Lu Cheng
FaML
27
31
0
11 May 2023
StarCoder: may the source be with you!
Raymond Li
Loubna Ben Allal
Yangtian Zi
Niklas Muennighoff
Denis Kocetkov
...
Sean M. Hughes
Thomas Wolf
Arjun Guha
Leandro von Werra
H. D. Vries
37
710
0
09 May 2023
On the Independence of Association Bias and Empirical Fairness in Language Models
Laura Cabello
Anna Katrine van Zee
Anders Søgaard
24
25
0
20 Apr 2023
Effectiveness of Debiasing Techniques: An Indigenous Qualitative Analysis
Vithya Yogarajan
Gillian Dobbie
H. Gouk
6
3
0
17 Apr 2023
Evaluation of Social Biases in Recent Large Pre-Trained Models
Swapnil Sharma
Nikita Anand
V. KranthiKiranG.
Alind Jain
13
0
0
13 Apr 2023
Toxicity in ChatGPT: Analyzing Persona-assigned Language Models
A. Deshpande
Vishvak Murahari
Tanmay Rajpurohit
A. Kalyan
Karthik Narasimhan
LM&MA
LLMAG
11
332
0
11 Apr 2023
An Overview on Language Models: Recent Developments and Outlook
Chengwei Wei
Yun Cheng Wang
Bin Wang
C.-C. Jay Kuo
10
41
0
10 Mar 2023
Logic Against Bias: Textual Entailment Mitigates Stereotypical Sentence Reasoning
Hongyin Luo
James R. Glass
NAI
21
7
0
10 Mar 2023
In-Depth Look at Word Filling Societal Bias Measures
Matúš Pikuliak
Ivana Benová
Viktor Bachratý
21
9
0
24 Feb 2023
Fairness in Language Models Beyond English: Gaps and Challenges
Krithika Ramesh
Sunayana Sitaram
Monojit Choudhury
22
23
0
24 Feb 2023
Auditing large language models: a three-layered approach
Jakob Mokander
Jonas Schuett
Hannah Rose Kirk
Luciano Floridi
AILaw
MLAU
29
193
0
16 Feb 2023
BiasTestGPT: Using ChatGPT for Social Bias Testing of Language Models
Rafal Kocielnik
Shrimai Prabhumoye
Vivian Zhang
Roy Jiang
R. Alvarez
Anima Anandkumar
30
6
0
14 Feb 2023
Fair Enough: Standardizing Evaluation and Model Selection for Fairness Research in NLP
Xudong Han
Timothy Baldwin
Trevor Cohn
16
12
0
11 Feb 2023
Counter-GAP: Counterfactual Bias Evaluation through Gendered Ambiguous Pronouns
Zhongbin Xie
Vid Kocijan
Thomas Lukasiewicz
Oana-Maria Camburu
6
2
0
11 Feb 2023
Erasure of Unaligned Attributes from Neural Representations
Shun Shao
Yftah Ziser
Shay B. Cohen
12
9
0
06 Feb 2023
FineDeb: A Debiasing Framework for Language Models
Akash Saravanan
Dhruv Mullick
Habibur Rahman
Nidhi Hegde
FedML
AI4CE
10
4
0
05 Feb 2023
How Far Can It Go?: On Intrinsic Gender Bias Mitigation for Text Classification
E. Tokpo
Pieter Delobelle
Bettina Berendt
T. Calders
35
7
0
30 Jan 2023
Comparing Intrinsic Gender Bias Evaluation Measures without using Human Annotated Examples
Masahiro Kaneko
Danushka Bollegala
Naoaki Okazaki
16
9
0
28 Jan 2023
Vision-Language Models Performing Zero-Shot Tasks Exhibit Gender-based Disparities
Melissa Hall
Laura Gustafson
Aaron B. Adcock
Ishan Misra
Candace Ross
VLM
24
22
0
26 Jan 2023
An Empirical Study of Metrics to Measure Representational Harms in Pre-Trained Language Models
Saghar Hosseini
Hamid Palangi
Ahmed Hassan Awadallah
14
21
0
22 Jan 2023
Blacks is to Anger as Whites is to Joy? Understanding Latent Affective Bias in Large Pre-trained Neural Language Models
Anoop Kadan
P Deepak
Sahely Bhadra
Manjary P.Gangan
L. LajishV.
19
2
0
21 Jan 2023
Contrastive Language-Vision AI Models Pretrained on Web-Scraped Multimodal Data Exhibit Sexual Objectification Bias
Robert Wolfe
Yiwei Yang
Billy Howe
Aylin Caliskan
DiffM
13
51
0
21 Dec 2022
Trustworthy Social Bias Measurement
Rishi Bommasani
Percy Liang
27
10
0
20 Dec 2022
Cross-Lingual Retrieval Augmented Prompt for Low-Resource Languages
Ercong Nie
Sheng Liang
Helmut Schmid
Hinrich Schütze
VLM
RALM
LRM
20
22
0
19 Dec 2022
Undesirable Biases in NLP: Addressing Challenges of Measurement
Oskar van der Wal
Dominik Bachmann
Alina Leidinger
L. Maanen
Willem H. Zuidema
K. Schulz
17
6
0
24 Nov 2022
Deep Learning on a Healthy Data Diet: Finding Important Examples for Fairness
A. Zayed
Prasanna Parthasarathi
Gonçalo Mordido
Hamid Palangi
Samira Shabanian
Sarath Chandar
18
21
0
20 Nov 2022
Conceptor-Aided Debiasing of Large Language Models
Yifei Li
Lyle Ungar
João Sedoc
6
4
0
20 Nov 2022
Previous
1
2
3
4
5
6
7
8
Next