1911.01485
Assessing Social and Intersectional Biases in Contextualized Word Representations
Neural Information Processing Systems (NeurIPS), 2019
4 November 2019
Y. Tan
Elisa Celis
FaML
Papers citing
"Assessing Social and Intersectional Biases in Contextualized Word Representations"
50 / 142 papers shown
Title
Adjacent Words, Divergent Intents: Jailbreaking Large Language Models via Task Concurrency
Yukun Jiang
Mingjie Li
Michael Backes
Yang Zhang
116
3
0
24 Oct 2025
Prompting Away Stereotypes? Evaluating Bias in Text-to-Image Models for Occupations
Shaina Raza
Maximus Powers
Partha Pratim Saha
Mahveen Raza
Rizwan Qureshi
68
0
0
31 Aug 2025
Benchmarking Sociolinguistic Diversity in Swahili NLP: A Taxonomy-Guided Approach
Kezia Oketch
John P. Lalor
A. Abbasi
84
0
0
06 Aug 2025
Exploring Gender Bias in Large Language Models: An In-depth Dive into the German Language
Kristin Gnadt
David Thulke
Simone Kopeinik
Ralf Schluter
145
0
0
22 Jul 2025
Measuring (a Sufficient) World Model in LLMs: A Variance Decomposition Framework
Nadav Kunievsky
James A. Evans
151
0
0
19 Jun 2025
Biases Propagate in Encoder-based Vision-Language Models: A Systematic Analysis From Intrinsic Measures to Zero-shot Retrieval Outcomes
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Kshitish Ghate
Tessa E. S. Charlesworth
Mona Diab
Aylin Caliskan
VLM
115
2
0
06 Jun 2025
Words of Warmth: Trust and Sociability Norms for over 26k English Words
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Saif M. Mohammad
141
1
0
04 Jun 2025
A Survey on Stereotype Detection in Natural Language Processing
ACM Computing Surveys (ACM Comput. Surv.), 2025
Alessandra Teresa Cignarella
Anastasia Giachanou
Els Lefever
148
0
0
23 May 2025
Mitigate One, Skew Another? Tackling Intersectional Biases in Text-to-Image Models
Pushkar Shukla
Aditya Chinchure
Emily Diana
A. Tolbert
K. Hosanagar
Vineeth N. Balasubramanian
Leonid Sigal
Matthew Turk
142
2
0
22 May 2025
HInter: Exposing Hidden Intersectional Bias in Large Language Models
Badr Souani
E. Soremekun
Mike Papadakis
Setsuko Yokoyama
Sudipta Chattopadhyay
Yves Le Traon
207
2
0
15 Mar 2025
Efficient Safety Alignment of Large Language Models via Preference Re-ranking and Representation-based Reward Modeling
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Qiyuan Deng
X. Bai
Kehai Chen
Yaowei Wang
Liqiang Nie
Min Zhang
OffRL
229
0
0
13 Mar 2025
BiasConnect: Investigating Bias Interactions in Text-to-Image Models
Pushkar Shukla
Aditya Chinchure
Emily Diana
A. Tolbert
K. Hosanagar
Vineeth N. Balasubramanian
Leonid Sigal
Matthew Turk
167
2
0
12 Mar 2025
Fair Text Classification via Transferable Representations
Thibaud Leteno
Michael Perrot
Charlotte Laclau
Antoine Gourru
Christophe Gravier
FaML
317
0
0
10 Mar 2025
FairSense-AI: Responsible AI Meets Sustainability
Shaina Raza
Mukund Sayeeganesh Chettiar
Matin Yousefabadi
Tahniat Khan
Veronica Chatrath
295
0
0
04 Mar 2025
Evaluating the Effect of Retrieval Augmentation on Social Biases
Tianhui Zhang
Yi Zhou
Danushka Bollegala
249
1
0
24 Feb 2025
Profiling Bias in LLMs: Stereotype Dimensions in Contextual Word Embeddings
Carolin M. Schuster
Maria-Alexandra Dinisor
Shashwat Ghatiwala
Georg Groh
323
3
0
25 Nov 2024
Speciesism in Natural Language Processing Research
AI and Ethics (AI & Ethics), 2024
Masashi Takeshita
Rafal Rzepka
177
6
0
18 Oct 2024
Ethics Whitepaper: Whitepaper on Ethical Research into Large Language Models
Eddie L. Ungless
Nikolas Vitsakis
Zeerak Talat
James Garforth
Bjorn Ross
Arno Onken
Atoosa Kasirzadeh
Alexandra Birch
238
3
0
17 Oct 2024
Mitigating Biases to Embrace Diversity: A Comprehensive Annotation Benchmark for Toxic Language
Xinmeng Hou
241
1
0
17 Oct 2024
Crossing Margins: Intersectional Users' Ethical Concerns about Software
Lauren Olson
Tom P. Humbert
Ricarda Anna-Lena Fischer
Bob Westerveld
Florian Kunneman
Emitzá Guzmán
104
2
0
10 Oct 2024
Make Compound Sentences Simple to Analyze: Learning to Split Sentences for Aspect-based Sentiment Analysis
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yongsik Seo
Sungwon Song
Ryang Heo
Jieyong Kim
Dongha Lee
CoGe
185
3
0
03 Oct 2024
What is the social benefit of hate speech detection research? A Systematic Review
Sidney Gig-Jan Wong
127
1
0
26 Sep 2024
A Study on Bias Detection and Classification in Natural Language Processing
Ana Sofia Evans
Helena Moniz
Luísa Coheur
151
1
0
14 Aug 2024
The BIAS Detection Framework: Bias Detection in Word Embeddings and Language Models for European Languages
A. Puttick
Leander Rankwiler
Catherine Ikae
Mascha Kurpicz-Briki
140
3
0
26 Jul 2024
Fairness Definitions in Language Models Explained
Thang Viet Doan
Zhibo Chu
Sribala Vidyadhari Chinta
Wenbin Zhang
ALM
315
17
0
26 Jul 2024
How Are LLMs Mitigating Stereotyping Harms? Learning from Search Engine Studies
Alina Leidinger
Richard Rogers
339
18
0
16 Jul 2024
Who is better at math, Jenny or Jingzhen? Uncovering Stereotypes in Large Language Models
Zara Siddique
Liam D. Turner
Luis Espinosa-Anke
169
2
0
09 Jul 2024
Sociocultural Considerations in Monitoring Anti-LGBTQ+ Content on Social Media
Sidney G.-J. Wong
118
0
0
01 Jul 2024
Fairness and Bias in Multimodal AI: A Survey
Tosin Adewumi
Lama Alkhaled
Namrata Gurung
G. V. Boven
Irene Pagliai
293
23
0
27 Jun 2024
Whose Preferences? Differences in Fairness Preferences and Their Impact on the Fairness of AI Utilizing Human Feedback
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Emilia Agis Lerner
Florian E. Dorner
Elliott Ash
Naman Goel
178
3
0
09 Jun 2024
Hate Speech Detection with Generalizable Target-aware Fairness
Tong Chen
Danny Wang
Xurong Liang
Marten Risius
Gianluca Demartini
Hongzhi Yin
365
10
0
28 May 2024
Quite Good, but Not Enough: Nationality Bias in Large Language Models -- A Case Study of ChatGPT
Shucheng Zhu
Weikang Wang
Ying Liu
206
16
0
11 May 2024
Hire Me or Not? Examining Language Model's Behavior with Occupation Attributes
International Conference on Computational Linguistics (COLING), 2024
Damin Zhang
Yi Zhang
Geetanjali Bihani
Julia Taylor Rayz
364
4
0
06 May 2024
GeniL: A Multilingual Dataset on Generalizing Language
Aida Mostafazadeh Davani
S. Gubbi
Sunipa Dev
Shachi Dave
Vinodkumar Prabhakaran
194
2
0
08 Apr 2024
IndiBias: A Benchmark Dataset to Measure Social Biases in Language Models for Indian Context
Nihar Ranjan Sahoo
Pranamya Prashant Kulkarni
Narjis Asad
Arif Ahmad
Tanu Goyal
Aparna Garimella
Pushpak Bhattacharyya
255
24
0
29 Mar 2024
Protected group bias and stereotypes in Large Language Models
Hadas Kotek
David Q. Sun
Zidi Xiu
Margit Bowler
Christopher Klein
AILaw
ALM
126
6
0
21 Mar 2024
Self-Consistent Reasoning-based Aspect-Sentiment Quad Prediction with Extract-Then-Assign Strategy
Jieyong Kim
Ryang Heo
Yongsik Seo
SeongKu Kang
Jinyoung Yeo
Dongha Lee
ReLM
LRM
136
14
0
01 Mar 2024
Large Language Models are Geographically Biased
Rohin Manvi
Samar Khanna
Marshall Burke
David B. Lobell
Stefano Ermon
329
82
0
05 Feb 2024
Verifiable evaluations of machine learning models using zkSNARKs
Tobin South
Alexander Camuto
Shrey Jain
Shayla Nguyen
Robert Mahari
Christian Paquin
Jason Morton
Alex Pentland
MLAU
ALM
165
18
0
05 Feb 2024
From Google Gemini to OpenAI Q* (Q-Star): A Survey of Reshaping the Generative Artificial Intelligence (AI) Research Landscape
Timothy R. McIntosh
Teo Susnjak
Tong Liu
Paul Watters
Malka N. Halgamuge
355
73
0
18 Dec 2023
Taxonomy-based CheckList for Large Language Model Evaluation
Damin Zhang
117
0
0
15 Dec 2023
Weakly Supervised Detection of Hallucinations in LLM Activations
Miriam Rateike
C. Cintas
John Wamburu
Tanya Akumu
Skyler Speakman
195
19
0
05 Dec 2023
What Do Llamas Really Think? Revealing Preference Biases in Language Model Representations
Raphael Tang
Xinyu Crystina Zhang
Jimmy J. Lin
Ferhan Ture
296
10
0
30 Nov 2023
Fair Text Classification with Wasserstein Independence
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Thibaud Leteno
Antoine Gourru
Charlotte Laclau
Rémi Emonet
Christophe Gravier
FaML
207
5
0
21 Nov 2023
Bias A-head? Analyzing Bias in Transformer-Based Language Model Attention Heads
Yi Yang
Hanyu Duan
Ahmed Abbasi
John P. Lalor
Kar Yan Tam
170
12
0
17 Nov 2023
Benefits and Harms of Large Language Models in Digital Mental Health
Munmun De Choudhury
Sachin R. Pendse
Neha Kumar
LM&MA
AI4MH
181
62
0
07 Nov 2023
Global Voices, Local Biases: Socio-Cultural Prejudices across Languages
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
A. Mukherjee
Chahat Raj
Ziwei Zhu
Antonios Anastasopoulos
173
24
0
26 Oct 2023
Geographical Erasure in Language Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Pola Schwöbel
Jacek Golebiowski
Michele Donini
Cédric Archambeau
Danish Pruthi
149
11
0
23 Oct 2023
How Good is ChatGPT in Giving Advice on Your Visualization Design?
Nam Wook Kim
Grace Myers
Benjamin Bach
418
31
0
14 Oct 2023
Toward Operationalizing Pipeline-aware ML Fairness: A Research Agenda for Developing Practical Guidelines and Tools
Conference on Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO), 2023
Maximilian Schambach
Rakshit Naidu
Rayid Ghani
Kit T. Rodolfa
Daniel E. Ho
Hoda Heidari
FaML
237
21
0
29 Sep 2023