ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2306.03950
  4. Cited By
MISGENDERED: Limits of Large Language Models in Understanding Pronouns

MISGENDERED: Limits of Large Language Models in Understanding Pronouns

6 June 2023
Tamanna Hossain
Sunipa Dev
Sameer Singh
    AILaw
ArXivPDFHTML

Papers citing "MISGENDERED: Limits of Large Language Models in Understanding Pronouns"

28 / 28 papers shown
Title
Agree to Disagree? A Meta-Evaluation of LLM Misgendering
Agree to Disagree? A Meta-Evaluation of LLM Misgendering
Arjun Subramonian
Vagrant Gautam
Preethi Seshadri
Dietrich Klakow
Kai-Wei Chang
Yizhou Sun
27
1
0
23 Apr 2025
An LLM-as-a-judge Approach for Scalable Gender-Neutral Translation Evaluation
An LLM-as-a-judge Approach for Scalable Gender-Neutral Translation Evaluation
Andrea Piergentili
Beatrice Savoldi
Matteo Negri
L. Bentivogli
ELM
35
0
0
16 Apr 2025
Assumed Identities: Quantifying Gender Bias in Machine Translation of Ambiguous Occupational Terms
Orfeas Menis-Mastromichalakis
Giorgos Filandrianos
M. Symeonaki
Giorgos Stamou
55
0
0
06 Mar 2025
Robust Bias Detection in MLMs and its Application to Human Trait Ratings
Robust Bias Detection in MLMs and its Application to Human Trait Ratings
Ingroj Shrestha
Louis Tay
Padmini Srinivasan
73
0
0
24 Feb 2025
Gender-Neutral Large Language Models for Medical Applications: Reducing Bias in PubMed Abstracts
Gender-Neutral Large Language Models for Medical Applications: Reducing Bias in PubMed Abstracts
Elizabeth Schaefer
Kirk Roberts
36
0
0
10 Jan 2025
Solving the Challenge Set without Solving the Task: On Winograd Schemas
  as a Test of Pronominal Coreference Resolution
Solving the Challenge Set without Solving the Task: On Winograd Schemas as a Test of Pronominal Coreference Resolution
Ian Porada
Jackie C.K. Cheung
27
0
0
12 Oct 2024
The Lou Dataset -- Exploring the Impact of Gender-Fair Language in
  German Text Classification
The Lou Dataset -- Exploring the Impact of Gender-Fair Language in German Text Classification
Andreas Waldis
Joel Birrer
Anne Lauscher
Iryna Gurevych
25
1
0
26 Sep 2024
WinoPron: Revisiting English Winogender Schemas for Consistency,
  Coverage, and Grammatical Case
WinoPron: Revisiting English Winogender Schemas for Consistency, Coverage, and Grammatical Case
Vagrant Gautam
Julius Steuer
Eileen Bingert
Ray Johns
Anne Lauscher
Dietrich Klakow
48
3
0
09 Sep 2024
Probing Causality Manipulation of Large Language Models
Probing Causality Manipulation of Large Language Models
Chenyang Zhang
Haibo Tong
Bin Zhang
Dongyu Zhang
LRM
31
0
0
26 Aug 2024
How Well Do LLMs Identify Cultural Unity in Diversity?
How Well Do LLMs Identify Cultural Unity in Diversity?
Jialin Li
Junli Wang
Junjie Hu
Ming Jiang
35
4
0
09 Aug 2024
Exploring LGBTQ+ Bias in Generative AI Answers across Different Country
  and Religious Contexts
Exploring LGBTQ+ Bias in Generative AI Answers across Different Country and Religious Contexts
L. Vicsek
Anna Vancsó
Mike Zajko
Judit Takacs
29
0
0
03 Jul 2024
From Descriptive Richness to Bias: Unveiling the Dark Side of Generative
  Image Caption Enrichment
From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment
Yusuke Hirota
Ryo Hachiuma
Chao-Han Huck Yang
Yuta Nakashima
VLM
33
3
0
20 Jun 2024
QueerBench: Quantifying Discrimination in Language Models Toward Queer
  Identities
QueerBench: Quantifying Discrimination in Language Models Toward Queer Identities
Mae Sosto
Alberto Barrón-Cedeño
30
3
0
18 Jun 2024
Do Large Language Models Discriminate in Hiring Decisions on the Basis
  of Race, Ethnicity, and Gender?
Do Large Language Models Discriminate in Hiring Decisions on the Basis of Race, Ethnicity, and Gender?
Haozhe An
Christabel Acquaye
Colin Wang
Zongxia Li
Rachel Rudinger
36
12
0
15 Jun 2024
Learning from Natural Language Explanations for Generalizable Entity
  Matching
Learning from Natural Language Explanations for Generalizable Entity Matching
Somin Wadhwa
Adit Krishnan
Runhui Wang
Byron C. Wallace
Chris Kong
LRM
37
3
0
13 Jun 2024
AI Agents Under Threat: A Survey of Key Security Challenges and Future
  Pathways
AI Agents Under Threat: A Survey of Key Security Challenges and Future Pathways
Zehang Deng
Yongjian Guo
Changzhou Han
Wanlun Ma
Junwu Xiong
Sheng Wen
Yang Xiang
42
23
0
04 Jun 2024
Enhancing Gender-Inclusive Machine Translation with Neomorphemes and
  Large Language Models
Enhancing Gender-Inclusive Machine Translation with Neomorphemes and Large Language Models
Andrea Piergentili
Beatrice Savoldi
Matteo Negri
L. Bentivogli
31
5
0
14 May 2024
Transforming Dutch: Debiasing Dutch Coreference Resolution Systems for
  Non-binary Pronouns
Transforming Dutch: Debiasing Dutch Coreference Resolution Systems for Non-binary Pronouns
Goya van Boven
Yupei Du
Dong Nguyen
21
1
0
30 Apr 2024
MisgenderMender: A Community-Informed Approach to Interventions for
  Misgendering
MisgenderMender: A Community-Informed Approach to Interventions for Misgendering
Tamanna Hossain
Sunipa Dev
Sameer Singh
21
5
0
23 Apr 2024
Robust Pronoun Fidelity with English LLMs: Are they Reasoning,
  Repeating, or Just Biased?
Robust Pronoun Fidelity with English LLMs: Are they Reasoning, Repeating, or Just Biased?
Vagrant Gautam
Eileen Bingert
D. Zhu
Anne Lauscher
Dietrich Klakow
43
8
0
04 Apr 2024
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language
  Model Systems
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems
Tianyu Cui
Yanling Wang
Chuanpu Fu
Yong Xiao
Sijia Li
...
Junwu Xiong
Xinyu Kong
Zujie Wen
Ke Xu
Qi Li
55
56
0
11 Jan 2024
Tokenization Matters: Navigating Data-Scarce Tokenization for Gender
  Inclusive Language Technologies
Tokenization Matters: Navigating Data-Scarce Tokenization for Gender Inclusive Language Technologies
Anaelia Ovalle
Ninareh Mehrabi
Palash Goyal
Jwala Dhamala
Kai-Wei Chang
Richard Zemel
Aram Galstyan
Yuval Pinter
Rahul Gupta
14
10
0
19 Dec 2023
Survey on Factuality in Large Language Models: Knowledge, Retrieval and
  Domain-Specificity
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity
Cunxiang Wang
Xiaoze Liu
Yuanhao Yue
Xiangru Tang
Tianhang Zhang
...
Linyi Yang
Jindong Wang
Xing Xie
Zheng-Wei Zhang
Yue Zhang
HILM
KELM
51
182
0
11 Oct 2023
VisoGender: A dataset for benchmarking gender bias in image-text pronoun
  resolution
VisoGender: A dataset for benchmarking gender bias in image-text pronoun resolution
S. Hall
F. G. Abrantes
Hanwen Zhu
Grace A. Sodunke
Aleksandar Shtedritski
Hannah Rose Kirk
CoGe
11
38
0
21 Jun 2023
Challenges to Evaluating the Generalization of Coreference Resolution
  Models: A Measurement Modeling Perspective
Challenges to Evaluating the Generalization of Coreference Resolution Models: A Measurement Modeling Perspective
Ian Porada
Alexandra Olteanu
Kaheer Suleman
Adam Trischler
Jackie C.K. Cheung
27
6
0
16 Mar 2023
Quantifying Social Biases Using Templates is Unreliable
Quantifying Social Biases Using Templates is Unreliable
P. Seshadri
Pouya Pezeshkpour
Sameer Singh
51
33
0
09 Oct 2022
Fantastically Ordered Prompts and Where to Find Them: Overcoming
  Few-Shot Prompt Order Sensitivity
Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity
Yao Lu
Max Bartolo
Alastair Moore
Sebastian Riedel
Pontus Stenetorp
AILaw
LRM
277
1,117
0
18 Apr 2021
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Leo Gao
Stella Biderman
Sid Black
Laurence Golding
Travis Hoppe
...
Horace He
Anish Thite
Noa Nabeshima
Shawn Presser
Connor Leahy
AIMat
248
1,986
0
31 Dec 2020
1