2312.07492
Cited By
SocialStigmaQA: A Benchmark to Uncover Stigma Amplification in Generative Language Models
12 December 2023
Manish Nagireddy
Lamogha Chiazor
Moninder Singh
Ioana Baldini
Papers citing "SocialStigmaQA: A Benchmark to Uncover Stigma Amplification in Generative Language Models"
4 / 4 papers shown
Title
Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational Agents
Ivoline Ngong, Swanand Kadhe, Hao Wang, K. Murugesan, Justin D. Weisz, Amit Dhurandhar, K. Ramamurthy
44 2 0
22 Feb 2025

Programming Refusal with Conditional Activation Steering
Bruce W. Lee, Inkit Padhi, K. Ramamurthy, Erik Miehling, Pierre L. Dognin, Manish Nagireddy, Amit Dhurandhar
LLMSV
91 13 0
06 Sep 2024

Measure and Improve Robustness in NLP Models: A Survey
Xuezhi Wang, Haohan Wang, Diyi Yang
139 130 0
15 Dec 2021

BBQ: A Hand-Built Bias Benchmark for Question Answering
Alicia Parrish, Angelica Chen, Nikita Nangia, Vishakh Padmakumar, Jason Phang, Jana Thompson, Phu Mon Htut, Sam Bowman
212 367 0
15 Oct 2021