2304.03738
Cited By
Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language Models
First Monday (FM), 2023
7 April 2023
Emilio Ferrara
SILM
Papers citing
"Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language Models"
50 / 119 papers shown
Title
WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models
Neural Information Processing Systems (NeurIPS), 2024
Peng Wang
Zexi Li
Ningyu Zhang
Ziwen Xu
Yunzhi Yao
Yong Jiang
Pengjun Xie
Fei Huang
Huajun Chen
KELM
CLL
274
57
0
23 May 2024
Sociotechnical Implications of Generative Artificial Intelligence for Information Access
Bhaskar Mitra
Henriette Cramer
Olya Gurevich
282
7
0
19 May 2024
Binary Hypothesis Testing for Softmax Models and Leverage Score Models
Yeqi Gao
Yuzhou Gu
Zhao Song
372
1
0
09 May 2024
Influence of Solution Efficiency and Valence of Instruction on Additive and Subtractive Solution Strategies in Humans and GPT-4
Lydia Uhler
Verena Jordan
Jürgen Buder
Markus Huff
F. Papenmeier
186
0
0
25 Apr 2024
A Comprehensive Survey on Evaluating Large Language Model Applications in the Medical Industry
Yining Huang
Keke Tang
Meilian Chen
Boyuan Wang
ELM
LM&MA
357
28
0
24 Apr 2024
Beyond Human Norms: Unveiling Unique Values of Large Language Models through Interdisciplinary Approaches
Pablo Biedma
Xiaoyuan Yi
Linus Huang
Maosong Sun
Xing Xie
PILM
295
9
0
19 Apr 2024
RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
Shreyas Chaudhari
Pranjal Aggarwal
Vishvak Murahari
Tanmay Rajpurohit
Ashwin Kalyan
Karthik Narasimhan
Ameet Deshpande
Bruno Castro da Silva
342
83
0
12 Apr 2024
Will the Real Linda Please Stand up...to Large Language Models? Examining the Representativeness Heuristic in LLMs
Pengda Wang
Zilin Xiao
Hanjie Chen
Frederick L. Oswald
232
14
0
01 Apr 2024
A Survey on Multilingual Large Language Models: Corpora, Alignment, and Bias
Yuemei Xu
Ling Hu
Jiayi Zhao
Zihan Qiu
Yuqi Ye
Hanwen Gu
LRM
394
89
0
01 Apr 2024
Fairness in Large Language Models: A Taxonomic Survey
Zhibo Chu
Sribala Vidyadhari Chinta
Wenbin Zhang
AILaw
253
86
0
31 Mar 2024
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods
Yuji Cao
Huan Zhao
Yuheng Cheng
Ting Shu
Guolong Liu
Gaoqi Liang
Junhua Zhao
Yun Li
LLMAG
KELM
OffRL
LM&Ro
352
146
0
30 Mar 2024
The Strong Pull of Prior Knowledge in Large Language Models and Its Impact on Emotion Recognition
Georgios Chochlakis
Alexandros Potamianos
Kristina Lerman
Shrikanth Narayanan
159
7
0
25 Mar 2024
Locating and Mitigating Gender Bias in Large Language Models
Yuchen Cai
Ding Cao
Rongxi Guo
Yaqin Wen
Guiquan Liu
Enhong Chen
160
9
0
21 Mar 2024
Humanoid Robots and Humanoid AI: Review, Perspectives and Directions
ACM Computing Surveys (ACM CSUR), 2024
Longbing Cao
306
8
0
19 Mar 2024
Evaluating LLMs for Gender Disparities in Notable Persons
L. Rhue
Sofie Goethals
Arun Sundararajan
159
5
0
14 Mar 2024
Few-Shot Fairness: Unveiling LLM's Potential for Fairness-Aware Classification
Garima Chhikara
Anurag Sharma
Kripabandhu Ghosh
Abhijnan Chakraborty
228
21
0
28 Feb 2024
Aligning Large Language Models to a Domain-specific Graph Database
Yuanyuan Liang
Keren Tan
Tingyu Xie
Wenbiao Tao
Siyuan Wang
Yunshi Lan
Weining Qian
187
17
0
26 Feb 2024
Exploring ChatGPT and its Impact on Society
Md. Asraful Haque
Shuai Li
SILM
248
49
0
21 Feb 2024
Exploring the Impact of AI Value Alignment in Collaborative Ideation: Effects on Perception, Ownership, and Output
Alicia Guo
Pat Pataranutaporn
Pattie Maes
198
24
0
20 Feb 2024
I Am Not Them: Fluid Identities and Persistent Out-group Bias in Large Language Models
Wenchao Dong
Assem Zhunis
Hyojin Chin
Jiyoung Han
Meeyoung Cha
181
3
0
16 Feb 2024
Advancing Legal Reasoning: The Integration of AI to Navigate Complexities and Biases in Global Jurisprudence with Semi-Automated Arbitration Processes (SAAPs)
Michael De'Shazer
113
1
0
06 Feb 2024
Behind the Screen: Investigating ChatGPT's Dark Personality Traits and Conspiracy Beliefs
Erik Weber
Jérôme Rutinowski
Markus Pauly
90
3
0
06 Feb 2024
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems
Tianyu Cui
Yanling Wang
Chuanpu Fu
Yong Xiao
Sijia Li
...
Junwu Xiong
Xinyu Kong
ZuJie Wen
Ke Xu
Qi Li
272
97
0
11 Jan 2024
Understanding LLMs: A Comprehensive Overview from Training to Inference
Yi-Hsueh Liu
Haoyang He
Tianle Han
Xu-Yao Zhang
Mengyuan Liu
...
Xiaoyan Cai
Tuo Zhang
Ning Qiang
Tianming Liu
Bao Ge
SyDa
430
120
0
04 Jan 2024
A Novel Evaluation Framework for Assessing Resilience Against Prompt Injection Attacks in Large Language Models
Daniel Wankit Yip
Aysan Esmradi
C. Chan
AAML
178
20
0
02 Jan 2024
From Bytes to Biases: Investigating the Cultural Self-Perception of Large Language Models
Wolfgang Messner
Tatum Greene
Josephine Matalone
178
8
0
21 Dec 2023
Quantifying Bias in Text-to-Image Generative Models
Jordan Vice
Naveed Akhtar
Leonid Sigal
Lin Wang
210
18
0
20 Dec 2023
Cultural Bias and Cultural Alignment of Large Language Models
PNAS Nexus (PNAS Nexus), 2023
Yan Tao
Olga Viberg
Ryan S. Baker
René F. Kizilcec
ELM
361
195
0
23 Nov 2023
Sequencing Matters: A Generate-Retrieve-Generate Model for Building Conversational Agents
Quinn Patwardhan
Grace Hui Yang
125
3
0
16 Nov 2023
Health Disparities through Generative AI Models: A Comparison Study Using A Domain Specific large language model
Future Technologies Conference (FT), 2023
Yohn Jairo Parra Bautista
Vinicious Lima
Carlos Theran
Richard A. Aló
LM&MA
77
5
0
23 Oct 2023
Automated Claim Matching with Large Language Models: Empowering Fact-Checkers in the Fight Against Misinformation
Eun Cheol Choi
Emilio Ferrara
185
29
0
13 Oct 2023
Factuality Challenges in the Era of Large Language Models
Isabelle Augenstein
Timothy Baldwin
Meeyoung Cha
Tanmoy Chakraborty
Giovanni Luca Ciampaglia
...
Rubén Míguez
Preslav Nakov
Dietram A. Scheufele
Shivam Sharma
Giovanni Zagni
HILM
321
53
0
08 Oct 2023
A Formalism and Approach for Improving Robustness of Large Language Models Using Risk-Adjusted Confidence Scores
Ke Shen
Mayank Kejriwal
187
2
0
05 Oct 2023
MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use
International Conference on Learning Representations (ICLR), 2023
Yue Huang
Jiawen Shi
Yuan Li
Chenrui Fan
Siyuan Wu
...
Yixin Liu
Pan Zhou
Yao Wan
Neil Zhenqiang Gong
Lichao Sun
LLMAG
462
144
0
04 Oct 2023
DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models
International Conference on Learning Representations (ICLR), 2023
Yongchan Kwon
Eric Wu
K. Wu
James Zou
DiffM
TDI
343
91
0
02 Oct 2023
GenAI Against Humanity: Nefarious Applications of Generative Artificial Intelligence and Large Language Models
Journal of Computational Social Science (JCSS), 2023
Emilio Ferrara
326
161
0
01 Oct 2023
Open-Sourcing Highly Capable Foundation Models: An evaluation of risks, benefits, and alternative methods for pursuing open-source objectives
Social Science Research Network (SSRN), 2023
Elizabeth Seger
Noemi Dreksler
Richard Moulange
Emily Dardaman
Jonas Schuett
...
Emma Bluemke
Michael Aird
Patrick Levermore
Julian Hazell
Abhishek Gupta
166
55
0
29 Sep 2023
ChatGPT Performance on Standardized Testing Exam -- A Proposed Strategy for Learners
Umer Farooq
S. Anwar
29
5
0
25 Sep 2023
People's Perceptions Toward Bias and Related Concepts in Large Language Models: A Systematic Review
Lu Wang
Max Song
R. Rezapour
Bum Chul Kwon
Jina Huh-Yoo
AI4CE
259
5
0
25 Sep 2023
Public Perceptions of Gender Bias in Large Language Models: Cases of ChatGPT and Ernie
Kyrie Zhixuan Zhou
M. Sanfilippo
102
16
0
17 Sep 2023
Multimodal Multi-Hop Question Answering Through a Conversation Between Tools and Efficiently Finetuned Large Language Models
Hossein Rajabzadeh
Suyuchen Wang
Hyock Ju Kwon
Bang Liu
KELM
152
6
0
16 Sep 2023
Bias and Fairness in Chatbots: An Overview
APSIPA Transactions on Signal and Information Processing (TASIP), 2023
Jintang Xue
Yun Cheng Wang
Chengwei Wei
Xiaofeng Liu
Jonghye Woo
C.-C. Jay Kuo
283
54
0
16 Sep 2023
A Fast Optimization View: Reformulating Single Layer Attention in LLM Based on Tensor and SVM Trick, and Solving It in Matrix Multiplication Time
Yeqi Gao
Zhao Song
Weixin Wang
Junze Yin
258
30
0
14 Sep 2023
Generative AI
Business & Information Systems Engineering (BISE), 2023
Stefan Feuerriegel
Jochen Hartmann
Christian Janiesch
Patrick Zschech
307
998
0
13 Sep 2023
The Moral Machine Experiment on Large Language Models
Royal Society Open Science (RSOS), 2023
Kazuhiro Takemoto
112
38
0
12 Sep 2023
FuzzLLM: A Novel and Universal Fuzzing Framework for Proactively Discovering Jailbreak Vulnerabilities in Large Language Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Dongyu Yao
Jianshu Zhang
Ian G. Harris
Marcel Carlsson
229
57
0
11 Sep 2023
A Critical Examination of the Ethics of AI-Mediated Peer Review
L. Schintler
C. McNeely
James Witte
112
12
0
02 Sep 2023
Bias and Fairness in Large Language Models: A Survey
Computational Linguistics (CL), 2023
Isabel O. Gallegos
Ryan Rossi
Joe Barrow
Md Mehrab Tanjim
Sungchul Kim
Franck Dernoncourt
Tong Yu
Ruiyi Zhang
Nesreen Ahmed
AILaw
374
869
0
02 Sep 2023
Ethical Framework for Harnessing the Power of AI in Healthcare and Beyond
IEEE Access (IEEE Access), 2023
Sidra Nasir
Rizwan Ahmed Khan
Samita Bai
176
49
0
31 Aug 2023
Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs
Yuxia Wang
Jinyan Su
Xudong Han
Preslav Nakov
Timothy Baldwin
296
146
0
25 Aug 2023