Pride and Prejudice: LLM Amplifies Self-Bias in Self-Refinement

18 February 2024
Wenda Xu
Guanglei Zhu
Xuandong Zhao
Liangming Pan
Lei Li
Wenjie Wang

Papers citing "Pride and Prejudice: LLM Amplifies Self-Bias in Self-Refinement"

12 / 62 papers shown
Self-Correction is More than Refinement: A Learning Framework for Visual and Language Reasoning Tasks
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Jiayi He
Hehai Lin
Q. Wang
Yi R. Fung
Chenhui Xu
ReLM, LRM
594
27
0
05 Oct 2024
Learning Code Preference via Synthetic Evolution
Jiawei Liu
Thanh Nguyen
Mingyue Shang
Hantian Ding
Xiaopeng Li
Yu Yu
Varun Kumar
Zijian Wang
SyDa, ALM, AAML
221
18
0
04 Oct 2024
Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge
International Conference on Learning Representations (ICLR), 2024
Jiayi Ye
Zixiang Xu
Yue Huang
Dongping Chen
Qihui Zhang
...
Werner Geyer
Chao Huang
Pin-Yu Chen
Nitesh Chawla
Xiangliang Zhang
ELM
368
207
0
03 Oct 2024
Depression Diagnosis Dialogue Simulation: Self-improving Psychiatrist with Tertiary Memory
Kunyao Lan
Bingrui Jin
Zichen Zhu
Siyuan Chen
Shu Zhang
Kenny Q. Zhu
Mengyue Wu
288
8
0
20 Sep 2024
PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation
Ilya Gusev
LLMAG
518
5
0
10 Sep 2024
From Calculation to Adjudication: Examining LLM judges on Mathematical Reasoning Tasks
Andreas Stephan
D. Zhu
Matthias Aßenmacher
Xiaoyu Shen
Benjamin Roth
ELM
418
15
0
06 Sep 2024
Internal Consistency and Self-Feedback in Large Language Models: A Survey
Xun Liang
Chenyang Xi
Zifan Zheng
Ding Chen
Qingchen Yu
...
Rong-Hua Li
Peng Cheng
Zhonghao Wang
Feiyu Xiong
Zhiyu Li
HILM, LRM
502
46
0
19 Jul 2024
Can Automatic Metrics Assess High-Quality Translations?
Sweta Agrawal
António Farinhas
Ricardo Rei
André F. T. Martins
174
16
0
28 May 2024
Advancing LLM Reasoning Generalists with Preference Trees
Lifan Yuan
Ganqu Cui
Hanbin Wang
Ning Ding
Xingyao Wang
...
Zhenghao Liu
Bowen Zhou
Yuan Yao
Zhiyuan Liu
Maosong Sun
LRM
333
179
0
02 Apr 2024
Dialectical Alignment: Resolving the Tension of 3H and Security Threats of LLMs
Shu Yang
Jiayuan Su
Han Jiang
Mengdi Li
Keyuan Cheng
Muhammad Asif Ali
Lijie Hu
Haiyan Zhao
290
8
0
30 Mar 2024
MONAL: Model Autophagy Analysis for Modeling Human-AI Interactions
Shu Yang
Muhammad Asif Ali
Lu Yu
Lijie Hu
Haiyan Zhao
LLMAG
251
5
0
17 Feb 2024
Feedback Loops With Language Models Drive In-Context Reward Hacking
International Conference on Machine Learning (ICML), 2024
Alexander Pan
Erik Jones
Meena Jagadeesan
Jacob Steinhardt
KELM
402
56
0
09 Feb 2024