v1v2 (latest)

Pride and Prejudice: LLM Amplifies Self-Bias in Self-Refinement

18 February 2024

Lei Li

ArXiv (abs)PDF HTML Github (8★)

Papers citing "Pride and Prejudice: LLM Amplifies Self-Bias in Self-Refinement"

12 / 62 papers shown

Self-Correction is More than Refinement: A Learning Framework for Visual and Language Reasoning TasksAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

594

05 Oct 2024

Learning Code Preference via Synthetic Evolution

221

04 Oct 2024

Justice or Prejudice? Quantifying Biases in LLM-as-a-JudgeInternational Conference on Learning Representations (ICLR), 2024

Jiayi Ye

Zixiang Xu

Yue Huang

Dongping Chen

...

Xiangliang Zhang

368

207

03 Oct 2024

Depression Diagnosis Dialogue Simulation: Self-improving Psychiatrist with Tertiary Memory

Bingrui Jin

Mengyue Wu

288

20 Sep 2024

PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation

Ilya Gusev

LLMAG

518

10 Sep 2024

From Calculation to Adjudication: Examining LLM judges on Mathematical Reasoning Tasks

418

06 Sep 2024

Internal Consistency and Self-Feedback in Large Language Models: A Survey

...

502

19 Jul 2024

Can Automatic Metrics Assess High-Quality Translations?

Sweta Agrawal

António Farinhas

Ricardo Rei

André F. T. Martins

174

28 May 2024

Advancing LLM Reasoning Generalists with Preference Trees

...

Zhiyuan Liu

Maosong Sun

LRM

333

179

02 Apr 2024

Dialectical Alignment: Resolving the Tension of 3H and Security Threats of LLMs

Lijie Hu

290

30 Mar 2024

MONAL: Model Autophagy Analysis for Modeling Human-AI Interactions

Lijie Hu

251

17 Feb 2024

Feedback Loops With Language Models Drive In-Context Reward HackingInternational Conference on Machine Learning (ICML), 2024

402

09 Feb 2024