v1v2v3 (latest)

Assistant-Guided Mitigation of Teacher Preference Bias in LLM-as-a-Judge

25 May 2025

Papers citing "Assistant-Guided Mitigation of Teacher Preference Bias in LLM-as-a-Judge"

19 / 19 papers shown

Do LLM Evaluators Prefer Themselves for a Reason?

359

04 Apr 2025

PRD: Peer Rank and Discussion Improve Large Language Model based Evaluations

556

126

03 Jan 2025

Justice or Prejudice? Quantifying Biases in LLM-as-a-JudgeInternational Conference on Learning Representations (ICLR), 2024

Jiayi Ye

Zixiang Xu

Yue Huang

Dongping Chen

...

Xiangliang Zhang

368

207

03 Oct 2024

Aligning Human and LLM Judgments: Insights from EvalAssist on Task-Specific Evaluations and AI-assisted Assessment Strategy PreferencesACM Symposium on User Interface Software and Technology (UIST), 2024

Martin Santillan Cooper

287

01 Oct 2024

Beyond Scalar Reward Model: Learning Generative Judge from Preference Data

Wei Shen

Dong Yan

Yiqun Liu

281

01 Oct 2024

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Jason Weston

374

156

28 Jul 2024

LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks

...

616

186

26 Jun 2024

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Graham Neubig

389

331

02 May 2024

LLM Evaluators Recognize and Favor Their Own Generations

Arjun Panickssery

Samuel R. Bowman

Shi Feng

446

366

15 Apr 2024

Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators

465

617

06 Apr 2024

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

Wei-Lin Chiang

Lianmin Zheng

Ying Sheng

Anastasios Nikolas Angelopoulos

Tianle Li

...

Hao Zhang

431

992

07 Mar 2024

Humans or LLMs as the Judge? A Study on Judgement Biases

568

214

16 Feb 2024

JudgeLM: Fine-tuned Large Language Models are Scalable JudgesInternational Conference on Learning Representations (ICLR), 2023

473

258

26 Oct 2023

Prometheus: Inducing Fine-grained Evaluation Capability in Language ModelsInternational Conference on Learning Representations (ICLR), 2023

...

534

375

12 Oct 2023

Judging LLM-as-a-Judge with MT-Bench and Chatbot ArenaNeural Information Processing Systems (NeurIPS), 2023

...

3.2K

6,725

09 Jun 2023

PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning OptimizationInternational Conference on Learning Representations (ICLR), 2023

...

Yue Zhang

479

332

08 Jun 2023

Benchmarking Foundation Models with Language-Model-as-an-ExaminerNeural Information Processing Systems (NeurIPS), 2023

Yuze He

...

Yijia Xiao

Haozhe Lyu

Jiayin Zhang

Juanzi Li

Lei Hou

ALM ELM

293

201

07 Jun 2023

Direct Preference Optimization: Your Language Model is Secretly a Reward ModelNeural Information Processing Systems (NeurIPS), 2023

Christopher D. Manning

Chelsea Finn

ALM

953

6,888

29 May 2023

G-Eval: NLG Evaluation using GPT-4 with Better Human AlignmentConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Yang Liu

Shuohang Wang

608

1,873

29 Mar 2023