Preference Leakage: A Contamination Problem in LLM-as-a-judge

3 February 2025

Papers citing "Preference Leakage: A Contamination Problem in LLM-as-a-judge"

50 / 117 papers shown

BPO: Towards Balanced Preference Optimization between Knowledge Breadth and Depth in Alignment

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

484

21 Feb 2025

CLIPPER: Compression enables long-context synthetic data generation

443

20 Feb 2025

Who Taught You That? Tracing Teachers in Model Distillation

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

577

10 Feb 2025

Adversarial ML Problems Are Getting Harder to Solve and to Evaluate

368

04 Feb 2025

Quantification of Large Language Model Distillation

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

...

Hamid Alinejad-Rokny

310

22 Jan 2025

Assessing the Impact of Conspiracy Theories Using Large Language Models

430

09 Dec 2024

From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge

...

1.2K

311

25 Nov 2024

ShifCon: Enhancing Non-Dominant Language Capabilities with a Shift-based Contrastive Framework

629

25 Oct 2024

Agent-as-a-Judge: Evaluate Agents with Agents

Wenyi Wang

...

Raghuraman Krishnamoorthi

411

106

14 Oct 2024

Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge

International Conference on Learning Representations (ICLR), 2024

Jiayi Ye

Zixiang Xu

Yue Huang

Dongping Chen

...

Xiangliang Zhang

368

207

03 Oct 2024

Law of the Weakest Link: Cross Capabilities of Large Language Models

Ming Zhong

...

Dhruv Mahajan

Jiawei Han

Laurens van der Maaten

ELM

182

30 Sep 2024

Exploring Large Language Models for Feature Selection: A Data-centric Perspective

SIGKDD Explorations (SIGKDD Explor.), 2024

248

21 Aug 2024

Fostering Natural Conversation in Large Language Models with NICO: a Natural Interactive COnversation dataset

Jiaxing Zhang

249

18 Aug 2024

DataGen: Unified Synthetic Dataset Generation via Large Language Models

IEEE International Joint Conference on Neural Network (IJCNN), 2025

...

613

27 Jun 2024

LiveBench: A Challenging, Contamination-Limited LLM Benchmark

Manley Roberts

...

Tom Goldstein

Willie Neiswanger

Micah Goldblum

ELM

389

27 Jun 2024

Unveiling the Spectrum of Data Contamination in Language Models: A Survey from Detection to Remediation

Xiangru Tang

Arman Cohan

270

20 Jun 2024

Uncovering Latent Memories: Assessing Data Leakage and Memorization Patterns in Frontier AI Models

Ila R Fiete

414

20 Jun 2024

Data Contamination Can Cross Language Barriers

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Feng Yao

209

19 Jun 2024

Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges

Aman Singh Thakur

Kartik Choudhary

Venkat Srinik Ramayapally

Sankaran Vaidyanathan

Dieuwke Hupkes

ELM ALM

850

140

18 Jun 2024

From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline

Joseph E. Gonzalez

Ion Stoica

ALM

351

331

17 Jun 2024

Measuring memorization in RLHF for code completion

Aneesh Pappu

Billy Porter

Ilia Shumailov

Jamie Hayes

338

17 Jun 2024

Benchmark Data Contamination of Large Language Models: A Survey

287

06 Jun 2024

MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures

Graham Neubig

Yang You

ELM

211

03 Jun 2024

DALK: Dynamic Co-Augmentation of LLMs and KG to answer Alzheimer's Disease Questions with Scientific Literature

...

Huan Liu

339

08 May 2024

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Graham Neubig

389

331

02 May 2024

Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model

487

16 Apr 2024

LLM Evaluators Recognize and Favor Their Own Generations

Arjun Panickssery

Samuel R. Bowman

Shi Feng

443

366

15 Apr 2024

Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators

464

617

06 Apr 2024

Can LLMs Learn from Previous Mistakes? Investigating LLMs' Errors to Boost for Reasoning

415

29 Mar 2024

Optimization-based Prompt Injection Attack to LLM-as-a-Judge

550

121

26 Mar 2024

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

686

1,212

20 Mar 2024

Elephants Never Forget: Testing Language Models for Memorization of Tabular Data

233

11 Mar 2024

Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models

390

123

24 Feb 2024

Pride and Prejudice: LLM Amplifies Self-Bias in Self-Refinement

Lei Li

298

18 Feb 2024

Humans or LLMs as the Judge? A Study on Judgement Biases

568

214

16 Feb 2024

Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

441

15 Feb 2024

Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation

Linfeng Song

297

14 Feb 2024

Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMs

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2024

452

259

06 Feb 2024

Contextualization Distillation from Large Language Model for Knowledge Graph Completion

Findings (Findings), 2024

Huan Liu

387

28 Jan 2024

Investigating Data Contamination for Pre-training Language Models

321

11 Jan 2024

LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected?

...

310

11 Jan 2024

Task Contamination: Language Models May Not Be Few-Shot Anymore

Changmao Li

Jeffrey Flanigan

378

130

26 Dec 2023

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

430

325

25 Dec 2023

AlignBench: Benchmarking Chinese Alignment of Large Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Xiao Liu

...

Yuxiao Dong

381

30 Nov 2023

Investigating Data Contamination in Modern Benchmarks for Large Language Models

Arman Cohan

391

113

16 Nov 2023

LLMs as Narcissistic Evaluators: When Ego Inflates Evaluation Scores

441

16 Nov 2023

Ziya2: Data-centric Learning is All LLMs Need

...

301

06 Nov 2023

Prometheus: Inducing Fine-grained Evaluation Capability in Language Models

International Conference on Learning Representations (ICLR), 2023

...

531

375

12 Oct 2023

Mistral 7B

Albert Q. Jiang

Alexandre Sablayrolles

A. Mensch

Chris Bamford

Devendra Singh Chaplot

...

394

3,000

10 Oct 2023

MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use

International Conference on Learning Representations (ICLR), 2023

...

Lichao Sun

533

151

04 Oct 2023