ResearchTrend.AI
Benchmark Inflation: Revealing LLM Performance Gaps Using Retro-Holdouts

11 October 2024
Jacob Haimes
Cenny Wenner
Kunvar Thaman
Vassil Tashev
Clement Neo
Esben Kran
Jason Schreiber

Papers citing "Benchmark Inflation: Revealing LLM Performance Gaps Using Retro-Holdouts"

1 / 1 papers shown
Does Data Contamination Detection Work (Well) for LLMs? A Survey and Evaluation on Detection Assumptions
Yujuan Fu
Özlem Uzuner
Meliha Yetisgen
Fei Xia
24 Oct 2024