2410.09247
Cited By
Benchmark Inflation: Revealing LLM Performance Gaps Using Retro-Holdouts
11 October 2024
Jacob Haimes
Cenny Wenner
Kunvar Thaman
Vassil Tashev
Clement Neo
Esben Kran
Jason Schreiber
Papers citing "Benchmark Inflation: Revealing LLM Performance Gaps Using Retro-Holdouts"
1 / 1 papers shown
Title
Does Data Contamination Detection Work (Well) for LLMs? A Survey and Evaluation on Detection Assumptions
Yujuan Fu
Özlem Uzuner
Meliha Yetisgen
Fei Xia
45
3
0
24 Oct 2024