ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.06233
  4. Cited By
Data Contamination Quiz: A Tool to Detect and Estimate Contamination in
  Large Language Models
v1v2v3v4v5v6 (latest)

Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language Models

Transactions of the Association for Computational Linguistics (TACL), 2023
10 November 2023
Shahriar Golchin
Mihai Surdeanu
ArXiv (abs)PDFHTML

Papers citing "Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language Models"

19 / 19 papers shown
Title
Breaking Memorization Barriers in LLM Code Fine-Tuning via Information Bottleneck for Improved Generalization
Breaking Memorization Barriers in LLM Code Fine-Tuning via Information Bottleneck for Improved Generalization
Changsheng Wang
Xin Chen
Sijia Liu
Ke Ding
CLL
136
0
0
15 Oct 2025
MemLens: Uncovering Memorization in LLMs with Activation Trajectories
MemLens: Uncovering Memorization in LLMs with Activation Trajectories
Zirui He
Haiyan Zhao
Ali Payani
Mengnan Du
108
0
0
25 Sep 2025
How Can I Publish My LLM Benchmark Without Giving the True Answers Away?
How Can I Publish My LLM Benchmark Without Giving the True Answers Away?
Takashi Ishida
Thanawat Lodkaew
Ikko Yamane
605
1
0
23 May 2025
Confidence in Large Language Model Evaluation: A Bayesian Approach to Limited-Sample Challenges
Confidence in Large Language Model Evaluation: A Bayesian Approach to Limited-Sample Challenges
Xiao Xiao
Yu Su
Sijing Zhang
Zhang Chen
Yadong Chen
Tian Liu
197
6
0
30 Apr 2025
Language Models May Verbatim Complete Text They Were Not Explicitly Trained On
Language Models May Verbatim Complete Text They Were Not Explicitly Trained On
Katja Filippova
Christopher A. Choquette-Choo
Matthew Jagielski
Peter Kairouz
Sanmi Koyejo
Abigail Z. Jacobs
Nicolas Papernot
391
11
0
21 Mar 2025
The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination
The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination
Yifan Sun
Han Wang
Dongbai Li
Gang Wang
Huan Zhang
AAML
264
4
0
20 Mar 2025
Using Large Language Models for Automated Grading of Student Writing
  about Science
Using Large Language Models for Automated Grading of Student Writing about ScienceInternational Journal of Artificial Intelligence in Education (IJAIED), 2024
Chris Impey
Matthew Wenger
Nikhil Garuda
Shahriar Golchin
Sarah Stamer
ELMAI4Ed
129
14
0
25 Dec 2024
On Memorization of Large Language Models in Logical Reasoning
On Memorization of Large Language Models in Logical Reasoning
Chulin Xie
Yangsibo Huang
Chiyuan Zhang
Da Yu
Xinyun Chen
Bill Yuchen Lin
Bo Li
Badih Ghazi
Ravi Kumar
LRM
372
90
0
30 Oct 2024
ASR Error Correction using Large Language Models
ASR Error Correction using Large Language ModelsIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2024
Rao Ma
Mengjie Qian
Mark Gales
Kate Knill
KELM
267
19
0
14 Sep 2024
DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning
  Graph
DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph
Zhehao Zhang
Jiaao Chen
Diyi Yang
LRM
184
23
0
25 Jun 2024
Large Language Models are Zero-Shot Next Location Predictors
Large Language Models are Zero-Shot Next Location Predictors
Ciro Beneduce
Bruno Lepri
Massimiliano Luca
332
19
0
31 May 2024
ConStat: Performance-Based Contamination Detection in Large Language
  Models
ConStat: Performance-Based Contamination Detection in Large Language Models
Jasper Dekoninck
Mark Niklas Muller
Martin Vechev
139
14
0
25 May 2024
Benchmarking Benchmark Leakage in Large Language Models
Benchmarking Benchmark Leakage in Large Language Models
Ruijie Xu
Zengzhi Wang
Run-Ze Fan
Pengfei Liu
203
89
0
29 Apr 2024
TRUCE: Private Benchmarking to Prevent Contamination and Improve
  Comparative Evaluation of LLMs
TRUCE: Private Benchmarking to Prevent Contamination and Improve Comparative Evaluation of LLMs
Tanmay Rajore
Nishanth Chandran
Sunayana Sitaram
Divya Gupta
Rahul Sharma
Kashish Mittal
Manohar Swaminathan
245
22
0
01 Mar 2024
Dynamic Evaluation of Large Language Models by Meta Probing Agents
Dynamic Evaluation of Large Language Models by Meta Probing Agents
Lingyao Li
Yongfeng Zhang
Qinlin Zhao
Ruochen Xu
Xing Xie
297
54
0
21 Feb 2024
DE-COP: Detecting Copyrighted Content in Language Models Training Data
DE-COP: Detecting Copyrighted Content in Language Models Training Data
André V. Duarte
Xuandong Zhao
Arlindo L. Oliveira
Lei Li
309
63
0
15 Feb 2024
Large Language Models As MOOCs Graders
Large Language Models As MOOCs Graders
Shahriar Golchin
Nikhil Garuda
Christopher Impey
Matthew Wenger
AI4Ed
127
6
0
06 Feb 2024
Evading Data Contamination Detection for Language Models is (too) Easy
Evading Data Contamination Detection for Language Models is (too) Easy
Jasper Dekoninck
Mark Niklas Muller
Maximilian Baader
Marc Fischer
Martin Vechev
283
28
0
05 Feb 2024
DyVal: Dynamic Evaluation of Large Language Models for Reasoning Tasks
DyVal: Dynamic Evaluation of Large Language Models for Reasoning TasksInternational Conference on Learning Representations (ICLR), 2023
A. Maritan
Jiaao Chen
S. Dey
Luca Schenato
Diyi Yang
Xing Xie
ELMLRM
339
78
0
29 Sep 2023
1