Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.15938
Cited By
Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models
24 February 2024
Yihong Dong
Xue Jiang
Huanyu Liu
Zhi Jin
Bin Gu
Mengfei Yang
Ge Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models"
7 / 7 papers shown
Title
Generative Evaluation of Complex Reasoning in Large Language Models
Haowei Lin
X. Wang
Ruilin Yan
Baizhou Huang
Haotian Ye
Jianhua Zhu
Zihao Wang
James Y. Zou
Jianzhu Ma
Yitao Liang
ReLM
ELM
LRM
85
0
0
03 Apr 2025
Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination
D. Song
Sicheng Lai
Shunian Chen
Lichao Sun
Benyou Wang
56
0
0
06 Nov 2024
Detecting Training Data of Large Language Models via Expectation Maximization
Gyuwan Kim
Yang Li
Evangelia Spiliopoulou
Jie Ma
Miguel Ballesteros
William Yang Wang
MIALM
90
3
2
10 Oct 2024
Min-K%++: Improved Baseline for Detecting Pre-Training Data from Large Language Models
Jingyang Zhang
Jingwei Sun
Eric C. Yeats
Ouyang Yang
Martin Kuo
Jianyi Zhang
Hao Frank Yang
Hai Li
29
41
0
03 Apr 2024
Don't Make Your LLM an Evaluation Benchmark Cheater
Kun Zhou
Yutao Zhu
Zhipeng Chen
Wentong Chen
Wayne Xin Zhao
Xu Chen
Yankai Lin
Ji-Rong Wen
Jiawei Han
ELM
105
136
0
03 Nov 2023
Data Contamination Through the Lens of Time
Manley Roberts
Himanshu Thakur
Christine Herlihy
Colin White
Samuel Dooley
81
30
0
16 Oct 2023
PACE: Improving Prompt with Actor-Critic Editing for Large Language Model
Yihong Dong
Kangcheng Luo
Xue Jiang
Zhi Jin
Ge Li
LRM
KELM
10
9
0
19 Aug 2023
1