2012.07805
Cited By
v1
v2 (latest)
Extracting Training Data from Large Language Models
USENIX Security Symposium (USENIX Security), 2020
14 December 2020
Nicholas Carlini
Florian Tramèr
Eric Wallace
Matthew Jagielski
Ariel Herbert-Voss
Katherine Lee
Adam Roberts
Tom B. Brown
Basel Alomair
Ulfar Erlingsson
Alina Oprea
Colin Raffel
MLAU
SILM
Papers citing
"Extracting Training Data from Large Language Models"
50 / 1,528 papers shown
Executable Governance for AI: Translating Policies into Rules Using LLMs
Gautam Varma Datla
Anudeep Vurity
Tejaswani Dash
Tazeem Ahmad
Mohd Adnan
Saima Rafi
ELM
156
1
0
04 Dec 2025
Efficient Public Verification of Private ML via Regularization
Zoë Ruha Bell
Anvith Thudi
Olive Franzese-McLaughlin
Nicolas Papernot
Shafi Goldwasser
118
0
0
03 Dec 2025
Grokked Models are Better Unlearners
Yuanbang Liang
Yang Li
MU
OOD
441
0
0
03 Dec 2025
Context-Aware Hierarchical Learning: A Two-Step Paradigm towards Safer LLMs
Tengyun Ma
Jiaqi Yao
Daojing He
Shihao Peng
Yu Li
Shaohui Liu
Zhuotao Tian
198
0
0
03 Dec 2025
Towards Contextual Sensitive Data Detection
Liang Telkamp
Madelon Hulsebos
97
0
0
02 Dec 2025
Randomized Masked Finetuning: An Efficient Way to Mitigate Memorization of PIIs in LLMs
Kunj Joshi
David A. Smith
138
0
0
02 Dec 2025
Ensemble Privacy Defense for Knowledge-Intensive LLMs against Membership Inference Attacks
Haowei Fu
Bo Ni
Han Xu
Kunpeng Liu
Dan Lin
Tyler Derr
157
0
0
01 Dec 2025
Privacy Preserving Diffusion Models for Mixed-Type Tabular Data Generation
Timur Sattarov
Marco Schreyer
Damian Borth
150
0
0
29 Nov 2025
Orion-Bix: Bi-Axial Attention for Tabular In-Context Learning
Mohamed Bouadi
Pratinav Seth
Aditya Tanna
Vinay Kumar Sankarapu
91
1
0
28 Nov 2025
Decomposed Trust: Exploring Privacy, Adversarial Robustness, Fairness, and Ethics of Low-Rank LLMs
Daniel Agyei Asante
Md Mokarram Chowdhury
Yang Li
168
0
0
27 Nov 2025
Can Finetuning LLMs on Small Human Samples Increase Heterogeneity, Alignment, and Belief-Action Coherence?
Steven Wang
Kyle Hunt
Shaojie Tang
Kenneth Joseph
ALM
236
1
0
26 Nov 2025
Memories Retrieved from Many Paths: A Multi-Prefix Framework for Robust Detection of Training Data Leakage in Large Language Models
Trung Cuong Dang
David A. Mohaisen
AAML
236
2
0
25 Nov 2025
Quantifying the Privacy Implications of High-Fidelity Synthetic Network Traffic
Van-Tai Tran
Shinan Liu
Tian Li
Nick Feamster
MIACV
626
1
0
25 Nov 2025
Exploiting the Experts: Unauthorized Compression in MoE-LLMs
Pinaki Prasad Guha Neogi
Ahmad Mohammadshirazi
Dheeraj Kulshrestha
R. Ramnath
MoE
190
0
0
22 Nov 2025
Geometric-disentanglement Unlearning
Duo Zhou
Yuji Zhang
Tianxin Wei
Ruizhong Qiu
Ke Yang
...
Jingrui He
Hanghang Tong
Heng Ji
Huan Zhang
MU
384
0
0
21 Nov 2025
Music Recommendation with Large Language Models: Challenges, Opportunities, and Evaluation
Elena V. Epure
Yashar Deldjoo
Bruno Sguerra
Markus Schedl
Manuel Moussallam
213
0
0
20 Nov 2025
Membership Inference Attacks Beyond Overfitting
Mona Khalil
Alberto Blanco-Justicia
N. Jebreel
Josep Domingo-Ferrer
MIALM
250
0
0
20 Nov 2025
As If We've Met Before: LLMs Exhibit Certainty in Recognizing Seen Files
Haodong Li
Jingqi Zhang
Xiao Cheng
Peihua Mai
Haoyu Wang
Yang Pan
398
0
0
19 Nov 2025
Effective Code Membership Inference for Code Completion Models via Adversarial Prompts
Yuan Jiang
Zehao Li
Shan Huang
Christoph Treude
Xiaohong Su
Tiantian Wang
AAML
346
1
0
19 Nov 2025
How to Train Private Clinical Language Models: A Comparative Study of Privacy-Preserving Pipelines for ICD-9 Coding
Mathieu Dufour
Andrew Duncan
147
0
0
18 Nov 2025
AI Bill of Materials and Beyond: Systematizing Security Assurance through the AI Risk Scanning (AIRS) Framework
Samuel Nathanson
Alexander Lee
Catherine Chen Kieffer
Jared Junkin
Jessica Ye
Amir Saeed
Melanie Lockhart
Russ Fink
Elisha Peterson
Lanier Watkins
127
0
0
16 Nov 2025
Forgetting-MarI: LLM Unlearning via Marginal Information Regularization
Shizhou Xu
Yuan Ni
Stefan Broecker
Thomas Strohmer
MU
516
0
0
14 Nov 2025
InData: Towards Secure Multi-Step, Tool-Based Data Analysis
Karthikeyan K
Raghuveer Thirukovalluru
Bhuwan Dhingra
David Edwin Carlson
SyDa
190
0
0
14 Nov 2025
Concept-RuleNet: Grounded Multi-Agent Neurosymbolic Reasoning in Vision Language Models
Sanchit Sinha
Guangzhi Xiong
Zhenghao He
Aidong Zhang
154
0
0
13 Nov 2025
Hallucinate or Memorize? The Two Sides of Probabilistic Learning in Large Language Models
Journal of Imaging (JI), 2025
Junichiro Niimi
HILM
222
2
0
12 Nov 2025
Beyond Superficial Forgetting: Thorough Unlearning through Knowledge Density Estimation and Block Re-insertion
Feng Guo
Yuntao Wen
Shen Gao
Junshuo Zhang
Shuo Shang
KELM
MU
497
0
0
11 Nov 2025
SALT: Steering Activations towards Leakage-free Thinking in Chain of Thought
Shourya Batra
Pierce Tillman
Samarth Gaggar
Shashank Kesineni
Kevin Zhu
Sunishchal Dev
Ashwinee Panda
Vasu Sharma
Maheep Chaudhary
KELM
PILM
LLMSV
LRM
ELM
660
5
0
11 Nov 2025
Biologically-Informed Hybrid Membership Inference Attacks on Generative Genomic Models
Asia Belfiore
Jonathan Passerat-Palmbach
Dmitrii Usynin
186
1
0
10 Nov 2025
Uncovering Pretraining Code in LLMs: A Syntax-Aware Attribution Approach
Yuanheng Li
Z. Chen
Xiaoyun Liu
Yuhao Wang
Xin Peng
Yang Shi
Kaifeng Huang
Shengjie Zhao
AAML
287
0
0
10 Nov 2025
ConVerse: Benchmarking Contextual Safety in Agent-to-Agent Conversations
Amr Gomaa
Ahmed Salem
Sahar Abdelnabi
LLMAG
196
5
0
07 Nov 2025
REMIND: Input Loss Landscapes Reveal Residual Memorization in Post-Unlearning LLMs
Liran Cohen
Yaniv Nemcovesky
Avi Mendelson
MU
AAML
CLL
KELM
321
0
0
06 Nov 2025
Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
Anka Reuel
Avijit Ghosh
Jenny Chim
Andrew Tran
Yanan Long
...
Zeerak Talat
Stella Biderman
Mykel J. Kochenderfer
Sanmi Koyejo
Irene Solaiman
ELM
259
3
0
06 Nov 2025
Contamination Detection for VLMs using Multi-Modal Semantic Perturbation
J. Park
Mu Cai
Feng Yao
Jingbo Shang
Soochahn Lee
Yong Jae Lee
AAML
VLM
141
0
0
05 Nov 2025
LiveSecBench: A Dynamic and Event-Driven Safety Benchmark for Chinese Language Model Applications
Yudong Li
Zhongliang Yang
Kejiang Chen
Wenxuan Wang
TianXin Zhang
...
Xingchi Gu
Peiru Yang
Tianxin Zhang
Yue Gao
Yongfeng Huang
ELM
273
0
0
04 Nov 2025
Black-Box Membership Inference Attack for LVLMs via Prior Knowledge-Calibrated Memory Probing
Jinhua Yin
Peiru Yang
Chen Yang
Huili Wang
Zhiyang Hu
Shangguang Wang
Yongfeng Huang
Tao Qi
229
1
0
03 Nov 2025
Remembering Unequally: Global and Disciplinary Bias in LLM Reconstruction of Scholarly Coauthor Lists
Ghazal Kalhor
Afra Mashhadi
119
0
0
01 Nov 2025
EL-MIA: Quantifying Membership Inference Risks of Sensitive Entities in LLMs
Ali Satvaty
Suzan Verberne
Fatih Turkmen
MIALM
353
1
0
31 Oct 2025
Detecting Data Contamination in LLMs via In-Context Learning
Michał Zawalski
Meriem Boubdir
Klaudia Bałazy
Besmira Nushi
Pablo Ribalta
215
2
0
30 Oct 2025
Questionnaire meets LLM: A Benchmark and Empirical Study of Structural Skills for Understanding Questions and Responses
Duc-Hai Nguyen
Vijayakumar Nanjappan
Barry O'Sullivan
Hoang D. Nguyen
163
0
0
30 Oct 2025
RECAP: Reproducing Copyrighted Data from LLMs Training with an Agentic Pipeline
André V. Duarte
Xuying Li
Bin Zeng
Arlindo L. Oliveira
Lei Li
Zhuo Li
175
0
0
29 Oct 2025
A Survey on Unlearning in Large Language Models
Ruichen Qiu
Jiajun Tan
Jiayue Pu
Honglin Wang
Xiao-Shan Gao
Fei Sun
MU
AILaw
PILM
786
2
0
29 Oct 2025
Idea2Plan: Exploring AI-Powered Research Planning
Jin Huang
Silviu Cucerzan
S. Jauhar
Ryen W. White
LLMAG
LRM
152
1
0
28 Oct 2025
PrivacyGuard: A Modular Framework for Privacy Auditing in Machine Learning
Luca Melis
Matthew Grange
Iden Kalemaj
Karan Chadha
Shengyuan Hu
Elena Kashtelyan
Will Bullock
176
0
0
27 Oct 2025
Retracing the Past: LLMs Emit Training Data When They Get Lost
Myeongseob Ko
Nikhil Reddy Billa
Adam Nguyen
Charles Fleming
Ming Jin
R. Jia
AAML
180
0
0
27 Oct 2025
LLEMA: Evolutionary Search with LLMs for Multi-Objective Materials Discovery
Nikhil Abhyankar
Sanchit Kabra
Saaketh Desai
Chandan K. Reddy
157
1
0
26 Oct 2025
Leverage Unlearning to Sanitize LLMs
Antoine Boutet
Lucas Magnana
MU
MedIm
255
1
0
24 Oct 2025
Black Box Absorption: LLMs Undermining Innovative Ideas
Wenjun Cao
174
0
0
23 Oct 2025
Adversarially-Aware Architecture Design for Robust Medical AI Systems
Alyssa Gerhart
Balaji Iyangar
AAML
228
1
0
23 Oct 2025
Machine Text Detectors are Membership Inference Attacks
Ryuto Koike
Liam Dugan
Masahiro Kaneko
Chris Callison-Burch
Naoaki Okazaki
233
1
0
22 Oct 2025
CircuitGuard: Mitigating LLM Memorization in RTL Code Generation Against IP Leakage
Nowfel Mashnoor
Mohammad Akyash
Hadi M Kamali
Kimia Azar
167
1
0
22 Oct 2025