Scalable Extraction of Training Data from (Production) Language Models

28 November 2023
Milad Nasr
Nicholas Carlini
Jonathan Hayase
Matthew Jagielski
A. Feder Cooper
Daphne Ippolito
Christopher A. Choquette-Choo
Eric Wallace
Florian Tramèr
Katherine Lee
    SILM

Papers citing "Scalable Extraction of Training Data from (Production) Language Models"

50 / 281 papers shown
Analyzing Memorization in Large Language Models through the Lens of Model Attribution
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Tarun Ram Menta
Susmit Agrawal
Chirag Agarwal
188
10
0
10 Jan 2025
Multi-PA: A Multi-perspective Benchmark on Privacy Assessment for Large Vision-Language Models
Jie M. Zhang
Xiangkui Cao
Zhouyu Han
Shiguang Shan
Xilin Chen
ELM
254
0
0
27 Dec 2024
Diverse and Effective Red Teaming with Auto-generated Rewards and Multi-step Reinforcement Learning
Alex Beutel
Kai Y. Xiao
Johannes Heidecke
Lilian Weng
AAML
183
17
0
24 Dec 2024
Social Science Is Necessary for Operationalizing Socially Responsible Foundation Models
Adam Davies
Elisa Nguyen
Michael Simeone
Erik Johnston
Martin Gubri
579
0
0
20 Dec 2024
Jailbreaking? One Step Is Enough!
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Weixiong Zheng
Peijian Zeng
Yuchen Ren
Hongyan Wu
Jianfei Chen
Aimin Yang
Yimiao Zhou
AAML
224
1
0
17 Dec 2024
Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy and Research
A. Feder Cooper
Christopher A. Choquette-Choo
Miranda Bogen
Matthew Jagielski
Katja Filippova
...
Hanna M. Wallach
Amy Cyphert
Nicolas Papernot
Katherine Lee
MU
AILaw
357
29
0
09 Dec 2024
Towards Data Governance of Frontier AI Models
Jason Hausenloy
Duncan McClements
Madhavendra Thakur
454
2
0
05 Dec 2024
Learning to Forget using Hypernetworks
Jose Miguel Lara Rangel
Stefan Schoepf
Jack Foster
David M. Krueger
Usman Anwar
MU
367
3
0
01 Dec 2024
AIDBench: A benchmark for evaluating the authorship identification capability of large language models
Zichen Wen
Dadi Guo
Huishuai Zhang
268
2
0
20 Nov 2024
Measuring Non-Adversarial Reproduction of Training Data in Large Language Models
International Conference on Learning Representations (ICLR), 2024
Michael Aerni
Javier Rando
Edoardo Debenedetti
Nicholas Carlini
Daphne Ippolito
F. Tramèr
243
13
0
15 Nov 2024
A Social Outcomes and Priorities centered (SOP) Framework for AI policy
Mohak Shah
167
0
0
12 Nov 2024
Beyond the Safety Bundle: Auditing the Helpful and Harmless Dataset
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Khaoula Chehbouni
Jonathan Colaço-Carr
Yash More
Jackie CK Cheung
G. Farnadi
573
7
0
12 Nov 2024
The Empirical Impact of Data Sanitization on Language Models
Anwesan Pal
Radhika Bhargava
Kyle Hinsz
Jacques Esterhuizen
Sudipta Bhattacharya
257
3
0
08 Nov 2024
Unveiling Synthetic Faces: How Synthetic Datasets Can Expose Real Identities
Hatef Otroshi-Shahreza
Sébastien Marcel
289
6
0
31 Oct 2024
Exactly Minimax-Optimal Locally Differentially Private Sampling
Neural Information Processing Systems (NeurIPS), 2024
Hyun-Young Park
Shahab Asoodeh
Si-Hyeon Lee
288
5
0
30 Oct 2024
Props for Machine-Learning Security
Ari Juels
Farinaz Koushanfar
150
3
0
27 Oct 2024
Breaking Chains: Unraveling the Links in Multi-Hop Knowledge Unlearning
Minseok Choi
C. Park
Dohyun Lee
Jaegul Choo
KELM
MU
164
4
0
17 Oct 2024
Reconstruction of Differentially Private Text Sanitization via Large Language Models
Shuchao Pang
Zhigang Lu
Jian Shu
Peng Fu
Yongbin Zhou
Minhui Xue
AAML
431
5
0
16 Oct 2024
To Err is AI : A Case Study Informing LLM Flaw Reporting Practices
AAAI Conference on Artificial Intelligence (AAAI), 2024
Sean McGregor
Allyson Ettinger
Nick Judd
Paul Albee
Liwei Jiang
...
Avijit Ghosh
Christopher Fiorelli
Michelle Hoang
Sven Cattell
Nouha Dziri
200
5
0
15 Oct 2024
A Theoretical Survey on Foundation Models
Shi Fu
Yuzhu Chen
Yingjie Wang
Dacheng Tao
304
0
0
15 Oct 2024
Federated Learning in Practice: Reflections and Projections
International Conference on Trust, Privacy and Security in Intelligent Systems and Applications (ICPSISA), 2024
Katharine Daly
Hubert Eichner
Peter Kairouz
H. B. McMahan
Daniel Ramage
Zheng Xu
FedML
317
29
0
11 Oct 2024
COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act
Philipp Guldimann
Alexander Spiridonov
Robin Staab
Nikola Jovanović
Mark Vero
...
Mislav Balunović
Nikola Konstantinov
Pavol Bielik
Petar Tsankov
Martin Vechev
ELM
353
20
0
10 Oct 2024
Rescriber: Smaller-LLM-Powered User-Led Data Minimization for LLM-Based Chatbots
International Conference on Human Factors in Computing Systems (CHI), 2024
Jijie Zhou
Eryue Xu
Yaoyao Wu
Tianshi Li
411
0
0
10 Oct 2024
CodeCipher: Learning to Obfuscate Source Code Against LLMs
Yalan Lin
Chengcheng Wan
Yixiong Fang
Xiaodong Gu
155
3
0
08 Oct 2024
KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Wenhao Wang
Xiaoyu Liang
Rui Ye
Jingyi Chai
Siheng Chen
Yanfeng Wang
SyDa
338
9
0
08 Oct 2024
Non-Halting Queries: Exploiting Fixed Points in LLMs
Ghaith Hammouri
Kemal Derya
B. Sunar
300
1
0
08 Oct 2024
MIBench: A Comprehensive Framework for Benchmarking Model Inversion Attack and Defense
Yixiang Qiu
Hongyao Yu
Hao Fang
Wenbo Yu
Bin Chen
Shu-Tao Xia
Ke Xu
AAML
237
1
0
07 Oct 2024
How Much Can We Forget about Data Contamination?
Sebastian Bordt
Suraj Srinivas
Valentyn Boreiko
U. V. Luxburg
451
10
0
04 Oct 2024
Mitigating Memorization In Language Models
Mansi Sakarvadia
Aswathy Ajith
Arham Khan
Nathaniel Hudson
Caleb Geniesse
Kyle Chard
Yaoqing Yang
Ian Foster
Michael W. Mahoney
KELM
MU
396
8
0
03 Oct 2024
Undesirable Memorization in Large Language Models: A Survey
Ali Satvaty
Suzan Verberne
Fatih Turkmen
ELM
PILM
582
23
0
03 Oct 2024
Position: LLM Unlearning Benchmarks are Weak Measures of Progress
Pratiksha Thaker
Shengyuan Hu
Neil Kale
Yash Maurya
Zhiwei Steven Wu
Virginia Smith
MU
357
35
0
03 Oct 2024
Membership Inference Attacks Cannot Prove that a Model Was Trained On Your Data
Jie Zhang
Debeshee Das
Gautam Kamath
Florian Tramèr
MIALM
MIACV
826
40
1
29 Sep 2024
Predicting memorization within Large Language Models fine-tuned for classification
Jérémie Dentan
Davide Buscaldi
A. Shabou
Sonia Vanier
340
1
0
27 Sep 2024
An Adversarial Perspective on Machine Unlearning for AI Safety
Jakub Łucki
Boyi Wei
Yangsibo Huang
Peter Henderson
F. Tramèr
Javier Rando
MU
AAML
935
85
0
26 Sep 2024
Data-Centric AI Governance: Addressing the Limitations of Model-Focused Policies
Ritwik Gupta
Leah Walker
Rodolfo Corona
Stephanie Fu
Suzanne Petryk
Janet Napolitano
Trevor Darrell
Andrew W. Reddie
ELM
212
7
0
25 Sep 2024
Pretraining Data Detection for Large Language Models: A Divergence-based Calibration Method
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Weichao Zhang
Ruqing Zhang
Jiafeng Guo
Maarten de Rijke
Yixing Fan
Xueqi Cheng
439
50
0
23 Sep 2024
Order of Magnitude Speedups for LLM Membership Inference
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Rongting Zhang
Martín Bertrán
Aaron Roth
443
2
0
22 Sep 2024
Unlocking Memorization in Large Language Models with Dynamic Soft Prompting
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Zhepeng Wang
Runxue Bao
Yawen Wu
Jackson Taylor
Cao Xiao
Feng Zheng
Weiwen Jiang
Shangqian Gao
Yanfu Zhang
PILM
205
21
0
20 Sep 2024
Extracting Memorized Training Data via Decomposition
Ellen Su
Anu Vellore
Amy Chang
Raffaele Mura
Blaine Nelson
Paul Kassianik
Amin Karbasi
193
5
0
18 Sep 2024
MEOW: MEMOry Supervised LLM Unlearning Via Inverted Facts
Tianle Gu
Kexin Huang
Ruilin Luo
Yuanqi Yao
Yujiu Yang
Yan Teng
Yingchun Wang
MU
359
15
0
18 Sep 2024
Generated Data with Fake Privacy: Hidden Dangers of Fine-tuning Large Language Models on Generated Data
Atilla Akkus
Mingjie Li
Junjie Chu
Michael Backes
Sinem Sav
SILM
SyDa
356
13
0
12 Sep 2024
Introducing ELLIPS: An Ethics-Centered Approach to Research on LLM-Based Inference of Psychiatric Conditions
AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2024
R. Rocca
Giada Pistilli
Kritika Maheshwari
Riccardo Fusaroli
47
3
0
06 Sep 2024
Large Language Models in Drug Discovery and Development: From Disease Mechanisms to Clinical Trials
Yizhen Zheng
Huan Yee Koh
M. Yang
Li Li
Lauren T. May
Geoffrey I. Webb
Shirui Pan
George Church
LM&MA
241
47
0
06 Sep 2024
Recent Advances in Attack and Defense Approaches of Large Language Models
Jing Cui
Yishi Xu
Zhewei Huang
Shuchang Zhou
Jianbin Jiao
Junge Zhang
PILM
AAML
345
8
0
05 Sep 2024
Forget to Flourish: Leveraging Machine-Unlearning on Pretrained Language Models for Privacy Leakage
AAAI Conference on Artificial Intelligence (AAAI), 2024
Md Rafi Ur Rashid
Jing Liu
T. Koike-Akino
Shagufta Mehnaz
Ye Wang
MU
SILM
327
11
0
30 Aug 2024
PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action
Neural Information Processing Systems (NeurIPS), 2024
Yijia Shao
Tianshi Li
Weiyan Shi
Yanchen Liu
Diyi Yang
PILM
550
78
0
29 Aug 2024
LLM-PBE: Assessing Data Privacy in Large Language Models
Proceedings of the VLDB Endowment (PVLDB), 2024
Qinbin Li
Junyuan Hong
Chulin Xie
Jeffrey Tan
Rachel Xin
...
Dan Hendrycks
Zinan Lin
Bo Li
Bingsheng He
Dawn Song
ELM
PILM
311
48
0
23 Aug 2024
Promises and challenges of generative artificial intelligence for human learning
Nature Human Behaviour (Nat Hum Behav), 2024
Lixiang Yan
Samuel Greiff
Ziwen Teuber
Dragan Gašević
446
0
0
22 Aug 2024
Tracing Privacy Leakage of Language Models to Training Data via Adjusted Influence Functions
Jinxin Liu
Zao Yang
240
2
0
20 Aug 2024
Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion
Adi Haviv
Shahar Sarfaty
Uri Y. Hacohen
N. Elkin-Koren
Roi Livni
Amit H. Bermano
225
3
0
15 Aug 2024