Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2311.17035
Cited By
Scalable Extraction of Training Data from (Production) Language Models
28 November 2023
Milad Nasr
Nicholas Carlini
Jonathan Hayase
Matthew Jagielski
A. Feder Cooper
Daphne Ippolito
Christopher A. Choquette-Choo
Eric Wallace
Florian Tramèr
Katherine Lee
SILM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (3 upvotes)
Papers citing
"Scalable Extraction of Training Data from (Production) Language Models"
50 / 281 papers shown
Analyzing Memorization in Large Language Models through the Lens of Model Attribution
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Tarun Ram Menta
Susmit Agrawal
Chirag Agarwal
188
10
0
10 Jan 2025
Multi-PA: A Multi-perspective Benchmark on Privacy Assessment for Large Vision-Language Models
Jie M. Zhang
Xiangkui Cao
Zhouyu Han
Shiguang Shan
Xilin Chen
ELM
254
0
0
27 Dec 2024
Diverse and Effective Red Teaming with Auto-generated Rewards and Multi-step Reinforcement Learning
Alex Beutel
Kai Y. Xiao
Johannes Heidecke
Lilian Weng
AAML
183
17
0
24 Dec 2024
Social Science Is Necessary for Operationalizing Socially Responsible Foundation Models
Adam Davies
Elisa Nguyen
Michael Simeone
Erik Johnston
Martin Gubri
579
0
0
20 Dec 2024
Jailbreaking? One Step Is Enough!
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Weixiong Zheng
Peijian Zeng
Yuchen Ren
Hongyan Wu
Hongyan Wu
Jianfei Chen
Aimin Yang
Yimiao Zhou
AAML
224
1
0
17 Dec 2024
Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy and Research
A. Feder Cooper
Christopher A. Choquette-Choo
Miranda Bogen
Matthew Jagielski
Katja Filippova
...
Hanna M. Wallach
Amy Cyphert
Katherine Lee
Nicolas Papernot
Katherine Lee
MU
AILaw
357
29
0
09 Dec 2024
Towards Data Governance of Frontier AI Models
Jason Hausenloy
Duncan McClements
Madhavendra Thakur
454
2
0
05 Dec 2024
Learning to Forget using Hypernetworks
Jose Miguel Lara Rangel
Stefan Schoepf
Jack Foster
David M. Krueger
Usman Anwar
MU
367
3
0
01 Dec 2024
AIDBench: A benchmark for evaluating the authorship identification capability of large language models
Zichen Wen
Dadi Guo
Huishuai Zhang
268
2
0
20 Nov 2024
Measuring Non-Adversarial Reproduction of Training Data in Large Language Models
International Conference on Learning Representations (ICLR), 2024
Michael Aerni
Javier Rando
Edoardo Debenedetti
Nicholas Carlini
Daphne Ippolito
F. Tramèr
243
13
0
15 Nov 2024
A Social Outcomes and Priorities centered (SOP) Framework for AI policy
Mohak Shah
167
0
0
12 Nov 2024
Beyond the Safety Bundle: Auditing the Helpful and Harmless Dataset
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Khaoula Chehbouni
Jonathan Colaço-Carr
Yash More
Jackie CK Cheung
G. Farnadi
573
7
0
12 Nov 2024
The Empirical Impact of Data Sanitization on Language Models
Anwesan Pal
Radhika Bhargava
Kyle Hinsz
Jacques Esterhuizen
Sudipta Bhattacharya
257
3
0
08 Nov 2024
Unveiling Synthetic Faces: How Synthetic Datasets Can Expose Real Identities
Hatef Otroshi-Shahreza
S´ebastien Marcel
289
6
0
31 Oct 2024
Exactly Minimax-Optimal Locally Differentially Private Sampling
Neural Information Processing Systems (NeurIPS), 2024
Hyun-Young Park
Shahab Asoodeh
Si-Hyeon Lee
288
5
0
30 Oct 2024
Props for Machine-Learning Security
Ari Juels
Farinaz Koushanfar
150
3
0
27 Oct 2024
Breaking Chains: Unraveling the Links in Multi-Hop Knowledge Unlearning
Minseok Choi
C. Park
Dohyun Lee
Jaegul Choo
KELM
MU
164
4
0
17 Oct 2024
Reconstruction of Differentially Private Text Sanitization via Large Language Models
Shuchao Pang
Zhigang Lu
Jian Shu
Peng Fu
Yongbin Zhou
Minhui Xue
AAML
431
5
0
16 Oct 2024
To Err is AI : A Case Study Informing LLM Flaw Reporting Practices
AAAI Conference on Artificial Intelligence (AAAI), 2024
Sean McGregor
Allyson Ettinger
Nick Judd
Paul Albee
Liwei Jiang
...
Avijit Ghosh
Christopher Fiorelli
Michelle Hoang
Sven Cattell
Nouha Dziri
200
5
0
15 Oct 2024
A Theoretical Survey on Foundation Models
Shi Fu
Yuzhu Chen
Yingjie Wang
Dacheng Tao
304
0
0
15 Oct 2024
Federated Learning in Practice: Reflections and Projections
International Conference on Trust, Privacy and Security in Intelligent Systems and Applications (ICPSISA), 2024
Katharine Daly
Hubert Eichner
Peter Kairouz
H. B. McMahan
Daniel Ramage
Zheng Xu
FedML
317
29
0
11 Oct 2024
COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act
Philipp Guldimann
Alexander Spiridonov
Robin Staab
Nikola Jovanović
Mark Vero
...
Mislav Balunović
Nikola Konstantinov
Pavol Bielik
Petar Tsankov
Martin Vechev
ELM
353
20
0
10 Oct 2024
Rescriber: Smaller-LLM-Powered User-Led Data Minimization for LLM-Based Chatbots
International Conference on Human Factors in Computing Systems (CHI), 2024
Jijie Zhou
Eryue Xu
Yaoyao Wu
Tianshi Li
411
0
0
10 Oct 2024
CodeCipher: Learning to Obfuscate Source Code Against LLMs
Yalan Lin
Chengcheng Wan
Yixiong Fang
Xiaodong Gu
155
3
0
08 Oct 2024
KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Wenhao Wang
Xiaoyu Liang
Rui Ye
Jingyi Chai
Siheng Chen
Yanfeng Wang
SyDa
338
9
0
08 Oct 2024
Non-Halting Queries: Exploiting Fixed Points in LLMs
Ghaith Hammouri
Kemal Derya
B. Sunar
300
1
0
08 Oct 2024
MIBench: A Comprehensive Framework for Benchmarking Model Inversion Attack and Defense
Yixiang Qiu
Hongyao Yu
Hao Fang
Wenbo Yu
Wenbo Yu
Bin Chen
Shu-Tao Xia
Ke Xu
Ke Xu
AAML
237
1
0
07 Oct 2024
How Much Can We Forget about Data Contamination?
Sebastian Bordt
Suraj Srinivas
Valentyn Boreiko
U. V. Luxburg
451
10
0
04 Oct 2024
Mitigating Memorization In Language Models
Mansi Sakarvadia
Aswathy Ajith
Arham Khan
Nathaniel Hudson
Caleb Geniesse
Kyle Chard
Yaoqing Yang
Ian Foster
Michael W. Mahoney
KELM
MU
396
8
0
03 Oct 2024
Undesirable Memorization in Large Language Models: A Survey
Ali Satvaty
Suzan Verberne
Fatih Turkmen
ELM
PILM
582
23
0
03 Oct 2024
Position: LLM Unlearning Benchmarks are Weak Measures of Progress
Pratiksha Thaker
Shengyuan Hu
Neil Kale
Yash Maurya
Zhiwei Steven Wu
Virginia Smith
MU
357
35
0
03 Oct 2024
Membership Inference Attacks Cannot Prove that a Model Was Trained On Your Data
Jie Zhang
Debeshee Das
Gautam Kamath
Florian Tramèr
MIALM
MIACV
826
40
1
29 Sep 2024
Predicting memorization within Large Language Models fine-tuned for classification
Jérémie Dentan
Davide Buscaldi
A. Shabou
Sonia Vanier
340
1
0
27 Sep 2024
An Adversarial Perspective on Machine Unlearning for AI Safety
Jakub Łucki
Boyi Wei
Yangsibo Huang
Peter Henderson
F. Tramèr
Javier Rando
MU
AAML
935
85
0
26 Sep 2024
Data-Centric AI Governance: Addressing the Limitations of Model-Focused Policies
Ritwik Gupta
Leah Walker
Rodolfo Corona
Stephanie Fu
Suzanne Petryk
Janet Napolitano
Trevor Darrell
Andrew W. Reddie
ELM
212
7
0
25 Sep 2024
Pretraining Data Detection for Large Language Models: A Divergence-based Calibration Method
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Weichao Zhang
Ruqing Zhang
Jiafeng Guo
Maarten de Rijke
Yixing Fan
Xueqi Cheng
439
50
0
23 Sep 2024
Order of Magnitude Speedups for LLM Membership Inference
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Rongting Zhang
Martín Bertrán
Aaron Roth
443
2
0
22 Sep 2024
Unlocking Memorization in Large Language Models with Dynamic Soft Prompting
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Zhepeng Wang
Runxue Bao
Yawen Wu
Jackson Taylor
Cao Xiao
Feng Zheng
Weiwen Jiang
Shangqian Gao
Yanfu Zhang
PILM
205
21
0
20 Sep 2024
Extracting Memorized Training Data via Decomposition
Ellen Su
Anu Vellore
Amy Chang
Raffaele Mura
Blaine Nelson
Paul Kassianik
Amin Karbasi
193
5
0
18 Sep 2024
MEOW: MEMOry Supervised LLM Unlearning Via Inverted Facts
Tianle Gu
Kexin Huang
Ruilin Luo
Yuanqi Yao
Yujiu Yang
Yan Teng
Yingchun Wang
MU
359
15
0
18 Sep 2024
Generated Data with Fake Privacy: Hidden Dangers of Fine-tuning Large Language Models on Generated Data
Atilla Akkus
Mingjie Li
Junjie Chu
Junjie Chu
Michael Backes
Sinem Sav
Sinem Sav
SILM
SyDa
356
13
0
12 Sep 2024
Introducing ELLIPS: An Ethics-Centered Approach to Research on LLM-Based Inference of Psychiatric Conditions
AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2024
R. Rocca
Giada Pistilli
Kritika Maheshwari
Riccardo Fusaroli
47
3
0
06 Sep 2024
Large Language Models in Drug Discovery and Development: From Disease Mechanisms to Clinical Trials
Yizhen Zheng
Huan Yee Koh
M. Yang
Li Li
Lauren T. May
Geoffrey I. Webb
Shirui Pan
George Church
LM&MA
241
47
0
06 Sep 2024
Recent Advances in Attack and Defense Approaches of Large Language Models
Jing Cui
Yishi Xu
Zhewei Huang
Shuchang Zhou
Jianbin Jiao
Junge Zhang
PILM
AAML
345
8
0
05 Sep 2024
Forget to Flourish: Leveraging Machine-Unlearning on Pretrained Language Models for Privacy Leakage
AAAI Conference on Artificial Intelligence (AAAI), 2024
Md Rafi Ur Rashid
Jing Liu
T. Koike-Akino
Shagufta Mehnaz
Ye Wang
MU
SILM
327
11
0
30 Aug 2024
PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action
Neural Information Processing Systems (NeurIPS), 2024
Yijia Shao
Tianshi Li
Weiyan Shi
Yanchen Liu
Diyi Yang
PILM
550
78
0
29 Aug 2024
LLM-PBE: Assessing Data Privacy in Large Language Models
Proceedings of the VLDB Endowment (PVLDB), 2024
Qinbin Li
Junyuan Hong
Chulin Xie
Jeffrey Tan
Rachel Xin
...
Dan Hendrycks
Zinan Lin
Bo Li
Bingsheng He
Dawn Song
ELM
PILM
311
48
0
23 Aug 2024
Promises and challenges of generative artificial intelligence for human learning
Nature Human Behaviour (Nat Hum Behav), 2024
Lixiang Yan
Samuel Greiff
Ziwen Teuber
Dragan Gašević
446
0
0
22 Aug 2024
Tracing Privacy Leakage of Language Models to Training Data via Adjusted Influence Functions
Jinxin Liu
Zao Yang
240
2
0
20 Aug 2024
Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion
Adi Haviv
Shahar Sarfaty
Uri Y. Hacohen
N. Elkin-Koren
Roi Livni
Amit H. Bermano
225
3
0
15 Aug 2024
Previous
1
2
3
4
5
6
Next