Scalable Extraction of Training Data from (Production) Language Models

28 November 2023

Christopher A. Choquette-Choo


Papers citing "Scalable Extraction of Training Data from (Production) Language Models"

50 / 281 papers shown

Analyzing Memorization in Large Language Models through the Lens of Model Attribution

North American Chapter of the Association for Computational Linguistics (NAACL), 2025

Tarun Ram Menta

Susmit Agrawal

Chirag Agarwal

188

10 Jan 2025

Multi-PA: A Multi-perspective Benchmark on Privacy Assessment for Large Vision-Language Models

254

27 Dec 2024

Diverse and Effective Red Teaming with Auto-generated Rewards and Multi-step Reinforcement Learning

183

24 Dec 2024

Social Science Is Necessary for Operationalizing Socially Responsible Foundation Models

579

20 Dec 2024

Jailbreaking? One Step Is Enough!

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

224

17 Dec 2024

Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy and Research

A. Feder Cooper

Christopher A. Choquette-Choo

...

357

09 Dec 2024

Towards Data Governance of Frontier AI Models

Jason Hausenloy

Duncan McClements

Madhavendra Thakur

454

05 Dec 2024

Learning to Forget using Hypernetworks

Jose Miguel Lara Rangel

367

01 Dec 2024

AIDBench: A benchmark for evaluating the authorship identification capability of large language models

Zichen Wen

Dadi Guo

Huishuai Zhang

268

20 Nov 2024

Measuring Non-Adversarial Reproduction of Training Data in Large Language Models

International Conference on Learning Representations (ICLR), 2024

243

15 Nov 2024

A Social Outcomes and Priorities centered (SOP) Framework for AI policy

Mohak Shah

167

12 Nov 2024

Beyond the Safety Bundle: Auditing the Helpful and Harmless Dataset

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

573

12 Nov 2024

The Empirical Impact of Data Sanitization on Language Models

257

08 Nov 2024

Unveiling Synthetic Faces: How Synthetic Datasets Can Expose Real Identities

Hatef Otroshi-Shahreza

Sébastien Marcel

289

31 Oct 2024

Exactly Minimax-Optimal Locally Differentially Private Sampling

Neural Information Processing Systems (NeurIPS), 2024

Hyun-Young Park

Shahab Asoodeh

Si-Hyeon Lee

288

30 Oct 2024

Props for Machine-Learning Security

Ari Juels

Farinaz Koushanfar

150

27 Oct 2024

Breaking Chains: Unraveling the Links in Multi-Hop Knowledge Unlearning

164

17 Oct 2024

Reconstruction of Differentially Private Text Sanitization via Large Language Models

431

16 Oct 2024

To Err is AI : A Case Study Informing LLM Flaw Reporting Practices

AAAI Conference on Artificial Intelligence (AAAI), 2024

...

200

15 Oct 2024

A Theoretical Survey on Foundation Models

Shi Fu

Yuzhu Chen

Yingjie Wang

Dacheng Tao

304

15 Oct 2024

Federated Learning in Practice: Reflections and Projections

International Conference on Trust, Privacy and Security in Intelligent Systems and Applications (ICPSISA), 2024

Daniel Ramage

317

11 Oct 2024

COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act

...

Martin Vechev

353

10 Oct 2024

Rescriber: Smaller-LLM-Powered User-Led Data Minimization for LLM-Based Chatbots

International Conference on Human Factors in Computing Systems (CHI), 2024

411

10 Oct 2024

CodeCipher: Learning to Obfuscate Source Code Against LLMs

155

08 Oct 2024

KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Wenhao Wang

Xiaoyu Liang

Rui Ye

Jingyi Chai

Siheng Chen

Yanfeng Wang

SyDa

338

08 Oct 2024

Non-Halting Queries: Exploiting Fixed Points in LLMs

Ghaith Hammouri

Kemal Derya

B. Sunar

300

08 Oct 2024

MIBench: A Comprehensive Framework for Benchmarking Model Inversion Attack and Defense

Hao Fang

Bin Chen

237

07 Oct 2024

How Much Can We Forget about Data Contamination?

451

04 Oct 2024

Mitigating Memorization In Language Models

Arham Khan

Kyle Chard

Ian Foster

Michael W. Mahoney

KELM MU

396

03 Oct 2024

Undesirable Memorization in Large Language Models: A Survey

582

03 Oct 2024

Position: LLM Unlearning Benchmarks are Weak Measures of Progress

Virginia Smith

357

03 Oct 2024

Membership Inference Attacks Cannot Prove that a Model Was Trained On Your Data

Florian Tramèr

826

29 Sep 2024

Predicting memorization within Large Language Models fine-tuned for classification

340

27 Sep 2024

An Adversarial Perspective on Machine Unlearning for AI Safety

935

26 Sep 2024

Data-Centric AI Governance: Addressing the Limitations of Model-Focused Policies

Stephanie Fu

Trevor Darrell

212

25 Sep 2024

Pretraining Data Detection for Large Language Models: A Divergence-based Calibration Method

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Jiafeng Guo

439

23 Sep 2024

Order of Magnitude Speedups for LLM Membership Inference

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Rongting Zhang

Martín Bertrán

Aaron Roth

443

22 Sep 2024

Unlocking Memorization in Large Language Models with Dynamic Soft Prompting

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Zhepeng Wang

Yawen Wu

Yanfu Zhang

205

20 Sep 2024

Extracting Memorized Training Data via Decomposition

Blaine Nelson

Paul Kassianik

Amin Karbasi

193

18 Sep 2024

MEOW: MEMOry Supervised LLM Unlearning Via Inverted Facts

Yujiu Yang

Yingchun Wang

359

18 Sep 2024

Generated Data with Fake Privacy: Hidden Dangers of Fine-tuning Large Language Models on Generated Data

Michael Backes

356

12 Sep 2024

Introducing ELLIPS: An Ethics-Centered Approach to Research on LLM-Based Inference of Psychiatric Conditions

AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2024

06 Sep 2024

Large Language Models in Drug Discovery and Development: From Disease Mechanisms to Clinical Trials

Yizhen Zheng

Geoffrey I. Webb

241

06 Sep 2024

Recent Advances in Attack and Defense Approaches of Large Language Models

345

05 Sep 2024

Forget to Flourish: Leveraging Machine-Unlearning on Pretrained Language Models for Privacy Leakage

AAAI Conference on Artificial Intelligence (AAAI), 2024

Ye Wang

327

30 Aug 2024

PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action

Neural Information Processing Systems (NeurIPS), 2024

550

29 Aug 2024

LLM-PBE: Assessing Data Privacy in Large Language Models

Proceedings of the VLDB Endowment (PVLDB), 2024

Qinbin Li

...

Bo Li

Dawn Song

311

23 Aug 2024

Promises and challenges of generative artificial intelligence for human learning

Nature Human Behaviour (Nat Hum Behav), 2024

Lixiang Yan

Samuel Greiff

Ziwen Teuber

Dragan Gašević

446

22 Aug 2024

Tracing Privacy Leakage of Language Models to Training Data via Adjusted Influence Functions

Jinxin Liu

Zao Yang

240

20 Aug 2024

Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion

225

15 Aug 2024