Scalable Extraction of Training Data from (Production) Language Models

28 November 2023

Christopher A. Choquette-Choo

ArXiv (abs)PDF HTML HuggingFace (3 upvotes)

Papers citing "Scalable Extraction of Training Data from (Production) Language Models"

50 / 281 papers shown

Private Memorization Editing: Turning Memorization into a Defense to Strengthen Data Privacy in Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Elena Sofia Ruzzetti

Giancarlo A. Xompero

Davide Venditti

Fabio Massimo Zanzotto

KELM PILM

291

09 Jun 2025

Quantifying Cross-Modality Memorization in Vision-Language Models

330

05 Jun 2025

Membership Inference Attacks on Sequence Models

273

05 Jun 2025

Privacy Leaks by Adversaries: Adversarial Iterations for Membership Inference Attack

313

03 Jun 2025

ACCESS DENIED INC: The First Benchmark Environment for Sensitivity AwarenessAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Dren Fazlija

Arkadij Orlov

Sandipan Sikdar

255

01 Jun 2025

Existing Large Language Model Unlearning Evaluations Are Inconclusive

154

31 May 2025

How much do language models memorize?

408

30 May 2025

Hush! Protecting Secrets During Model Training: An Indistinguishability Approach

193

30 May 2025

Exploring the limits of strong membership inference attacks on large language models

Jamie Hayes

Ilia Shumailov

Christopher A. Choquette-Choo

Matthew Jagielski

G. Kaissis

...

Matthieu Meeus

Yves-Alexandre de Montjoye

Franziska Boenisch

Adam Dziedzic

A. Feder Cooper

341

24 May 2025

Be Careful When Fine-tuning On Open-Source LLMs: Your Fine-tuning Data Could Be Secretly Stolen!

319

21 May 2025

Shared Path: Unraveling Memorization in Multilingual LLMs through Language Similarities

278

21 May 2025

Is Your Prompt Safe? Investigating Prompt Injection Attacks Against Open-Source LLMs

264

20 May 2025

Fragments to Facts: Partial-Information Fragment Inference from LLMs

332

20 May 2025

Adversarially Pretrained Transformers May Be Universally Robust In-Context Learners

549

20 May 2025

One-Step Offline Distillation of Diffusion-based Models via Koopman Modeling

474

19 May 2025

PANORAMA: A synthetic PII-laced dataset for studying sensitive data memorization in LLMs

Sriram Selvam

Anneswa Ghosh

178

18 May 2025

PIG: Privacy Jailbreak Attack on LLMs via Gradient-based Iterative In-Context OptimizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

502

15 May 2025

DMRL: Data- and Model-aware Reward Learning for Data Extraction

Zhiqiang Wang

Ruoxi Cheng

182

07 May 2025

OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models

456

07 May 2025

Automatic Calibration for Membership Inference Attack on Large Language Models

Mohammad Amin Roshani

Prashant Khanduri

Dongxiao Zhu

267

06 May 2025

Transferable Adversarial Attacks on Black-Box Vision-Language Models

410

02 May 2025

Towards Harnessing the Collaborative Power of Large and Small Models for Domain Tasks

...

1.0K

24 Apr 2025

Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction

622

21 Apr 2025

Antidistillation Sampling

448

17 Apr 2025

The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context

Nikhil Verma

Manasa Bharadwaj

273

03 Apr 2025

SUV: Scalable Large Language Model Copyright Compliance with Regularized Selective Unlearning

551

29 Mar 2025

Spend Your Budget Wisely: Towards an Intelligent Distribution of the Privacy Budget in Differentially Private Text RewritingConference on Data and Application Security and Privacy (CODASPY), 2024

Stephen Meisenbacher

Chaeeun Joy Lee

Florian Matthes

263

28 Mar 2025

How do language models learn facts? Dynamics, curricula and hallucinations

367

27 Mar 2025

Gemma 3 Technical Report

...

576

781

25 Mar 2025

Language Models May Verbatim Complete Text They Were Not Explicitly Trained On

Katja Filippova

Christopher A. Choquette-Choo

472

21 Mar 2025

In-House Evaluation Is Not Enough: Towards Robust Third-Party Flaw Disclosure for General-Purpose AI

...

391

21 Mar 2025

Inspecting the Representation Manifold of Differentially-Private Text

Stefan Arnold

236

19 Mar 2025

Empirical Privacy Variance

509

16 Mar 2025

Synthesizing Privacy-Preserving Text Data via Finetuning without Finetuning Billion-Scale LLMs

399

16 Mar 2025

PoisonedParrot: Subtle Data Poisoning Attacks to Elicit Copyright-Infringing Content from Large Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

Michael-Andrei Panaitescu-Liess

Pankayaraj Pathmanathan

357

10 Mar 2025

Privacy Auditing of Large Language ModelsInternational Conference on Learning Representations (ICLR), 2025

Ashwinee Panda

Xinyu Tang

Milad Nasr

Christopher A. Choquette-Choo

Prateek Mittal

PILM

349

09 Mar 2025

Mitigating Memorization in LLMs using Activation Steering

345

08 Mar 2025

Machine Learners Should Acknowledge the Legal Implications of Large Language Models as Personal Data

451

03 Mar 2025

Towards Label-Only Membership Inference Attack against Pre-trained Large Language Models

466

26 Feb 2025

Merger-as-a-Stealer: Stealing Targeted PII from Aligned LLMs with Model Merging

356

22 Feb 2025

Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational AgentsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Karthikeyan N. Ramamurthy

276

22 Feb 2025

Privacy Ripple Effects from Adding or Removing Personal Information in Language Model TrainingAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Jaydeep Borkar

Matthew Jagielski

Katherine Lee

Niloofar Mireshghallah

David A. Smith

Christopher A. Choquette-Choo

PILM

689

21 Feb 2025

Generative AI Training and Copyright Law

Tim W. Dornis

Sebastian Stober

387

21 Feb 2025

The Canary's Echo: Auditing Privacy Risks of LLM-Generated Synthetic Text

Matthieu Meeus

Lukas Wutschitz

Santiago Zanella Béguelin

Shruti Tople

Reza Shokri

442

19 Feb 2025

Commercial LLM Agents Are Already Vulnerable to Simple Yet Dangerous Attacks

Ang Li

Yin Zhou

Vethavikashini Chithrra Raghuram

Tom Goldstein

Micah Goldblum

AAML

347

12 Feb 2025

LLM Unlearning via Neural Activation Redirection

359

11 Feb 2025

MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations

...

654

10 Feb 2025

Can We Trust AI Benchmarks? An Interdisciplinary Review of Current Issues in AI Evaluation

David Fernandez-Llorca

ELM

734

10 Feb 2025

On the Impact of Noise in Differentially Private Text RewritingNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

Stephen Meisenbacher

Maulik Chevli

Florian Matthes

219

31 Jan 2025

The Pitfalls of "Security by Obscurity" And What They Mean for Transparent AIAAAI Conference on Artificial Intelligence (AAAI), 2025

Peter Hall

Olivia Mundahl

Sunoo Park

383

30 Jan 2025