
When is Memorization of Irrelevant Training Data Necessary for High-Accuracy Learning?

Symposium on the Theory of Computing (STOC), 2020

11 December 2020

Papers citing "When is Memorization of Irrelevant Training Data Necessary for High-Accuracy Learning?"

50 / 92 papers shown

Extracting alignment data in open models

Federico Barbero

Xiangming Gu

Christopher A. Choquette-Choo

282

21 Oct 2025

AI Agents as Universal Task Solvers

Alessandro Achille

Stefano Soatto

LRM

133

14 Oct 2025

A Law of Data Reconstruction for Random Features (and Beyond)

157

26 Sep 2025

Efficiently Attacking Memorization Scores

281

24 Sep 2025

Synth-MIA: A Testbed for Auditing Privacy Leakage in Tabular Data Synthesis

167

22 Sep 2025

Access Paths for Efficient Ordering with Large Language Models

Dimitris Tsirogiannis

215

30 Aug 2025

Unveiling Over-Memorization in Finetuning LLMs for Reasoning Tasks

229

06 Aug 2025

A Common Pool of Privacy Problems: Legal and Technical Lessons from a Large-Scale Web-Scraped Machine Learning Dataset

391

20 Jun 2025

Black-Box Privacy Attacks on Shared Representations in Multitask Learning

276

19 Jun 2025

Memorization in Language Models through the Lens of Intrinsic Dimension

Stefan Arnold

PILM

364

11 Jun 2025

Trade-offs in Data Memorization via Strong Data Processing Inequalities

Annual Conference Computational Learning Theory (COLT), 2025

451

02 Jun 2025

How much do language models memorize?

423

30 May 2025

Bayesian Perspective on Memorization and Reconstruction

268

29 May 2025

Querying Kernel Methods Suffices for Reconstructing their Training Data

219

25 May 2025

T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models

324

07 Apr 2025

Trustworthy Machine Learning via Memorization and the Granular Long-Tail: A Survey on Interactions, Tradeoffs, and Beyond

565

10 Mar 2025

Machine Learners Should Acknowledge the Legal Implications of Large Language Models as Personal Data

488

03 Mar 2025

The Pitfalls of Memorization: When Memorization Hurts Generalization

International Conference on Learning Representations (ICLR), 2024

362

10 Dec 2024

Improved Localized Machine Unlearning Through the Lens of Memorization

Reihaneh Torkzadehmahani

Reza Nasirigerdeh

Georgios Kaissis

Daniel Rueckert

Gintare Karolina Dziugaite

Eleni Triantafillou

219

03 Dec 2024

Slowing Down Forgetting in Continual Learning

456

11 Nov 2024

Undesirable Memorization in Large Language Models: A Survey

629

03 Oct 2024

Range Membership Inference Attacks

Jiashu Tao

Reza Shokri

472

09 Aug 2024

Demystifying Verbatim Memorization in Large Language Models

Jing Huang

Diyi Yang

Christopher Potts

ELM PILM MU

338

25 Jul 2024

A Survey on Machine Unlearning: Techniques and New Emerged Privacy Risks

Journal of Information Security and Applications (JISA), 2024

Hengzhu Liu

Ping Xiong

Tianqing Zhu

Philip S. Yu

248

10 Jun 2024

Data Reconstruction: When You See It and When You Don't

315

24 May 2024

Exploring prompts to elicit memorization in masked language model-based named entity recognition

PLoS ONE, 2024

Yuxi Xia

Anastasiia Sedova

Pedro Henrique Luz de Araujo

Vasiliki Kougia

Lisa Nussbaumer

Benjamin Roth

296

05 May 2024

Differentially Private Reinforcement Learning with Self-Play

Dan Qiao

Yu Wang

274

11 Apr 2024

Gradient Descent is Pareto-Optimal in the Oracle Complexity and Memory Tradeoff for Feasibility Problems

IEEE Annual Symposium on Foundations of Computer Science (FOCS), 2024

Moise Blanchard

265

10 Apr 2024

Unveiling Privacy, Memorization, and Input Curvature Links

305

28 Feb 2024

Information Complexity of Stochastic Convex Optimization: Applications to Generalization and Memorization

Idan Attias

Gintare Karolina Dziugaite

Mahdi Haghifam

Roi Livni

Daniel M. Roy

360

14 Feb 2024

Do LLMs Dream of Ontologies?

ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2024

Marco Bombieri

Paolo Fiorini

Simone Paolo Ponzetto

M. Rospocher

CLL

367

26 Jan 2024

Memorization in Self-Supervised Learning Improves Downstream Generalization

Wenhao Wang

Muhammad Ahmad Kaleem

429

19 Jan 2024

The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright Breaches Without Adjusting Finetuning Pipeline

Qianli Shen

307

07 Jan 2024

SoK: Unintended Interactions among Machine Learning Defenses and Risks

384

07 Dec 2023

Differentially Private Non-Convex Optimization under the KL Condition with Optimal Rates

International Conference on Algorithmic Learning Theory (ALT), 2023

343

22 Nov 2023

On Retrieval Augmentation and the Limitations of Language Model Training

227

16 Nov 2023

Privacy Threats in Stable Diffusion Models

Thomas Cilloni

Charles Fleming

Charles Walter

211

15 Nov 2023

SoK: Memorisation in machine learning

Dmitrii Usynin

Moritz Knolle

Georgios Kaissis

333

06 Nov 2023

Why Train More? Effective and Efficient Membership Inference via Memorization

Jihye Choi

276

12 Oct 2023

What do larger image classifiers memorise?

Sanjiv Kumar

268

09 Oct 2023

Anonymous Learning via Look-Alike Clustering: A Precise Analysis of Model Generalization

Neural Information Processing Systems (NeurIPS), 2023

Adel Javanmard

Vahab Mirrokni

434

06 Oct 2023

Deconstructing Data Reconstruction: Multiclass, Weight Decay and General Losses

Neural Information Processing Systems (NeurIPS), 2023

Gal Vardi

279

04 Jul 2023

Deconstructing Classifiers: Towards A Data Reconstruction Attack Against Text Classification Models

Adel M. Elmahdy

A. Salem

SILM

326

23 Jun 2023

Memory-Query Tradeoffs for Randomized Convex Optimization

IEEE Annual Symposium on Foundations of Computer Science (FOCS), 2023

Xinyu Chen

Binghui Peng

240

21 Jun 2023

Machine Unlearning: A Survey

ACM Computing Surveys (ACM Comput. Surv.), 2023

Philip S. Yu

282

06 Jun 2023

TMI! Finetuned Models Leak Private Information from their Pretraining Data

Proceedings on Privacy Enhancing Technologies (PoPETs), 2023

313

01 Jun 2023

Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive Tasks

Neural Information Processing Systems (NeurIPS), 2023

309

28 May 2023

Private Everlasting Prediction

Neural Information Processing Systems (NeurIPS), 2023

213

16 May 2023

AI Model Disgorgement: Methods and Choices

Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2023

249

07 Apr 2023

Near Optimal Memory-Regret Tradeoff for Online Learning

IEEE Annual Symposium on Foundations of Computer Science (FOCS), 2023

Binghui Peng

A. Rubinstein

CLL

318

03 Mar 2023