The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks

22 February 2018
Nicholas Carlini, Chang Liu, Úlfar Erlingsson, Jernej Kos, Dawn Song
arXiv: 1802.08232

Papers citing "The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks"
Showing 50 of 710 citing papers.

Demystifying Verbatim Memorization in Large Language Models
Jing Huang, Diyi Yang, Christopher Potts
[ELM, PILM, MU]
25 Jul 2024

Reconstructing Training Data From Real World Models Trained with Transfer Learning
Yakir Oz, Gilad Yehudai, Gal Vardi, Itai Antebi, Michal Irani, Niv Haim
22 Jul 2024

Weights Shuffling for Improving DPSGD in Transformer-based Models
Jungang Yang, Zhe Ji, Liyao Xiang
22 Jul 2024

Operationalizing a Threat Model for Red-Teaming Large Language Models (LLMs)
Apurv Verma, Satyapriya Krishna, Sebastian Gehrmann, Madhavan Seshadri, Anu Pradhan, Tom Ault, Leslie Barrett, David Rabinowitz, John Doucette, Nhathai Phan
20 Jul 2024

Feature Inference Attack on Shapley Values
Xinjian Luo, Yangfan Jiang, X. Xiao
[AAML, FAtt]
16 Jul 2024

Social and Ethical Risks Posed by General-Purpose LLMs for Settling Newcomers in Canada
I. Nejadgholi, Maryam Molamohammadi, Samir Bakhtawar
15 Jul 2024

Privacy-Preserving Collaborative Genomic Research: A Real-Life Deployment and Vision
Zahra Rahmani, Nahal Shahini, Nadav Gat, Zebin Yun, Yuzhou Jiang, Ofir Farchy, Yaniv Harel, Vipin Chaudhary, Mahmood Sharif, Erman Ayday
[SyDa]
12 Jul 2024

Extracting Training Data from Document-Based VQA Models
Francesco Pinto, N. Rauschmayr, F. Tramèr, Philip H. S. Torr, Federico Tombari
11 Jul 2024
Fine-Tuning Large Language Models with User-Level Differential Privacy
Zachary Charles, Arun Ganesh, Ryan McKenna, H. B. McMahan, Nicole Mitchell, Krishna Pillutla, Keith Rush
10 Jul 2024

Composable Interventions for Language Models
Arinbjorn Kolbeinsson, Kyle O'Brien, Tianjin Huang, Shanghua Gao, Shiwei Liu, ..., Anurag J. Vaidya, Faisal Mahmood, Marinka Zitnik, Tianlong Chen, Thomas Hartvigsen
[KELM, MU]
09 Jul 2024

Exposing Privacy Gaps: Membership Inference Attack on Preference Data for LLM Alignment
Qizhang Feng, Siva Rajesh Kasa, Santhosh Kumar Kasa, Hyokun Yun, C. Teo, S. Bodapati
08 Jul 2024

Releasing Malevolence from Benevolence: The Menace of Benign Data on Machine Unlearning
Binhao Ma, Tianhang Zheng, Hongsheng Hu, Di Wang, Shuo Wang, Zhongjie Ba, Zhan Qin, Kui Ren
[AAML]
06 Jul 2024

Towards More Realistic Extraction Attacks: An Adversarial Perspective
Yash More, Prakhar Ganesh, G. Farnadi
[AAML]
02 Jul 2024

The Art of Saying No: Contextual Noncompliance in Language Models
Faeze Brahman, Sachin Kumar, Vidhisha Balachandran, Pradeep Dasigi, Valentina Pyatkin, ..., Jack Hessel, Yulia Tsvetkov, Noah A. Smith, Yejin Choi, Hannaneh Hajishirzi
02 Jul 2024

A Study of Nationality Bias in Names and Perplexity using Off-the-Shelf Affect-related Tweet Classifiers
Valentin Barriere, Sebastian Cifuentes
01 Jul 2024
Silver Linings in the Shadows: Harnessing Membership Inference for Machine Unlearning
Nexhi Sula, Abhinav Kumar, Jie Hou, Han Wang, R. Tourani
[MU]
01 Jul 2024

Characterizing Stereotypical Bias from Privacy-preserving Pre-Training
Stefan Arnold, Rene Gröbner, Annika Schreiner
30 Jun 2024

LongLaMP: A Benchmark for Personalized Long-form Text Generation
Ishita Kumar, Snigdha Viswanathan, Sushrita Yerra, Alireza Salemi, Ryan A. Rossi, ..., Xiang Chen, Ruiyi Zhang, Shubham Agarwal, Nedim Lipka, Hamed Zamani
27 Jun 2024

Evaluating Copyright Takedown Methods for Language Models
Boyi Wei, Weijia Shi, Yangsibo Huang, Noah A. Smith, Chiyuan Zhang, Luke Zettlemoyer, Kai Li, Peter Henderson
26 Jun 2024

Enhancing Federated Learning with Adaptive Differential Privacy and Priority-Based Aggregation
Mahtab Talaei, Iman Izadi
[FedML]
26 Jun 2024

Enhancing Data Privacy in Large Language Models through Private Association Editing
Davide Venditti, Elena Sofia Ruzzetti, Giancarlo A. Xompero, Cristina Giannone, Andrea Favalli, Raniero Romagnoli, Fabio Massimo Zanzotto
[KELM]
26 Jun 2024

Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon
USVSN Sai Prashanth, Alvin Deng, Kyle O'Brien, Jyothir S V, Mohammad Aflah Khan, ..., Jacob Ray Fuehne, Stella Biderman, Tracy Ke, Katherine Lee, Naomi Saphra
25 Jun 2024
Noisy Neighbors: Efficient membership inference attacks against LLMs
Filippo Galli, Luca Melis, Tommaso Cucinotta
24 Jun 2024

A Tale of Trust and Accuracy: Base vs. Instruct LLMs in RAG Systems
Florin Cuconasu, Giovanni Trappolini, Nicola Tonellotto, Fabrizio Silvestri
21 Jun 2024

Unmasking Database Vulnerabilities: Zero-Knowledge Schema Inference Attacks in Text-to-SQL Systems
Đorđe Klisura, Anthony Rios
[AAML]
20 Jun 2024

Protecting Privacy Through Approximating Optimal Parameters for Sequence Unlearning in Language Models
Dohyun Lee, Daniel Rim, Minseok Choi, Jaegul Choo
[PILM, MU]
20 Jun 2024

AspirinSum: an Aspect-based utility-preserved de-identification Summarization framework
Ya-Lun Li
20 Jun 2024

Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
Abhimanyu Hans, Yuxin Wen, Neel Jain, John Kirchenbauer, Hamid Kazemi, ..., Siddharth Singh, Gowthami Somepalli, Jonas Geiping, A. Bhatele, Tom Goldstein
14 Jun 2024

REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space
Tomer Ashuach, Martin Tutek, Yonatan Belinkov
[KELM, MU]
13 Jun 2024

On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions
Denys Pushkin, Raphael Berthier, Emmanuel Abbe
10 Jun 2024
A Survey on Machine Unlearning: Techniques and New Emerged Privacy Risks
Hengzhu Liu, Ping Xiong, Tianqing Zhu, Philip S. Yu
10 Jun 2024

Causal Estimation of Memorisation Profiles
Pietro Lesci, Clara Meister, Thomas Hofmann, Andreas Vlachos, Tiago Pimentel
06 Jun 2024

Memorization in deep learning: A survey
Jiaheng Wei, Yanjun Zhang, Leo Yu Zhang, Ming Ding, Chao Chen, Kok-Leong Ong, Jun Zhang, Yang Xiang
06 Jun 2024

Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models
Dominik Hintersdorf, Lukas Struppek, Kristian Kersting, Adam Dziedzic, Franziska Boenisch
04 Jun 2024

Inference Attacks: A Taxonomy, Survey, and Promising Directions
Feng Wu, Lei Cui, Shaowen Yao, Shui Yu
04 Jun 2024

A Novel Review of Stability Techniques for Improved Privacy-Preserving Machine Learning
Coleman DuPlessie, Aidan Gao
31 May 2024

AI Risk Management Should Incorporate Both Safety and Security
Xiangyu Qi, Yangsibo Huang, Yi Zeng, Edoardo Debenedetti, Jonas Geiping, ..., Chaowei Xiao, Bo-wen Li, Dawn Song, Peter Henderson, Prateek Mittal
[AAML]
29 May 2024

Delving into Differentially Private Transformer
Youlong Ding, Xueyang Wu, Yining Meng, Yonggang Luo, Hao Wang, Weike Pan
28 May 2024
Cross-Modal Safety Alignment: Is textual unlearning all you need?
Trishna Chakraborty, Erfan Shayegani, Zikui Cai, Nael B. Abu-Ghazaleh, M. Salman Asif, Yue Dong, A. Roy-Chowdhury, Chengyu Song
27 May 2024

OSLO: One-Shot Label-Only Membership Inference Attacks
Yuefeng Peng, Jaechul Roh, Subhransu Maji, Amir Houmansadr
27 May 2024

Data Reconstruction: When You See It and When You Don't
Edith Cohen, Haim Kaplan, Yishay Mansour, Shay Moran, Kobbi Nissim, Uri Stemmer, Eliad Tsfadia
[AAML]
24 May 2024

Better Membership Inference Privacy Measurement through Discrepancy
Ruihan Wu, Pengrun Huang, Kamalika Chaudhuri
[MIACV]
24 May 2024

The Mosaic Memory of Large Language Models
Igor Shilov, Matthieu Meeus, Yves-Alexandre de Montjoye
24 May 2024

Tiny Refinements Elicit Resilience: Toward Efficient Prefix-Model Against LLM Red-Teaming
Jiaxu Liu, Xiangyu Yin, Sihao Wu, Jianhong Wang, Meng Fang, Xinping Yi, Xiaowei Huang
21 May 2024

Data Contamination Calibration for Black-box LLMs
Wen-song Ye, Jiaqi Hu, Liyao Li, Haobo Wang, Gang Chen, Junbo Zhao
20 May 2024

Quantifying In-Context Reasoning Effects and Memorization Effects in LLMs
Siyu Lou, Yuntian Chen, Xiaodan Liang, Liang Lin, Quanshi Zhang
20 May 2024
SecureLLM: Using Compositionality to Build Provably Secure Language Models for Private, Sensitive, and Secret Data
Abdulrahman Alabdulakreem, Christian M Arnold, Yerim Lee, Pieter M Feenstra, Boris Katz, Andrei Barbu
16 May 2024

GLiRA: Black-Box Membership Inference Attack via Knowledge Distillation
Andrey V. Galichin, Mikhail Aleksandrovich Pautov, Alexey Zhavoronkin, Oleg Y. Rogov, Ivan V. Oseledets
[AAML]
13 May 2024

Shadow-Free Membership Inference Attacks: Recommender Systems Are More Vulnerable Than You Thought
Xiaoxiao Chi, Xuyun Zhang, Yan Wang, Lianyong Qi, Amin Beheshti, Xiaolong Xu, Kim-Kwang Raymond Choo, Shuo Wang, Hongsheng Hu
11 May 2024

Special Characters Attack: Toward Scalable Training Data Extraction From Large Language Models
Yang Bai, Ge Pei, Jindong Gu, Yong Yang, Xingjun Ma
09 May 2024