Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1802.08232
Cited By
v1
v2
v3 (latest)
The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks
22 February 2018
Nicholas Carlini
Chang-rui Liu
Ulfar Erlingsson
Jernej Kos
Basel Alomair
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks"
50 / 791 papers shown
Hide and Seek in Noise Labels: Noise-Robust Collaborative Active Learning with LLM-Powered Assistance
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Bo Yuan
Yulin Chen
Yin Zhang
Wei Jiang
NoLa
405
23
0
03 Apr 2025
SemEval-2025 Task 4: Unlearning sensitive content from Large Language Models
Anil Ramakrishna
Yixin Wan
Xiaomeng Jin
Kai-Wei Chang
Zhiqi Bu
Bhanukiran Vinzamuri
Volkan Cevher
Mingyi Hong
Rahul Gupta
AILaw
MU
990
6
0
02 Apr 2025
Forward Learning with Differential Privacy
Mingqian Feng
Zeliang Zhang
Jinyang Jiang
Yijie Peng
Chenliang Xu
283
0
0
01 Apr 2025
Leaking LoRa: An Evaluation of Password Leaks and Knowledge Storage in Large Language Models
Ryan Marinelli
Magnus Eckhoff
PILM
182
0
0
29 Mar 2025
Efficient Verified Machine Unlearning For Distillation
Yijun Quan
Zushu Li
Giovanni Montana
MU
258
0
0
28 Mar 2025
Instance-Level Data-Use Auditing of Visual ML Models
Zonghao Huang
Neil Zhenqiang Gong
Michael K. Reiter
MLAU
426
2
0
28 Mar 2025
Malicious and Unintentional Disclosure Risks in Large Language Models for Code Generation
Rafiqul Rabin
Sean McGregor
Nick Judd
AAML
PILM
256
0
0
27 Mar 2025
Language Models May Verbatim Complete Text They Were Not Explicitly Trained On
Katja Filippova
Christopher A. Choquette-Choo
Matthew Jagielski
Peter Kairouz
Sanmi Koyejo
Abigail Z. Jacobs
Nicolas Papernot
474
13
0
21 Mar 2025
Beyond Next Token Probabilities: Learnable, Fast Detection of Hallucinations and Data Contamination on LLM Output Distributions
Guy Bar-Shalom
Fabrizio Frasca
Derek Lim
Yoav Gelberg
Yftah Ziser
Ran El-Yaniv
Gal Chechik
Haggai Maron
424
2
0
18 Mar 2025
Privacy Auditing of Large Language Models
International Conference on Learning Representations (ICLR), 2025
Ashwinee Panda
Xinyu Tang
Milad Nasr
Christopher A. Choquette-Choo
Prateek Mittal
PILM
350
20
0
09 Mar 2025
Energy-Latency Attacks: A New Adversarial Threat to Deep Learning
H. B. Meftah
W. Hamidouche
Sid Ahmed Fezza
Olivier Déforges
AAML
228
0
0
06 Mar 2025
Memorize or Generalize? Evaluating LLM Code Generation with Code Rewriting
Lizhe Zhang
Wentao Chen
Li Zhong
Letian Peng
Zilong Wang
Jingbo Shang
ELM
350
10
0
04 Mar 2025
Machine Learners Should Acknowledge the Legal Implications of Large Language Models as Personal Data
Henrik Nolte
Michèle Finck
Kristof Meding
AILaw
PILM
456
2
0
03 Mar 2025
When Personalization Meets Reality: A Multi-Faceted Analysis of Personalized Preference Learning
Yijiang River Dong
Tiancheng Hu
Yinhong Liu
Ahmet Üstün
Nigel Collier
312
7
0
26 Feb 2025
A General Pseudonymization Framework for Cloud-Based LLMs: Replacing Privacy Information in Controlled Text Generation
Shilong Hou
Ruilin Shang
Zi Long
Xianghua Fu
Yin Chen
289
2
0
24 Feb 2025
Proactive Privacy Amnesia for Large Language Models: Safeguarding PII with Negligible Impact on Model Utility
International Conference on Learning Representations (ICLR), 2025
Martin Kuo
Jingyang Zhang
Jianyi Zhang
Minxue Tang
Louis DiValentin
...
William Chen
Amin Hass
Tianlong Chen
Yuxiao Chen
Haoyang Li
MU
KELM
415
8
0
24 Feb 2025
Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational Agents
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Ivoline Ngong
Swanand Kadhe
Hao Wang
K. Murugesan
Justin D. Weisz
Amit Dhurandhar
Karthikeyan N. Ramamurthy
279
12
0
22 Feb 2025
Interrogating LLM design under a fair learning doctrine
Johnny Tian-Zheng Wei
Maggie Wang
Ameya Godbole
Jonathan H. Choi
Robin Jia
299
0
0
22 Feb 2025
Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jaydeep Borkar
Matthew Jagielski
Katherine Lee
Niloofar Mireshghallah
David A. Smith
Christopher A. Choquette-Choo
PILM
695
7
0
21 Feb 2025
UPCORE: Utility-Preserving Coreset Selection for Balanced Unlearning
Vaidehi Patil
Elias Stengel-Eskin
Joey Tianyi Zhou
MU
CLL
393
6
0
20 Feb 2025
The Canary's Echo: Auditing Privacy Risks of LLM-Generated Synthetic Text
Matthieu Meeus
Lukas Wutschitz
Santiago Zanella Béguelin
Shruti Tople
Reza Shokri
450
7
0
19 Feb 2025
R.R.: Unveiling LLM Training Privacy through Recollection and Ranking
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Wenlong Meng
Zhenyuan Guo
Lenan Wu
Chen Gong
Wenyan Liu
Weixian Li
Chengkun Wei
Wenzhi Chen
PILM
327
4
0
18 Feb 2025
Episodic Memories Generation and Evaluation Benchmark for Large Language Models
International Conference on Learning Representations (ICLR), 2025
Alexis Huet
Zied Ben-Houidi
Dario Rossi
LLMAG
224
7
0
21 Jan 2025
Enhancing Privacy in the Early Detection of Sexual Predators Through Federated Learning and Differential Privacy
AAAI Conference on Artificial Intelligence (AAAI), 2025
Khaoula Chehbouni
Martine De Cock
Gilles Caporossi
Afaf Taik
Reihaneh Rabbany
G. Farnadi
394
4
0
21 Jan 2025
Modeling Neural Networks with Privacy Using Neural Stochastic Differential Equations
Sanghyun Hong
Fan Wu
A. Gruber
Kookjin Lee
298
0
0
12 Jan 2025
TAPFed: Threshold Secure Aggregation for Privacy-Preserving Federated Learning
IEEE Transactions on Dependable and Secure Computing (IEEE TDSC), 2024
Runhua Xu
Bo Li
Chao Li
J. Joshi
Shuai Ma
Jianxin Li
FedML
279
26
0
10 Jan 2025
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Yulei Qin
Yuncheng Yang
Pengcheng Guo
Gang Li
Hang Shao
Yuchen Shi
Zihan Xu
Yun Gu
Ke Li
Xing Sun
ALM
796
23
0
31 Dec 2024
Multi-PA: A Multi-perspective Benchmark on Privacy Assessment for Large Vision-Language Models
Jie M. Zhang
Xiangkui Cao
Zhouyu Han
Shiguang Shan
Xilin Chen
ELM
263
0
0
27 Dec 2024
Where Did Your Model Learn That? Label-free Influence for Self-supervised Learning
Nidhin Harilal
Amit Rege
Reza Akbarian Bafghi
M. Raissi
C. Monteleoni
TDI
221
0
0
22 Dec 2024
The Vulnerability of Language Model Benchmarks: Do They Accurately Reflect True LLM Performance?
Sourav Banerjee
Ayushi Agarwal
Eishkaran Singh
ELM
266
20
0
02 Dec 2024
Adversarial Sample-Based Approach for Tighter Privacy Auditing in Final Model-Only Scenarios
Sangyeon Yoon
Wonje Jeung
Albert No
403
1
0
02 Dec 2024
Efficient and Private: Memorisation under differentially private parameter-efficient fine-tuning in language models
Olivia Ma
Jonathan Passerat-Palmbach
Dmitrii Usynin
373
2
0
24 Nov 2024
Preempting Text Sanitization Utility in Resource-Constrained Privacy-Preserving LLM Interactions
Robin Carpentier
B. Zhao
Hassan Jameel Asghar
Dali Kaafar
486
1
0
18 Nov 2024
Near-Optimal Reinforcement Learning with Shuffle Differential Privacy
Shaojie Bai
Mohammad Sadegh Talebi
Chengcheng Zhao
Peng Cheng
Jiming Chen
OffRL
453
0
0
18 Nov 2024
CODECLEANER: Elevating Standards with A Robust Data Contamination Mitigation Toolkit
Jialun Cao
Songqiang Chen
Wuqi Zhang
Hau Ching Lo
Shing-Chi Cheung
284
2
0
16 Nov 2024
On the Privacy Risk of In-context Learning
Haonan Duan
Adam Dziedzic
Mohammad Yaghini
Nicolas Papernot
Franziska Boenisch
SILM
PILM
305
54
0
15 Nov 2024
Measuring Non-Adversarial Reproduction of Training Data in Large Language Models
International Conference on Learning Representations (ICLR), 2024
Michael Aerni
Javier Rando
Edoardo Debenedetti
Nicholas Carlini
Daphne Ippolito
F. Tramèr
256
13
0
15 Nov 2024
TEESlice: Protecting Sensitive Neural Network Models in Trusted Execution Environments When Attackers have Pre-Trained Models
ACM Transactions on Software Engineering and Methodology (TOSEM), 2024
Ding Li
Ziqi Zhang
Mengyu Yao
Y. Cai
Yao Guo
Xiangqun Chen
FedML
257
9
0
15 Nov 2024
On Active Privacy Auditing in Supervised Fine-tuning for White-Box Language Models
Qian Sun
Hanpeng Wu
Xi Sheryl Zhang
272
1
0
11 Nov 2024
Slowing Down Forgetting in Continual Learning
Pascal Janetzky
Tobias Schlagenhauf
Stefan Feuerriegel
CLL
448
0
0
11 Nov 2024
Unlearning in- vs. out-of-distribution data in LLMs under gradient-based method
Teodora Baluta
Pascal Lamblin
Daniel Tarlow
Fabian Pedregosa
Gintare Karolina Dziugaite
MU
240
4
0
07 Nov 2024
Membership Inference Attacks against Large Vision-Language Models
Neural Information Processing Systems (NeurIPS), 2024
Zhan Li
Yongtao Wu
Yihang Chen
F. Tonin
Elias Abad Rocamora
Volkan Cevher
208
21
0
05 Nov 2024
TDDBench: A Benchmark for Training data detection
International Conference on Learning Representations (ICLR), 2024
Zhihao Zhu
Yi Yang
Defu Lian
300
1
0
05 Nov 2024
Trustworthy Federated Learning: Privacy, Security, and Beyond
Knowledge and Information Systems (KAIS), 2024
Chunlu Chen
Ji Liu
Haowen Tan
Xingjian Li
Kevin I-Kai Wang
Peng Li
Kouichi Sakurai
Dejing Dou
FedML
293
48
0
03 Nov 2024
Do LLMs Know to Respect Copyright Notice?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jialiang Xu
Shenglan Li
Zhaozhuo Xu
Denghui Zhang
273
16
0
02 Nov 2024
WaKA: Data Attribution using K-Nearest Neighbors and Membership Privacy Principles
Proceedings on Privacy Enhancing Technologies (PoPETs), 2024
Patrick Mesana
Clément Bénesse
H. Lautraite
Gilles Caporossi
Sébastien Gambs
TDI
288
1
0
02 Nov 2024
Public Domain 12M: A Highly Aesthetic Image-Text Dataset with Novel Governance Mechanisms
Jordan Meyer
Nick Padgett
Cullen Miller
Laura Exline
203
12
0
30 Oct 2024
Take Caution in Using LLMs as Human Surrogates: Scylla Ex Machina
Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2024
Yuan Gao
Dokyun Lee
Gordon Burtch
Sina Fazelpour
LRM
513
42
0
25 Oct 2024
Does Data Contamination Detection Work (Well) for LLMs? A Survey and Evaluation on Detection Assumptions
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Yujuan Fu
Özlem Uzuner
Meliha Yetisgen
Fei Xia
450
17
0
24 Oct 2024
Uncovering Attacks and Defenses in Secure Aggregation for Federated Deep Learning
Yiwei Zhang
R. Behnia
A. Yavuz
Reza Ebrahimi
E. Bertino
FedML
235
7
0
13 Oct 2024
Previous
1
2
3
4
5
6
...
14
15
16
Next
Page 3 of 16
Page
of 16
Go