2402.15159
Machine Unlearning of Pre-trained Large Language Models
23 February 2024
Jin Yao
Eli Chien
Minxin Du
Xinyao Niu
Tianhao Wang
Zezhou Cheng
Xiang Yue
MU
Papers citing
"Machine Unlearning of Pre-trained Large Language Models"
39 / 39 papers shown
Title
Geometric-Disentanglement Unlearning
Duo Zhou
Yuji Zhang
Tianxin Wei
Ruizhong Qiu
Ke Yang
...
Cheng Qian
Jingrui He
Hanghang Tong
Heng Ji
Huan Zhang
MU
189
0
0
21 Nov 2025
A Survey on Unlearning in Large Language Models
Ruichen Qiu
Jiajun Tan
Jiayue Pu
Honglin Wang
Xiao-Shan Gao
Fei Sun
MU
AILaw
PILM
590
0
0
29 Oct 2025
Reducing the Probability of Undesirable Outputs in Language Models Using Probabilistic Inference
S. Zhao
Aidan Li
Rob Brekelmans
Roger C. Grosse
68
0
0
24 Oct 2025
KnowledgeSmith: Uncovering Knowledge Updating in LLMs with Model Editing and Unlearning
Yinyi Luo
Z. Zhou
Hao Chen
Kai Qiu
Marios Savvides
Shouqing Yang
James Evans
KELM
MU
152
0
0
01 Oct 2025
Scalable and Robust LLM Unlearning by Correcting Responses with Retrieved Exclusions
Junbeom Kim
Kyuyoung Kim
Jihoon Tack
Dongha Lim
Jinwoo Shin
MU
KELM
137
1
0
30 Sep 2025
Erase or Hide? Suppressing Spurious Unlearning Neurons for Robust Unlearning
Nakyeong Yang
Dong-Kyum Kim
Jea Kwon
Minsung Kim
Kyomin Jung
M. Cha
MU
KELM
100
0
0
26 Sep 2025
CUFG: Curriculum Unlearning Guided by the Forgetting Gradient
Jiaxing Miao
Liang Hu
Qi Zhang
Lai Zhong Yuan
Usman Naseem
MU
122
0
0
18 Sep 2025
Towards Mitigating Excessive Forgetting in LLM Unlearning via Entanglement-Aware Unlearning with Proxy Constraint
Zhihao Liu
Jian Lou
Yuke Hu
Xiaochen Li
Tailun Chen
Yitian Chen
Zhan Qin
MU
148
0
0
28 Aug 2025
A Survey on Generative Model Unlearning: Fundamentals, Taxonomy, Evaluation, and Future Direction
Xiaohua Feng
Jiaming Zhang
Fengyuan Yu
C. Wang
Li Zhang
Kaixiang Li
Yuyuan Li
Chaochao Chen
Jianwei Yin
MU
218
2
0
26 Jul 2025
PULSE: Practical Evaluation Scenarios for Large Multimodal Model Unlearning
Tatsuki Kawakami
Kazuki Egashira
Atsuyuki Miyai
Go Irie
Kiyoharu Aizawa
MU
303
1
0
02 Jul 2025
Learning-Time Encoding Shapes Unlearning in LLMs
Ruihan Wu
Konstantin Garov
Kamalika Chaudhuri
MU
200
0
0
18 Jun 2025
GUARD: Guided Unlearning and Retention via Data Attribution for Large Language Models
Peizhi Niu
Duo Zhou
Huiting Zhou
Huan Zhang
S. Rasoul Etesami
MU
359
0
0
12 Jun 2025
Lifting Data-Tracing Machine Unlearning to Knowledge-Tracing for Foundation Models
Yuwen Tan
Boqing Gong
MU
231
1
0
12 Jun 2025
SoK: Machine Unlearning for Large Language Models
Jie Ren
Yue Xing
Yingqian Cui
Charu C. Aggarwal
Hui Liu
MU
162
2
0
10 Jun 2025
LLM Unlearning Should Be Form-Independent
Xiaotian Ye
Mengqi Zhang
Shu Wu
MU
213
0
0
09 Jun 2025
Do LLMs Really Forget? Evaluating Unlearning with Knowledge Correlation and Confidence Awareness
Rongzhe Wei
Peizhi Niu
Hans Hao-Hsun Hsu
Ruihan Wu
Haoteng Yin
...
Vamsi K. Potluru
Eli Chien
Kamalika Chaudhuri
S. Rasoul Etesami
P. Li
MU
KELM
467
6
0
06 Jun 2025
Distillation Robustifies Unlearning
Bruce W. Lee
Addie Foote
Alex Infanger
Leni Shor
Harish Kamath
Jacob Goldman-Wetzler
Bryce Woodworth
Alex Cloud
Alexander Matt Turner
MU
377
4
0
06 Jun 2025
Invariance Makes LLM Unlearning Resilient Even to Unanticipated Downstream Fine-Tuning
Changsheng Wang
Yihua Zhang
Jinghan Jia
Parikshit Ram
Dennis L. Wei
Yuguang Yao
Soumyadeep Pal
Nathalie Baracaldo
Sijia Liu
MU
218
4
0
02 Jun 2025
WaterDrum: Watermarking for Data-centric Unlearning Metric
Xinyang Lu
Xinyuan Niu
Gregory Kang Ruey Lau
Bui Thi Cam Nhung
Rachael Hwee Ling Sim
Fanyu Wen
Chuan-Sheng Foo
Szu Hui Ng
Bryan Kian Hsiang Low
MU
249
4
0
08 May 2025
DP2Unlearning: An Efficient and Guaranteed Unlearning Framework for LLMs
Neural Networks (NN), 2025
Tamim Al Mahmud
N. Jebreel
Josep Domingo-Ferrer
David Sánchez
MU
252
0
0
18 Apr 2025
GRAIL: Gradient-Based Adaptive Unlearning for Privacy and Copyright in LLMs
Kun-Woo Kim
Ji-Hoon Park
Ju-Min Han
Seong-Whan Lee
MU
PILM
297
2
0
17 Apr 2025
SAUCE: Selective Concept Unlearning in Vision-Language Models with Sparse Autoencoders
Qing Li
Fauzan Farooqui
Derui Zhu
Fengyu Cai
Chenyang Lyu
Fakhri Karray
MU
249
3
0
16 Mar 2025
CE-U: Cross Entropy Unlearning
Bo Yang
MU
397
1
0
03 Mar 2025
Towards Label-Only Membership Inference Attack against Pre-trained Large Language Models
Yu He
Boheng Li
Lu Liu
Zhongjie Ba
Wei Dong
Yiming Li
Zhan Qin
Kui Ren
Chong Chen
MIALM
434
14
0
26 Feb 2025
Proactive Privacy Amnesia for Large Language Models: Safeguarding PII with Negligible Impact on Model Utility
International Conference on Learning Representations (ICLR), 2025
Martin Kuo
Jingyang Zhang
Jianyi Zhang
Minxue Tang
Louis DiValentin
...
William Chen
Amin Hass
Tianlong Chen
Yuxiao Chen
Haoyang Li
MU
KELM
370
7
0
24 Feb 2025
A Comprehensive Survey of Machine Unlearning Techniques for Large Language Models
Fauzan Farooqui
Qing Li
Herbert Woisetschlaeger
Zongxiong Chen
Longji Xu
Preslav Nakov
Hans-Arno Jacobsen
Fakhri Karray
MU
287
14
0
22 Feb 2025
Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset
International Conference on Learning Representations (ICLR), 2024
Yingzi Ma
Jiongxiao Wang
Haiwei Yang
Siyuan Ma
Jiazhao Li
...
B. Li
Yejin Choi
Mengzhao Chen
Chaowei Xiao
MU
397
13
0
05 Nov 2024
WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models
Neural Information Processing Systems (NeurIPS), 2024
Jinghan Jia
Jiancheng Liu
Yihua Zhang
Parikshit Ram
Nathalie Baracaldo
Sijia Liu
MU
365
15
0
23 Oct 2024
When Machine Unlearning Meets Retrieval-Augmented Generation (RAG): Keep Secret or Forget Knowledge?
IEEE Transactions on Dependable and Secure Computing (IEEE TDSC), 2024
Shang Wang
Tianqing Zhu
Dayong Ye
Wanlei Zhou
MU
320
10
0
20 Oct 2024
Meta-Unlearning on Diffusion Models: Preventing Relearning Unlearned Concepts
Hongcheng Gao
Tianyu Pang
Chao Du
Taihang Hu
Zhijie Deng
Min Lin
DiffM
248
17
0
16 Oct 2024
A Closer Look at Machine Unlearning for Large Language Models
International Conference on Learning Representations (ICLR), 2024
Xiaojian Yuan
Tianyu Pang
Chao Du
Kejiang Chen
Weiming Zhang
Min Lin
MU
716
27
0
10 Oct 2024
Mitigating Memorization In Language Models
Mansi Sakarvadia
Aswathy Ajith
Arham Khan
Nathaniel Hudson
Caleb Geniesse
Kyle Chard
Yaoqing Yang
Ian Foster
Michael W. Mahoney
KELM
MU
306
8
0
03 Oct 2024
Composable Interventions for Language Models
Arinbjorn Kolbeinsson
Kyle O'Brien
Tianjin Huang
Shanghua Gao
Shiwei Liu
...
Anurag J. Vaidya
Faisal Mahmood
Marinka Zitnik
Tianlong Chen
Thomas Hartvigsen
KELM
MU
461
4
0
09 Jul 2024
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models
Zhuoran Jin
Pengfei Cao
Chenhao Wang
Zhitao He
Hongbang Yuan
Jiachun Li
Yubo Chen
Kang Liu
Jun Zhao
KELM
MU
291
48
0
16 Jun 2024
Large Scale Knowledge Washing
Yu Wang
Ruihan Wu
Zexue He
Xinyu Chen
Julian McAuley
MU
KELM
366
11
0
26 May 2024
Single Image Unlearning: Efficient Machine Unlearning in Multimodal Large Language Models
Jiaqi Li
Qianshan Wei
Chuanyi Zhang
Guilin Qi
Miaozeng Du
Yongrui Chen
Sheng Bi
Fan Liu
VLM
MU
436
30
0
21 May 2024
Offset Unlearning for Large Language Models
James Y. Huang
Wenxuan Zhou
Fei Wang
Fred Morstatter
Sheng Zhang
Hoifung Poon
Muhao Chen
MU
363
25
0
17 Apr 2024
Min-K%++: Improved Baseline for Detecting Pre-Training Data from Large Language Models
Jingyang Zhang
Jingwei Sun
Eric C. Yeats
Ouyang Yang
Martin Kuo
Jianyi Zhang
Hao Frank Yang
Hai "Helen" Li
610
78
0
03 Apr 2024
Yi: Open Foundation Models by 01.AI
01.AI
Alex Young
Bei Chen
Chao Li
...
Yue Wang
Yuxuan Cai
Zhenyu Gu
Zhiyuan Liu
Zonghong Dai
OSLM
LRM
773
756
0
07 Mar 2024