2211.05105
Cited By
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models
9 November 2022
P. Schramowski
Manuel Brack
Bjorn Deiseroth
Kristian Kersting
Papers citing
"Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models"
50 / 211 papers shown
Title
Towards SFW sampling for diffusion models via external conditioning
Camilo Carvajal Reyes
J. Fontbona
Felipe A. Tobar
DiffM
15
0
0
12 May 2025
Jailbreaking the Text-to-Video Generative Models
Jiayang Liu
Siyuan Liang
Shiqian Zhao
Rongcheng Tu
Wenbo Zhou
Xiaochun Cao
D. Tao
Siew Kei Lam
EGVM
VGen
39
0
0
10 May 2025
The Dual Power of Interpretable Token Embeddings: Jailbreaking Attacks and Defenses for Diffusion Model Unlearning
Siyi Chen
Yimeng Zhang
Sijia Liu
Q. Qu
AAML
82
0
0
30 Apr 2025
Erased but Not Forgotten: How Backdoors Compromise Concept Erasure
Jonas Henry Grebe
Tobias Braun
Marcus Rohrbach
Anna Rohrbach
AAML
77
0
0
29 Apr 2025
DualOptim: Enhancing Efficacy and Stability in Machine Unlearning with Dual Optimizers
Xuyang Zhong
Haochen Luo
Chen Liu
MU
25
0
0
22 Apr 2025
What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale
Xiaoyong Yuan
Xiaolong Ma
Linke Guo
Lan Zhang
DiffM
37
0
0
21 Apr 2025
Towards NSFW-Free Text-to-Image Generation via Safety-Constraint Direct Preference Optimization
Shouwei Ruan
Zhenyu Wu
Yao Huang
Ruochen Zhang
Yitong Sun
Caixin Kang
Xingxing Wei
EGVM
35
0
0
19 Apr 2025
Set You Straight: Auto-Steering Denoising Trajectories to Sidestep Unwanted Concepts
Leyang Li
Shilin Lu
Yan Ren
A. Kong
DiffM
40
1
0
17 Apr 2025
ACE: Attentional Concept Erasure in Diffusion Models
Finn Carter
DiffM
52
0
0
16 Apr 2025
Token-Level Constraint Boundary Search for Jailbreaking Text-to-Image Models
J. Liu
Zhaoxin Wang
Handing Wang
Cong Tian
Yaochu Jin
26
0
0
15 Apr 2025
Sculpting Memory: Multi-Concept Forgetting in Diffusion Models via Dynamic Mask and Concept-Aware Optimization
Gen Li
Yang Xiao
Jie Ji
Kaiyuan Deng
Bo Hui
Linke Guo
Xiaolong Ma
24
0
0
12 Apr 2025
Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking
Junxi Chen
Junhao Dong
Xiaohua Xie
33
0
0
08 Apr 2025
Prompting Forgetting: Unlearning in GANs via Textual Guidance
Piyush Nagasubramaniam
Neeraj Karamchandani
Chen Wu
Sencun Zhu
DiffM
AILaw
MU
54
0
0
01 Apr 2025
ShieldGemma 2: Robust and Tractable Image Content Moderation
Wenjun Zeng
D. Kurniawan
Ryan Mullins
Yuchi Liu
Tamoghna Saha
...
Mani Malek
Hamid Palangi
Joon Baek
Rick Pereira
Karthik Narasimhan
AI4MH
31
0
0
01 Apr 2025
Decoupled Distillation to Erase: A General Unlearning Method for Any Class-centric Tasks
Yu Zhou
Dian Zheng
Qijie Mo
Renjie Lu
Kun-Yu Lin
Wei-Shi Zheng
MU
68
0
0
31 Mar 2025
Fine-Grained Erasure in Text-to-Image Diffusion-based Foundation Models
K. Thakral
Tamar Glaser
Tal Hassner
Mayank Vatsa
Richa Singh
41
2
0
25 Mar 2025
Safe and Reliable Diffusion Models via Subspace Projection
Huiqiang Chen
Tianqing Zhu
Linlin Wang
Xin Yu
Longxiang Gao
Wanlei Zhou
DiffM
55
2
0
21 Mar 2025
Towards Understanding the Safety Boundaries of DeepSeek Models: Evaluation and Findings
Zonghao Ying
Guangyi Zheng
Yongxin Huang
Deyue Zhang
Wenxin Zhang
Quanchen Zou
Aishan Liu
X. Liu
Dacheng Tao
ELM
74
6
0
19 Mar 2025
Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization
Feifei Li
Mi Zhang
Yiming Sun
Min Yang
DiffM
51
1
0
19 Mar 2025
TarPro: Targeted Protection against Malicious Image Editing
Kaixin Shen
Ruijie Quan
Jiaxu Miao
Jun Xiao
Yi Yang
60
1
0
18 Mar 2025
CRCE: Coreference-Retention Concept Erasure in Text-to-Image Diffusion Models
Yuyang Xue
Edward Moroshko
Feng Chen
Steven G. McDonagh
Sotirios A. Tsaftaris
56
1
0
18 Mar 2025
Hyperbolic Safety-Aware Vision-Language Models
Tobia Poppi
Tejaswi Kasarla
Pascal Mettes
Lorenzo Baraldi
Rita Cucchiara
VLM
MU
56
0
0
15 Mar 2025
Safe Vision-Language Models via Unsafe Weights Manipulation
Moreno D'Incà
E. Peruzzo
Xingqian Xu
Humphrey Shi
N. Sebe
Massimiliano Mancini
MU
55
0
0
14 Mar 2025
Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models
Reza Shirkavand
Peiran Yu
Shangqian Gao
Gowthami Somepalli
Tom Goldstein
Heng-Chiao Huang
113
1
0
13 Mar 2025
Sparse Autoencoder as a Zero-Shot Classifier for Concept Erasing in Text-to-Image Diffusion Models
Zhihua Tian
Sirun Nan
Ming Xu
Shengfang Zhai
Wenjie Qu
Jian Liu
Kui Ren
Ruoxi Jia
Jiaheng Zhang
DiffM
85
1
0
12 Mar 2025
Controlling Latent Diffusion Using Latent CLIP
Jason Becker
Chris Wendler
Peter Baylies
Robert West
Christian Wressnegger
DiffM
VLM
63
0
0
11 Mar 2025
ACE: Concept Editing in Diffusion Models without Performance Degradation
Ruipeng Wang
Junfeng Fang
Jiaqi Li
Hao Chen
Jie Shi
K. Wang
X. Wang
DiffM
51
2
0
11 Mar 2025
Exploring Bias in over 100 Text-to-Image Generative Models
J. Vice
Naveed Akhtar
Richard I. Hartley
Ajmal Saeed Mian
EGVM
67
2
0
11 Mar 2025
TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models
Ruidong Chen
Honglin Guo
Lanjun Wang
Chenyu Zhang
Weizhi Nie
An-an Liu
DiffM
64
1
0
10 Mar 2025
SPEED: Scalable, Precise, and Efficient Concept Erasure for Diffusion Models
Ouxiang Li
Yuan Wang
Xinting Hu
Houcheng Jiang
Tao Liang
Y. Hao
Guojun Ma
Fuli Feng
DiffM
44
1
0
10 Mar 2025
Learning to Unlearn while Retaining: Combating Gradient Conflicts in Machine Unlearning
Gaurav Patel
Qiang Qiu
MU
60
1
0
08 Mar 2025
Jailbreaking Safeguarded Text-to-Image Models via Large Language Models
Zhengyuan Jiang
Yuepeng Hu
Y. Yang
Yinzhi Cao
Neil Gong
62
0
0
03 Mar 2025
Data Unlearning in Diffusion Models
Silas Alberti
Kenan Hasanaliyev
Manav Shah
Stefano Ermon
DiffM
MU
34
1
0
02 Mar 2025
SafeText: Safe Text-to-image Models via Aligning the Text Encoder
Yuepeng Hu
Zhengyuan Jiang
Neil Zhenqiang Gong
45
1
0
28 Feb 2025
Unified Prompt Attack Against Text-to-Image Generation Models
Duo Peng
Qiuhong Ke
Mark He Huang
Ping Hu
J. Liu
41
0
0
23 Feb 2025
A Systematic Review of Open Datasets Used in Text-to-Image (T2I) Gen AI Model Safety
Rakeen Rouf
Trupti Bavalatti
Osama Ahmed
Dhaval Potdar
Faraz Jawed
EGVM
58
1
0
23 Feb 2025
Concept Corrector: Erase concepts on the fly for text-to-image diffusion models
Zheling Meng
Bo Peng
Xiaochuan Jin
Yueming Lyu
Wei Wang
Jing Dong
DiffM
38
2
0
22 Feb 2025
T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation
Lijun Li
Zhelun Shi
Xuhao Hu
Bowen Dong
Yiran Qin
Xihui Liu
Lu Sheng
Jing Shao
112
1
0
21 Feb 2025
Robust Concept Erasure Using Task Vectors
Minh Pham
Kelly O. Marshall
Chinmay Hegde
Niv Cohen
112
17
0
21 Feb 2025
Precise Parameter Localization for Textual Generation in Diffusion Models
Łukasz Staniszewski
Bartosz Cywiński
Franziska Boenisch
Kamil Deja
Adam Dziedzic
DiffM
104
0
0
17 Feb 2025
A Comprehensive Survey on Concept Erasure in Text-to-Image Diffusion Models
Changhoon Kim
Yanjun Qi
DiffM
33
1
0
17 Feb 2025
SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders
Bartosz Cywiński
Kamil Deja
DiffM
61
6
0
29 Jan 2025
CE-SDWV: Effective and Efficient Concept Erasure for Text-to-Image Diffusion Models via a Semantic-Driven Word Vocabulary
Jiahang Tu
Qian Feng
Chufan Chen
Jiahua Dong
Hanbin Zhao
Chao Zhang
Hui Qian
72
2
0
28 Jan 2025
Direct Unlearning Optimization for Robust and Safe Text-to-Image Models
Yong-Hyun Park
Sangdoo Yun
Jin-Hwa Kim
Junho Kim
Geonhui Jang
Yonghyun Jeong
Junghyo Jo
Gayoung Lee
73
13
0
17 Jan 2025
Focus-N-Fix: Region-Aware Fine-Tuning for Text-to-Image Generation
Xiaoying Xing
Avinab Saha
Junfeng He
Susan Hao
Paul Vicol
...
Sahil Singla
Sarah Young
Yinxiao Li
Feng Yang
Deepak Ramachandran
DiffM
48
0
0
11 Jan 2025
DuMo: Dual Encoder Modulation Network for Precise Concept Erasure
Feng Han
Kai-xiang Chen
Chao Gong
Zhipeng Wei
Jingjing Chen
Yu-Gang Jiang
39
2
0
03 Jan 2025
Dynamic Negative Guidance of Diffusion Models
Felix Koulischer
Johannes Deleu
G. Raya
T. Demeester
L. Ambrogioni
DiffM
49
2
0
03 Jan 2025
Ethical-Lens: Curbing Malicious Usages of Open-Source Text-to-Image Models
Yuzhu Cai
Sheng Yin
Yuxi Wei
Chenxin Xu
Weibo Mao
Felix Juefei Xu
Siheng Chen
Yanfeng Wang
EGVM
79
2
0
03 Jan 2025
SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation
Runtao Liu
Chen I Chieh
Jindong Gu
Jipeng Zhang
Renjie Pi
Qifeng Chen
Philip H. S. Torr
Ashkan Khakzar
Fabio Pizzati
EGVM
99
0
0
13 Dec 2024
Buster: Implanting Semantic Backdoor into Text Encoder to Mitigate NSFW Content Generation
Xin Zhao
Xiaojun Chen
Yuexin Xuan
Zhendong Zhao
Xiaojun Jia
Xinfeng Li
Xiaofeng Wang
72
0
0
10 Dec 2024