Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2310.16613
Cited By
v1
v2 (latest)
On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts
25 October 2023
Yixin Wu
Ning Yu
Michael Backes
Yun Shen
Yang Zhang
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts"
50 / 51 papers shown
Title
Patronus: Safeguarding Text-to-Image Models against White-Box Adversaries
Xinfeng Li
Shengyuan Pang
Jialin Wu
Jiangyi Deng
Huanlong Zhong
Yanjiao Chen
Jie Zhang
Wenyuan Xu
92
0
0
18 Oct 2025
Breaking Diffusion with Cache: Exploiting Approximate Caches in Diffusion Models
Desen Sun
Shuncheng Jie
Sihang Liu
DiffM
108
0
0
28 Aug 2025
Understanding Implosion in Text-to-Image Generative Models
Conference on Computer and Communications Security (CCS), 2024
Wenxin Ding
Cathy Y. Li
Shawn Shan
Ben Y. Zhao
Haitao Zheng
306
5
0
18 Sep 2024
Image-Perfect Imperfections: Safety, Bias, and Authenticity in the Shadow of Text-To-Image Model Evolution
Conference on Computer and Communications Security (CCS), 2024
Yixin Wu
Yun Shen
Michael Backes
Yang Zhang
235
7
0
30 Aug 2024
Replication in Visual Diffusion Models: A Survey and Outlook
Wenhao Wang
Yifan Sun
Zongxin Yang
Zhengdong Hu
Zhentao Tan
Yi Yang
454
15
0
07 Jul 2024
Toxic Memes: A Survey of Computational Perspectives on the Detection and Explanation of Meme Toxicities
Delfina Sol Martinez Pandiani
Erik Tjong Kim Sang
Davide Ceolin
213
8
0
11 Jun 2024
UnsafeBench: Benchmarking Image Safety Classifiers on Real-World and AI-Generated Images
Y. Qu
Xinyue Shen
Yixin Wu
Michael Backes
Savvas Zannettou
Yang Zhang
EGVM
417
35
0
06 May 2024
Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models
Neural Information Processing Systems (NeurIPS), 2024
Yuancheng Xu
Jiarui Yao
Manli Shu
Yanchao Sun
Zichu Wu
Ning Yu
Tom Goldstein
Furong Huang
AAML
277
37
0
05 Feb 2024
VA3: Virtually Assured Amplification Attack on Probabilistic Copyright Protection for Text-to-Image Generative Models
Computer Vision and Pattern Recognition (CVPR), 2023
Xiang Li
Qianli Shen
Kenji Kawaguchi
254
8
0
29 Nov 2023
Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models
Shawn Shan
Wenxin Ding
Josephine Passananti
Stanley Wu
Haitao Zheng
Ben Y. Zhao
SILM
DiffM
341
80
0
20 Oct 2023
Composite Backdoor Attacks Against Large Language Models
Hai Huang
Subrat Kishore Dutta
Michael Backes
Yun Shen
Yang Zhang
AAML
181
75
0
11 Oct 2023
Towards Safe Self-Distillation of Internet-Scale Text-to-Image Diffusion Models
Sanghyun Kim
Seohyeong Jung
Balhae Kim
Moonseok Choi
Jinwoo Shin
Juho Lee
DiffM
132
37
0
12 Jul 2023
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
International Conference on Learning Representations (ICLR), 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
1.0K
3,711
0
04 Jul 2023
Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes From Text-To-Image Models
Conference on Computer and Communications Security (CCS), 2023
Y. Qu
Xinyue Shen
Xinlei He
Michael Backes
Savvas Zannettou
Yang Zhang
213
163
0
23 May 2023
Uncurated Image-Text Datasets: Shedding Light on Demographic Bias
Computer Vision and Pattern Recognition (CVPR), 2023
Noa Garcia
Yusuke Hirota
Yankun Wu
Yuta Nakashima
EGVM
192
70
0
06 Apr 2023
Erasing Concepts from Diffusion Models
IEEE International Conference on Computer Vision (ICCV), 2023
Rohit Gandikota
Joanna Materzyñska
Jaden Fiotto-Kaufman
David Bau
DiffM
473
420
0
13 Mar 2023
On the Evolution of (Hateful) Memes by Means of Multimodal Contrastive Learning
Y. Qu
Xinlei He
S. Pierson
Michael Backes
Yang Zhang
Savvas Zannettou
154
34
0
13 Dec 2022
How to Backdoor Diffusion Models?
Computer Vision and Pattern Recognition (CVPR), 2022
Sheng-Yen Chou
Pin-Yu Chen
Tsung-Yi Ho
DiffM
SILM
409
114
0
11 Dec 2022
InstructPix2Pix: Learning to Follow Image Editing Instructions
Computer Vision and Pattern Recognition (CVPR), 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
544
2,453
0
17 Nov 2022
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2022
P. Schramowski
Manuel Brack
Bjorn Deiseroth
Kristian Kersting
489
438
0
09 Nov 2022
Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale
Conference on Fairness, Accountability and Transparency (FAccT), 2022
Federico Bianchi
Pratyusha Kalluri
Esin Durmus
Faisal Ladhak
Myra Cheng
Debora Nozza
Tatsunori Hashimoto
Dan Jurafsky
James Zou
Aylin Caliskan
DiffM
VLM
321
420
0
07 Nov 2022
Imagic: Text-Based Real Image Editing with Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2022
Bahjat Kawar
Shiran Zada
Oran Lang
Omer Tov
Hui-Tang Chang
Tali Dekel
Inbar Mosseri
Michal Irani
479
1,315
0
17 Oct 2022
LAION-5B: An open large-scale dataset for training next generation image-text models
Neural Information Processing Systems (NeurIPS), 2022
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
...
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
VLM
MLLM
CLIP
764
4,479
0
16 Oct 2022
Adapting Pretrained Vision-Language Foundational Models to Medical Imaging Domains
Pierre J. Chambon
Christian Blüthgen
C. Langlotz
Akshay S. Chaudhari
DiffM
MedIm
LM&MA
154
133
0
09 Oct 2022
Red-Teaming the Stable Diffusion Safety Filter
Javier Rando
Daniel Paleka
David Lindner
Lennard Heim
Florian Tramèr
DiffM
603
250
0
03 Oct 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Computer Vision and Pattern Recognition (CVPR), 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
869
3,685
0
25 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
International Conference on Learning Representations (ICLR), 2022
Rinon Gal
Yuval Alaluf
Yuval Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
469
2,405
0
02 Aug 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
573
1,349
0
22 Jun 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
1.0K
8,200
0
13 Apr 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
International Conference on Machine Learning (ICML), 2022
Junnan Li
Dongxu Li
Caiming Xiong
Guosheng Lin
MLLM
BDL
VLM
CLIP
1.3K
5,651
0
28 Jan 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2021
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
DiffM
1.8K
20,684
0
20 Dec 2021
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
International Conference on Machine Learning (ICML), 2021
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
960
4,327
0
20 Dec 2021
Learning Transferable Visual Models From Natural Language Supervision
International Conference on Machine Learning (ICML), 2021
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
2.0K
40,471
0
26 Feb 2021
Zero-Shot Text-to-Image Generation
International Conference on Machine Learning (ICML), 2021
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
748
5,915
0
24 Feb 2021
Neural Attention Distillation: Erasing Backdoor Triggers from Deep Neural Networks
International Conference on Learning Representations (ICLR), 2021
Yige Li
Lingjuan Lyu
Nodens Koren
X. Lyu
Yue Liu
Jiabo He
AAML
FedML
355
497
0
15 Jan 2021
Dataset Security for Machine Learning: Data Poisoning, Backdoor Attacks, and Defenses
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Micah Goldblum
Dimitris Tsipras
Chulin Xie
Xinyun Chen
Avi Schwarzschild
Basel Alomair
Aleksander Madry
Yue Liu
Tom Goldstein
SILM
425
346
0
18 Dec 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
1.3K
54,134
0
22 Oct 2020
Backdoor Attacks and Countermeasures on Deep Learning: A Comprehensive Review
Yansong Gao
Bao Gia Doan
Zhi-Li Zhang
Siqi Ma
Jiliang Zhang
Anmin Fu
Surya Nepal
Hyoungshick Kim
AAML
291
267
0
21 Jul 2020
Backdoor Learning: A Survey
IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2020
Yiming Li
Yong Jiang
Zhifeng Li
Shutao Xia
AAML
545
730
0
17 Jul 2020
Data Poisoning Attacks Against Federated Learning Systems
European Symposium on Research in Computer Security (ESORICS), 2020
Vale Tolpegin
Stacey Truex
Mehmet Emre Gursoy
Ling Liu
FedML
228
815
0
16 Jul 2020
Attack of the Tails: Yes, You Really Can Backdoor Federated Learning
Neural Information Processing Systems (NeurIPS), 2020
Hongyi Wang
Kartik K. Sreenivasan
Shashank Rajput
Harit Vishwakarma
Saurabh Agarwal
Jy-yong Sohn
Kangwook Lee
Dimitris Papailiopoulos
FedML
263
721
0
09 Jul 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
4.5K
25,255
0
19 Jun 2020
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
Douwe Kiela
Hamed Firooz
Aravind Mohan
Vedanuj Goswami
Amanpreet Singh
Pratik Ringshia
Davide Testuggine
318
753
0
10 May 2020
Decision-Making with Auto-Encoding Variational Bayes
Neural Information Processing Systems (NeurIPS), 2020
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
1.5K
19,430
0
17 Feb 2020
Poison Frogs! Targeted Clean-Label Poisoning Attacks on Neural Networks
Ali Shafahi
Wenjie Huang
Mahyar Najibi
Octavian Suciu
Christoph Studer
Tudor Dumitras
Tom Goldstein
AAML
517
1,195
0
03 Apr 2018
Manipulating Machine Learning: Poisoning Attacks and Countermeasures for Regression Learning
Matthew Jagielski
Alina Oprea
Battista Biggio
Chang-rui Liu
Cristina Nita-Rotaru
Yue Liu
AAML
292
832
0
01 Apr 2018
Technical Report: When Does Machine Learning FAIL? Generalized Transferability for Evasion and Poisoning Attacks
Octavian Suciu
R. Marginean
Yigitcan Kaya
Hal Daumé
Tudor Dumitras
AAML
287
308
0
19 Mar 2018
Machine Learning Models that Remember Too Much
Congzheng Song
Thomas Ristenpart
Vitaly Shmatikov
VLM
165
555
0
22 Sep 2017
BadNets: Identifying Vulnerabilities in the Machine Learning Model Supply Chain
Tianyu Gu
Brendan Dolan-Gavitt
S. Garg
SILM
534
2,011
0
22 Aug 2017
Understanding Black-box Predictions via Influence Functions
Pang Wei Koh
Abigail Z. Jacobs
TDI
486
3,272
0
14 Mar 2017
1
2
Next