ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.16613
  4. Cited By
On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts
v1v2 (latest)

On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts

25 October 2023
Yixin Wu
Ning Yu
Michael Backes
Yun Shen
Yang Zhang
    DiffM
ArXiv (abs)PDFHTML

Papers citing "On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts"

50 / 51 papers shown
Title
Patronus: Safeguarding Text-to-Image Models against White-Box Adversaries
Patronus: Safeguarding Text-to-Image Models against White-Box Adversaries
Xinfeng Li
Shengyuan Pang
Jialin Wu
Jiangyi Deng
Huanlong Zhong
Yanjiao Chen
Jie Zhang
Wenyuan Xu
92
0
0
18 Oct 2025
Breaking Diffusion with Cache: Exploiting Approximate Caches in Diffusion Models
Breaking Diffusion with Cache: Exploiting Approximate Caches in Diffusion Models
Desen Sun
Shuncheng Jie
Sihang Liu
DiffM
108
0
0
28 Aug 2025
Understanding Implosion in Text-to-Image Generative Models
Understanding Implosion in Text-to-Image Generative ModelsConference on Computer and Communications Security (CCS), 2024
Wenxin Ding
Cathy Y. Li
Shawn Shan
Ben Y. Zhao
Haitao Zheng
306
5
0
18 Sep 2024
Image-Perfect Imperfections: Safety, Bias, and Authenticity in the
  Shadow of Text-To-Image Model Evolution
Image-Perfect Imperfections: Safety, Bias, and Authenticity in the Shadow of Text-To-Image Model EvolutionConference on Computer and Communications Security (CCS), 2024
Yixin Wu
Yun Shen
Michael Backes
Yang Zhang
235
7
0
30 Aug 2024
Replication in Visual Diffusion Models: A Survey and Outlook
Replication in Visual Diffusion Models: A Survey and Outlook
Wenhao Wang
Yifan Sun
Zongxin Yang
Zhengdong Hu
Zhentao Tan
Yi Yang
454
15
0
07 Jul 2024
Toxic Memes: A Survey of Computational Perspectives on the Detection and
  Explanation of Meme Toxicities
Toxic Memes: A Survey of Computational Perspectives on the Detection and Explanation of Meme Toxicities
Delfina Sol Martinez Pandiani
Erik Tjong Kim Sang
Davide Ceolin
213
8
0
11 Jun 2024
UnsafeBench: Benchmarking Image Safety Classifiers on Real-World and AI-Generated Images
UnsafeBench: Benchmarking Image Safety Classifiers on Real-World and AI-Generated Images
Y. Qu
Xinyue Shen
Yixin Wu
Michael Backes
Savvas Zannettou
Yang Zhang
EGVM
417
35
0
06 May 2024
Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language
  Models
Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language ModelsNeural Information Processing Systems (NeurIPS), 2024
Yuancheng Xu
Jiarui Yao
Manli Shu
Yanchao Sun
Zichu Wu
Ning Yu
Tom Goldstein
Furong Huang
AAML
277
37
0
05 Feb 2024
VA3: Virtually Assured Amplification Attack on Probabilistic Copyright
  Protection for Text-to-Image Generative Models
VA3: Virtually Assured Amplification Attack on Probabilistic Copyright Protection for Text-to-Image Generative ModelsComputer Vision and Pattern Recognition (CVPR), 2023
Xiang Li
Qianli Shen
Kenji Kawaguchi
254
8
0
29 Nov 2023
Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image
  Generative Models
Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models
Shawn Shan
Wenxin Ding
Josephine Passananti
Stanley Wu
Haitao Zheng
Ben Y. Zhao
SILMDiffM
341
80
0
20 Oct 2023
Composite Backdoor Attacks Against Large Language Models
Composite Backdoor Attacks Against Large Language Models
Hai Huang
Subrat Kishore Dutta
Michael Backes
Yun Shen
Yang Zhang
AAML
181
75
0
11 Oct 2023
Towards Safe Self-Distillation of Internet-Scale Text-to-Image Diffusion
  Models
Towards Safe Self-Distillation of Internet-Scale Text-to-Image Diffusion Models
Sanghyun Kim
Seohyeong Jung
Balhae Kim
Moonseok Choi
Jinwoo Shin
Juho Lee
DiffM
132
37
0
12 Jul 2023
SDXL: Improving Latent Diffusion Models for High-Resolution Image
  Synthesis
SDXL: Improving Latent Diffusion Models for High-Resolution Image SynthesisInternational Conference on Learning Representations (ICLR), 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
1.0K
3,711
0
04 Jul 2023
Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes
  From Text-To-Image Models
Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes From Text-To-Image ModelsConference on Computer and Communications Security (CCS), 2023
Y. Qu
Xinyue Shen
Xinlei He
Michael Backes
Savvas Zannettou
Yang Zhang
213
163
0
23 May 2023
Uncurated Image-Text Datasets: Shedding Light on Demographic Bias
Uncurated Image-Text Datasets: Shedding Light on Demographic BiasComputer Vision and Pattern Recognition (CVPR), 2023
Noa Garcia
Yusuke Hirota
Yankun Wu
Yuta Nakashima
EGVM
192
70
0
06 Apr 2023
Erasing Concepts from Diffusion Models
Erasing Concepts from Diffusion ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Rohit Gandikota
Joanna Materzyñska
Jaden Fiotto-Kaufman
David Bau
DiffM
473
420
0
13 Mar 2023
On the Evolution of (Hateful) Memes by Means of Multimodal Contrastive
  Learning
On the Evolution of (Hateful) Memes by Means of Multimodal Contrastive Learning
Y. Qu
Xinlei He
S. Pierson
Michael Backes
Yang Zhang
Savvas Zannettou
154
34
0
13 Dec 2022
How to Backdoor Diffusion Models?
How to Backdoor Diffusion Models?Computer Vision and Pattern Recognition (CVPR), 2022
Sheng-Yen Chou
Pin-Yu Chen
Tsung-Yi Ho
DiffMSILM
409
114
0
11 Dec 2022
InstructPix2Pix: Learning to Follow Image Editing Instructions
InstructPix2Pix: Learning to Follow Image Editing InstructionsComputer Vision and Pattern Recognition (CVPR), 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
544
2,453
0
17 Nov 2022
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in
  Diffusion Models
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2022
P. Schramowski
Manuel Brack
Bjorn Deiseroth
Kristian Kersting
489
438
0
09 Nov 2022
Easily Accessible Text-to-Image Generation Amplifies Demographic
  Stereotypes at Large Scale
Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large ScaleConference on Fairness, Accountability and Transparency (FAccT), 2022
Federico Bianchi
Pratyusha Kalluri
Esin Durmus
Faisal Ladhak
Myra Cheng
Debora Nozza
Tatsunori Hashimoto
Dan Jurafsky
James Zou
Aylin Caliskan
DiffMVLM
321
420
0
07 Nov 2022
Imagic: Text-Based Real Image Editing with Diffusion Models
Imagic: Text-Based Real Image Editing with Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2022
Bahjat Kawar
Shiran Zada
Oran Lang
Omer Tov
Hui-Tang Chang
Tali Dekel
Inbar Mosseri
Michal Irani
479
1,315
0
17 Oct 2022
LAION-5B: An open large-scale dataset for training next generation
  image-text models
LAION-5B: An open large-scale dataset for training next generation image-text modelsNeural Information Processing Systems (NeurIPS), 2022
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
...
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
VLMMLLMCLIP
764
4,479
0
16 Oct 2022
Adapting Pretrained Vision-Language Foundational Models to Medical
  Imaging Domains
Adapting Pretrained Vision-Language Foundational Models to Medical Imaging Domains
Pierre J. Chambon
Christian Blüthgen
C. Langlotz
Akshay S. Chaudhari
DiffMMedImLM&MA
154
133
0
09 Oct 2022
Red-Teaming the Stable Diffusion Safety Filter
Red-Teaming the Stable Diffusion Safety Filter
Javier Rando
Daniel Paleka
David Lindner
Lennard Heim
Florian Tramèr
DiffM
603
250
0
03 Oct 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for
  Subject-Driven Generation
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven GenerationComputer Vision and Pattern Recognition (CVPR), 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
869
3,685
0
25 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using
  Textual Inversion
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual InversionInternational Conference on Learning Representations (ICLR), 2022
Rinon Gal
Yuval Alaluf
Yuval Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
469
2,405
0
02 Aug 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
573
1,349
0
22 Jun 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLMDiffM
1.0K
8,200
0
13 Apr 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified
  Vision-Language Understanding and Generation
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and GenerationInternational Conference on Machine Learning (ICML), 2022
Junnan Li
Dongxu Li
Caiming Xiong
Guosheng Lin
MLLMBDLVLMCLIP
1.3K
5,651
0
28 Jan 2022
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2021
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
DiffM
1.8K
20,684
0
20 Dec 2021
GLIDE: Towards Photorealistic Image Generation and Editing with
  Text-Guided Diffusion Models
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion ModelsInternational Conference on Machine Learning (ICML), 2021
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
960
4,327
0
20 Dec 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language SupervisionInternational Conference on Machine Learning (ICML), 2021
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIPVLM
2.0K
40,471
0
26 Feb 2021
Zero-Shot Text-to-Image Generation
Zero-Shot Text-to-Image GenerationInternational Conference on Machine Learning (ICML), 2021
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
748
5,915
0
24 Feb 2021
Neural Attention Distillation: Erasing Backdoor Triggers from Deep
  Neural Networks
Neural Attention Distillation: Erasing Backdoor Triggers from Deep Neural NetworksInternational Conference on Learning Representations (ICLR), 2021
Yige Li
Lingjuan Lyu
Nodens Koren
X. Lyu
Yue Liu
Jiabo He
AAMLFedML
355
497
0
15 Jan 2021
Dataset Security for Machine Learning: Data Poisoning, Backdoor Attacks,
  and Defenses
Dataset Security for Machine Learning: Data Poisoning, Backdoor Attacks, and DefensesIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Micah Goldblum
Dimitris Tsipras
Chulin Xie
Xinyun Chen
Avi Schwarzschild
Basel Alomair
Aleksander Madry
Yue Liu
Tom Goldstein
SILM
425
346
0
18 Dec 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at
  Scale
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
1.3K
54,134
0
22 Oct 2020
Backdoor Attacks and Countermeasures on Deep Learning: A Comprehensive
  Review
Backdoor Attacks and Countermeasures on Deep Learning: A Comprehensive Review
Yansong Gao
Bao Gia Doan
Zhi-Li Zhang
Siqi Ma
Jiliang Zhang
Anmin Fu
Surya Nepal
Hyoungshick Kim
AAML
291
267
0
21 Jul 2020
Backdoor Learning: A Survey
Backdoor Learning: A SurveyIEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2020
Yiming Li
Yong Jiang
Zhifeng Li
Shutao Xia
AAML
545
730
0
17 Jul 2020
Data Poisoning Attacks Against Federated Learning Systems
Data Poisoning Attacks Against Federated Learning SystemsEuropean Symposium on Research in Computer Security (ESORICS), 2020
Vale Tolpegin
Stacey Truex
Mehmet Emre Gursoy
Ling Liu
FedML
228
815
0
16 Jul 2020
Attack of the Tails: Yes, You Really Can Backdoor Federated Learning
Attack of the Tails: Yes, You Really Can Backdoor Federated LearningNeural Information Processing Systems (NeurIPS), 2020
Hongyi Wang
Kartik K. Sreenivasan
Shashank Rajput
Harit Vishwakarma
Saurabh Agarwal
Jy-yong Sohn
Kangwook Lee
Dimitris Papailiopoulos
FedML
263
721
0
09 Jul 2020
Denoising Diffusion Probabilistic Models
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
4.5K
25,255
0
19 Jun 2020
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
Douwe Kiela
Hamed Firooz
Aravind Mohan
Vedanuj Goswami
Amanpreet Singh
Pratik Ringshia
Davide Testuggine
318
753
0
10 May 2020
Decision-Making with Auto-Encoding Variational Bayes
Decision-Making with Auto-Encoding Variational BayesNeural Information Processing Systems (NeurIPS), 2020
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
1.5K
19,430
0
17 Feb 2020
Poison Frogs! Targeted Clean-Label Poisoning Attacks on Neural Networks
Poison Frogs! Targeted Clean-Label Poisoning Attacks on Neural Networks
Ali Shafahi
Wenjie Huang
Mahyar Najibi
Octavian Suciu
Christoph Studer
Tudor Dumitras
Tom Goldstein
AAML
517
1,195
0
03 Apr 2018
Manipulating Machine Learning: Poisoning Attacks and Countermeasures for
  Regression Learning
Manipulating Machine Learning: Poisoning Attacks and Countermeasures for Regression Learning
Matthew Jagielski
Alina Oprea
Battista Biggio
Chang-rui Liu
Cristina Nita-Rotaru
Yue Liu
AAML
292
832
0
01 Apr 2018
Technical Report: When Does Machine Learning FAIL? Generalized
  Transferability for Evasion and Poisoning Attacks
Technical Report: When Does Machine Learning FAIL? Generalized Transferability for Evasion and Poisoning Attacks
Octavian Suciu
R. Marginean
Yigitcan Kaya
Hal Daumé
Tudor Dumitras
AAML
287
308
0
19 Mar 2018
Machine Learning Models that Remember Too Much
Machine Learning Models that Remember Too Much
Congzheng Song
Thomas Ristenpart
Vitaly Shmatikov
VLM
165
555
0
22 Sep 2017
BadNets: Identifying Vulnerabilities in the Machine Learning Model
  Supply Chain
BadNets: Identifying Vulnerabilities in the Machine Learning Model Supply Chain
Tianyu Gu
Brendan Dolan-Gavitt
S. Garg
SILM
534
2,011
0
22 Aug 2017
Understanding Black-box Predictions via Influence Functions
Understanding Black-box Predictions via Influence Functions
Pang Wei Koh
Abigail Z. Jacobs
TDI
486
3,272
0
14 Mar 2017
12
Next