Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2301.13826
Cited By
Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models
31 January 2023
Hila Chefer
Yuval Alaluf
Yael Vinker
Lior Wolf
Daniel Cohen-Or
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models"
50 / 403 papers shown
Title
MCGM: Mask Conditional Text-to-Image Generative Model
Rami Skaik
Leonardo Rossi
Tomaso Fontanini
Andrea Prati
DiffM
28
0
0
01 Oct 2024
Scene Graph Disentanglement and Composition for Generalizable Complex Image Generation
Yunnan Wang
Ziqiang Li
Zequn Zhang
Wenyao Zhang
Baao Xie
Xihui Liu
Wenjun Zeng
Xin Jin
CoGe
DiffM
21
2
0
01 Oct 2024
A Cat Is A Cat (Not A Dog!): Unraveling Information Mix-ups in Text-to-Image Encoders through Causal Analysis and Embedding Optimization
Chieh-Yun Chen
Chiang Tseng
Li-Wu Tsao
Hong-Han Shuai
11
7
0
01 Oct 2024
Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function
Chenyi Zhuang
Ying Hu
Pan Gao
DiffM
VLM
33
12
0
30 Sep 2024
ABHINAW: A method for Automatic Evaluation of Typography within AI-Generated Images
Abhinaw Jagtap
Nachiket Tapas
R. G. Brajesh
EGVM
18
0
0
18 Sep 2024
Optimizing Resource Consumption in Diffusion Models through Hallucination Early Detection
Federico Betti
Lorenzo Baraldi
Lorenzo Baraldi
Rita Cucchiara
N. Sebe
DiffM
26
0
0
16 Sep 2024
Data Augmentation via Latent Diffusion for Saliency Prediction
Bahar Aydemir
Deblina Bhattacharjee
Tong Zhang
Mathieu Salzmann
Sabine Süsstrunk
18
1
0
11 Sep 2024
SPDiffusion: Semantic Protection Diffusion Models for Multi-concept Text-to-image Generation
Yang Zhang
Rui Zhang
Xuecheng Nie
Haochen Li
Jikun Chen
Yifan Hao
Xin Zhang
Luoqi Liu
Ling Li
36
0
0
02 Sep 2024
Training-Free Sketch-Guided Diffusion with Latent Optimization
Sandra Zhang Ding
Jiafeng Mao
Kiyoharu Aizawa
DiffM
86
1
0
31 Aug 2024
ConceptMix: A Compositional Image Generation Benchmark with Controllable Difficulty
Xindi Wu
Dingli Yu
Yangsibo Huang
Olga Russakovsky
Sanjeev Arora
CoGe
EGVM
39
12
0
26 Aug 2024
SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher
T. Dao
Thuan Hoang Nguyen
T. Le
D. Vu
Khoi Nguyen
Cuong Pham
Anh Tran
DiffM
29
11
0
26 Aug 2024
Foodfusion: A Novel Approach for Food Image Composition via Diffusion Models
Chaohua Shi
Xuan Wang
Si Shi
Xule Wang
Mingrui Zhu
Nannan Wang
X. Gao
CoGe
22
1
0
26 Aug 2024
Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching
Minghao Liu
Le Zhang
Yingjie Tian
Xiaochao Qu
Luoqi Liu
Ting Liu
DiffM
CoGe
24
2
0
25 Aug 2024
Latent Space Disentanglement in Diffusion Transformers Enables Zero-shot Fine-grained Semantic Editing
Zitao Shuai
Chenwei Wu
Zhengxu Tang
Bowen Song
Liyue Shen
33
0
0
23 Aug 2024
Diffusion-Based Visual Art Creation: A Survey and New Perspectives
Bingyuan Wang
Qifeng Chen
Zeyu Wang
41
7
0
22 Aug 2024
FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting
Liyao Jiang
Negar Hassanpour
Mohammad Salameh
Mohan Sai Singamsetti
Fengyu Sun
Wei Lu
Di Niu
DiffM
72
2
0
21 Aug 2024
Latent Feature and Attention Dual Erasure Attack against Multi-View Diffusion Models for 3D Assets Protection
Jingwei Sun
Xuchong Zhang
Changfeng Sun
Qicheng Bai
Hongbin Sun
AAML
DiffM
30
0
0
21 Aug 2024
Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion
Adi Haviv
Shahar Sarfaty
Uri Y. Hacohen
N. Elkin-Koren
Roi Livni
Amit H. Bermano
27
2
0
15 Aug 2024
MagicFace: Training-free Universal-Style Human Image Customized Synthesis
Yibin Wang
Weizhong Zhang
Cheng Jin
DiffM
29
3
0
14 Aug 2024
REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models
Agneet Chatterjee
Yiran Luo
Tejas Gokhale
Yezhou Yang
Chitta Baral
LRM
22
5
0
05 Aug 2024
Few-shot Defect Image Generation based on Consistency Modeling
Qingfeng Shi
Jing Wei
Fei Shen
Zheng Zhang
32
2
0
01 Aug 2024
Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-Attentions
Ashkan Taghipour
Morteza Ghahremani
Bennamoun
Aref Miri Rekavandi
Zinuo Li
Hamid Laga
F. Boussaïd
VGen
68
2
0
27 Jul 2024
AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild
Jun-Young Park
Kyeongbo Kong
Suk-ju Kang
DiffM
24
2
0
25 Jul 2024
ReCorD: Reasoning and Correcting Diffusion for HOI Generation
Jian-Yu Jiang-Lin
Kang-Yang Huang
Ling Lo
Yi-Ning Huang
Terence Lin
Jhih-Ciang Wu
Hong-Han Shuai
Wen-Huang Cheng
DiffM
24
5
0
25 Jul 2024
PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control
Rishubh Parihar
VS Sachidanand
Sabariswaran Mani
Tejan Karmali
R. V. Babu
DiffM
31
13
0
24 Jul 2024
Text2Place: Affordance-aware Text Guided Human Placement
Rishubh Parihar
Harsh Gupta
VS Sachidanand
R. V. Babu
DiffM
34
5
0
22 Jul 2024
Chronologically Accurate Retrieval for Temporal Grounding of Motion-Language Models
Kent Fujiwara
Mikihiro Tanaka
Qing Yu
34
2
0
22 Jul 2024
LSReGen: Large-Scale Regional Generator via Backward Guidance Framework
Bowen Zhang
Cheng Yang
Xuanhui Liu
DiffM
14
0
0
21 Jul 2024
AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement
Yunlong Lin
Tian-Chun Ye
Sixiang Chen
Zhenqi Fu
Yingying Wang
Wenhao Chai
Zhaohu Xing
Lei Zhu
Xinghao Ding
DiffM
34
4
0
20 Jul 2024
Not All Noises Are Created Equally:Diffusion Noise Selection and Optimization
Zipeng Qi
Lichen Bai
Haoyi Xiong
Zeke Xie
DiffM
33
17
0
19 Jul 2024
Training-free Composite Scene Generation for Layout-to-Image Synthesis
Jiaqi Liu
Tao Huang
Chang Xu
DiffM
22
5
0
18 Jul 2024
The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation
Yi Yao
Chan-Feng Hsu
Jhe-Hao Lin
Hongxia Xie
Terence Lin
Yi-Ning Huang
Hong-Han Shuai
Wen-Huang Cheng
DiffM
24
4
0
17 Jul 2024
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Huiguo He
Huan Yang
Zixi Tuo
Yuan Zhou
Qiuyue Wang
Yuhang Zhang
Zeyu Liu
Wenhao Huang
Hongyang Chao
Jian Yin
DiffM
VGen
52
11
0
17 Jul 2024
Adversarial Attacks and Defenses on Text-to-Image Diffusion Models: A Survey
Chenyu Zhang
Mingwang Hu
Wenhui Li
Lanjun Wang
37
13
0
10 Jul 2024
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
Wanggui He
Siming Fu
Mushui Liu
Xierui Wang
Wenyi Xiao
...
Zhelun Yu
Haoyuan Li
Ziwei Huang
Leilei Gan
Hao Jiang
DiffM
24
23
0
10 Jul 2024
Sketch-Guided Scene Image Generation
Tianyu Zhang
Xiaoxuan Xie
Xusheng Du
H. Xie
DiffM
30
2
0
09 Jul 2024
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
Zhenyu Wang
Aoxue Li
Zhenguo Li
Xihui Liu
MLLM
DiffM
36
25
0
08 Jul 2024
Replication in Visual Diffusion Models: A Survey and Outlook
Wenhao Wang
Yifan Sun
Zongxin Yang
Zhengdong Hu
Zhentao Tan
Yi Yang
63
6
0
07 Jul 2024
PartCraft: Crafting Creative Objects by Parts
Kam Woh Ng
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
27
6
0
05 Jul 2024
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
Dewei Zhou
Y. Li
Fan Ma
Zongxin Yang
Y. Yang
88
11
0
02 Jul 2024
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
Yicheng Chen
Xiangtai Li
Yining Li
Yanhong Zeng
Jianzong Wu
Xiangyu Zhao
Kai Chen
VLM
DiffM
54
3
0
28 Jun 2024
AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models
Aishwarya Agarwal
Srikrishna Karanam
Balaji Vasan Srinivasan
21
1
0
27 Jun 2024
ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance
Shuwei Shi
Wenbo Li
Yuechen Zhang
Jingwen He
Biao Gong
Yinqiang Zheng
46
10
0
24 Jun 2024
Fantastic Copyrighted Beasts and How (Not) to Generate Them
Luxi He
Yangsibo Huang
Weijia Shi
Tinghao Xie
Haotian Liu
Yue Wang
Luke Zettlemoyer
Chiyuan Zhang
Danqi Chen
Peter Henderson
39
9
0
20 Jun 2024
Composing Object Relations and Attributes for Image-Text Matching
Khoi Pham
Chuong Huynh
Ser-Nam Lim
Abhinav Shrivastava
CoGe
25
3
0
17 Jun 2024
Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
Jiho Choi
Seonho Lee
Seungho Lee
Minhyun Lee
Hyunjung Shim
OCL
33
0
0
17 Jun 2024
Self-Supervised Vision Transformer for Enhanced Virtual Clothes Try-On
Lingxiao Lu
Shengyi Wu
Haoxuan Sun
Junhong Gou
Jianlou Si
Chen Qian
Jianfu Zhang
Liqing Zhang
ViT
DiffM
34
0
0
15 Jun 2024
Make It Count: Text-to-Image Generation with an Accurate Number of Objects
Lital Binyamin
Yoad Tewel
Hilit Segev
Eran Hirsch
Royi Rassin
Gal Chechik
24
6
0
14 Jun 2024
Crafting Parts for Expressive Object Composition
Harsh Rangwani
Aishwarya Agarwal
Kuldeep Kulkarni
R. Venkatesh Babu
Srikrishna Karanam
DiffM
30
3
0
14 Jun 2024
Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models
Ziyi Wu
Yulia Rubanova
Rishabh Kabra
Drew A. Hudson
Igor Gilitschenski
Yusuf Aytar
Sjoerd van Steenkiste
Kelsey R. Allen
Thomas Kipf
VGen
DiffM
34
10
0
13 Jun 2024
Previous
1
2
3
4
5
6
7
8
9
Next