Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2301.13826
Cited By
Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models
31 January 2023
Hila Chefer
Yuval Alaluf
Yael Vinker
Lior Wolf
Daniel Cohen-Or
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models"
50 / 403 papers shown
Title
Idea-2-3D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
Junhao Chen
Xiang Li
Xiaojun Ye
Chao Li
Zhaoxin Fan
Hao Zhao
VGen
3DV
197
4
0
05 Apr 2024
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Dongzhi Jiang
Guanglu Song
Xiaoshi Wu
Renrui Zhang
Dazhong Shen
Zhuofan Zong
Yu Liu
Hongsheng Li
VLM
28
20
0
04 Apr 2024
Diverse and Tailored Image Generation for Zero-shot Multi-label Classification
Kai Zhang
Zhixiang Yuan
Tao Huang
VLM
26
4
0
04 Apr 2024
Faster Diffusion via Temporal Attention Decomposition
Haozhe Liu
Wentian Zhang
Jinheng Xie
Francesco Faccio
Mengmeng Xu
Tao Xiang
Mike Zheng Shou
Juan-Manuel Perez-Rua
Jürgen Schmidhuber
DiffM
64
19
0
03 Apr 2024
CosmicMan: A Text-to-Image Foundation Model for Humans
Shikai Li
Jianglin Fu
Kaiyuan Liu
Wentao Wang
Kwan-Yee Lin
Wayne Wu
DiffM
35
18
0
01 Apr 2024
Getting it Right: Improving Spatial Consistency in Text-to-Image Models
Agneet Chatterjee
Gabriela Ben-Melech Stan
Estelle Aflalo
Sayak Paul
Dhruba Ghosh
...
Ludwig Schmidt
Hanna Hajishirzi
Vasudev Lal
Chitta Baral
Yezhou Yang
EGVM
VLM
57
14
0
01 Apr 2024
Relation Rectification in Diffusion Model
Yinwei Wu
Xingyi Yang
Xinchao Wang
28
6
0
29 Mar 2024
Motion Inversion for Video Customization
Luozhou Wang
Guibao Shen
Yixun Liang
Xin Tao
Pengfei Wan
Di Zhang
Yijun Li
Yingcong Chen
VGen
DiffM
32
7
0
29 Mar 2024
CLoRA: A Contrastive Approach to Compose Multiple LoRA Models
Tuna Han Salih Meral
Enis Simsar
Federico Tombari
Pinar Yanardag
MoMe
21
0
0
28 Mar 2024
FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image Editing
Trong-Tung Nguyen
Duc A. Nguyen
Anh Tran
Cuong Pham
DiffM
29
7
0
27 Mar 2024
Attention Calibration for Disentangled Text-to-Image Personalization
Yanbing Zhang
Mengping Yang
Qin Zhou
Zhe Wang
20
15
0
27 Mar 2024
ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis
Muhammad Hamza Mughal
Rishabh Dabral
I. Habibie
Lucia Donatelli
Marc Habermann
Christian Theobalt
SLR
30
14
0
26 Mar 2024
Improving Text-to-Image Consistency via Automatic Prompt Optimization
Oscar Manas
Pietro Astolfi
Melissa Hall
Candace Ross
Jack Urbanek
Adina Williams
Aishwarya Agrawal
Adriana Romero Soriano
M. Drozdzal
29
26
0
26 Mar 2024
Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation
Omer Dahary
Or Patashnik
Kfir Aberman
Daniel Cohen-Or
DiffM
19
27
0
25 Mar 2024
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions
S. A. Baumann
Felix Krause
Michael Neumayr
Nick Stracke
Vincent Tao Hu
Bjorn Ommer
Björn Ommer
DiffM
LM&Ro
66
11
0
25 Mar 2024
EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing
Xiangpeng Yang
Linchao Zhu
Hehe Fan
Yi Yang
DiffM
VGen
12
9
0
24 Mar 2024
Selectively Informative Description can Reduce Undesired Embedding Entanglements in Text-to-Image Personalization
Jimyeong Kim
Jungwon Park
Wonjong Rhee
DiffM
17
5
0
22 Mar 2024
DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing
Yueru Jia
Yuhui Yuan
Aosong Cheng
Chuke Wang
Ji Li
Huizhu Jia
Shanghang Zhang
DiffM
23
7
0
21 Mar 2024
Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models
Pablo Marcos-Manchón
Roberto Alcover-Couso
Juan C. Sanmiguel
Jose M. Martínez
VLM
37
18
0
21 Mar 2024
VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis
Yumeng Li
William H. Beluch
M. Keuper
Dan Zhang
Anna Khoreva
DiffM
VGen
71
5
0
20 Mar 2024
Tuning-Free Image Customization with Image and Text Guidance
Pengzhi Li
Qiang Nie
Ying Chen
Xi Jiang
Kai Wu
Yuhuan Lin
Yong-Jin Liu
Jinlong Peng
Chengjie Wang
Feng Zheng
DiffM
17
19
0
19 Mar 2024
One-Step Image Translation with Text-to-Image Models
Gaurav Parmar
Taesung Park
Srinivasa Narasimhan
Jun-Yan Zhu
32
44
0
18 Mar 2024
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models
Yang Yang
Wen Wang
Liang Peng
Chaotian Song
Yao Chen
...
Xiaolong Yang
Qinglin Lu
Deng Cai
Boxi Wu
Wei Liu
MoMe
60
24
0
18 Mar 2024
Unveiling and Mitigating Memorization in Text-to-image Diffusion Models through Cross Attention
Jie Ren
Yaxin Li
Shenglai Zeng
Han Xu
Lingjuan Lyu
Yue Xing
Jiliang Tang
25
25
0
17 Mar 2024
Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing
Wonjun Kang
Kevin Galim
Hyung Il Koo
DiffM
26
5
0
14 Mar 2024
iCONTRA: Toward Thematic Collection Design Via Interactive Concept Transfer
Dinh-Khoi Vo
Duy-Nam Ly
Khanh-Duy Le
Tam V. Nguyen
Minh-Triet Tran
Trung-Truc Huynh-Le
32
0
0
13 Mar 2024
Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model
Yuxuan Zhang
Lifu Wei
Qing Zhang
Yiren Song
DiffM
28
12
0
12 Mar 2024
DivCon: Divide and Conquer for Progressive Text-to-Image Generation
Yuhao Jia
Wenhan Tan
DiffM
31
1
0
11 Mar 2024
Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models
Yang Zhang
Teoh Tze Tzun
Lim Wei Hern
Tiviatis Sim
Kenji Kawaguchi
DiffM
24
9
0
11 Mar 2024
MACE: Mass Concept Erasure in Diffusion Models
Shilin Lu
Zilan Wang
Leyang Li
Yanzhu Liu
A. Kong
DiffM
25
75
0
10 Mar 2024
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Xiwei Hu
Rui Wang
Yixiao Fang
Bin-Bin Fu
Pei Cheng
Gang Yu
VLM
57
39
0
08 Mar 2024
PrimeComposer: Faster Progressively Combined Diffusion for Image Composition with Attention Steering
Yibin Wang
Weizhong Zhang
Jianwei Zheng
Cheng Jin
DiffM
66
9
0
08 Mar 2024
Discriminative Probing and Tuning for Text-to-Image Generation
Leigang Qu
Wenjie Wang
Yongqi Li
Hanwang Zhang
Liqiang Nie
Tat-Seng Chua
31
7
0
07 Mar 2024
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Pu Cao
Feng Zhou
Qing-Huang Song
Lu Yang
67
35
0
07 Mar 2024
Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset
Ander Salaberria
Gorka Azkune
Oier López de Lacalle
A. Soroa
Eneko Agirre
Frank Keller
EGVM
19
2
0
01 Mar 2024
RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization
Mengqi Huang
Zhendong Mao
Mingcong Liu
Qian He
Yongdong Zhang
DiffM
33
21
0
01 Mar 2024
LoMOE: Localized Multi-Object Editing via Multi-Diffusion
Goirik Chakrabarty
Aditya Chandrasekar
Ramya Hebbalaguppe
AP Prathosh
DiffM
43
6
0
01 Mar 2024
Box It to Bind It: Unified Layout Control and Attribute Binding in T2I Diffusion Models
Ashkan Taghipour
Morteza Ghahremani
Bennamoun
Aref Miri Rekavandi
Hamid Laga
F. Boussaïd
DiffM
22
5
0
27 Feb 2024
CustomSketching: Sketch Concept Extraction for Sketch-based Image Synthesis and Editing
Chufeng Xiao
Hongbo Fu
DiffM
23
3
0
27 Feb 2024
Referee Can Play: An Alternative Approach to Conditional Generation via Model Inversion
Xuantong Liu
Tianyang Hu
Wenjia Wang
Kenji Kawaguchi
Yuan Yao
DiffM
44
3
0
26 Feb 2024
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition
Chun-Hsiao Yeh
Ta-Ying Cheng
He-Yen Hsieh
Chuan-En Lin
Yi Ma
Andrew Markham
Niki Trigoni
H. T. Kung
Yubei Chen
DiffM
19
3
0
23 Feb 2024
Consolidating Attention Features for Multi-view Image Editing
Or Patashnik
Rinon Gal
Daniel Cohen-Or
Jun-Yan Zhu
Fernando de la Torre
32
6
0
22 Feb 2024
Layout-to-Image Generation with Localized Descriptions using ControlNet with Cross-Attention Control
Denis Lukovnikov
Asja Fischer
DiffM
22
3
0
20 Feb 2024
RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models
Xinchen Zhang
Ling Yang
Yaqi Cai
Zhaochen Yu
Kai-Ni Wang
...
Ye Tian
Minkai Xu
Yong Tang
Yujiu Yang
Bin Cui
DiffM
27
5
0
20 Feb 2024
ComFusion: Personalized Subject Generation in Multiple Specific Scenes From Single Image
Yan Hong
Jianfu Zhang
DiffM
18
3
0
19 Feb 2024
Visual Concept-driven Image Generation with Text-to-Image Diffusion Model
Tanzila Rahman
Shweta Mahajan
Hsin-Ying Lee
Jian Ren
Sergey Tulyakov
Leonid Sigal
72
4
0
18 Feb 2024
Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation
Junjie Shentu
Matthew Watson
Noura Al Moubayed
15
0
0
15 Feb 2024
L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects
Yutaro Yamada
Khyathi Raghavi Chandu
Yuchen Lin
Jack Hessel
Ilker Yildirim
Yejin Choi
AI4CE
12
12
0
14 Feb 2024
MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis
Dewei Zhou
You Li
Fan Ma
Zongxin Yang
Yi Yang
DiffM
18
57
0
08 Feb 2024
Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models
Senmao Li
J. Weijer
Taihang Hu
Fahad Shahbaz Khan
Qibin Hou
Yaxing Wang
Jian Yang
DiffM
38
27
0
08 Feb 2024
Previous
1
2
3
4
5
6
7
8
9
Next