Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models

31 January 2023
Hila Chefer
Yuval Alaluf
Yael Vinker
Lior Wolf
Daniel Cohen-Or
    DiffM

Papers citing "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models"

50 / 403 papers shown
Idea-2-3D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
Junhao Chen
Xiang Li
Xiaojun Ye
Chao Li
Zhaoxin Fan
Hao Zhao
VGen
3DV
197
4
0
05 Apr 2024
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Dongzhi Jiang
Guanglu Song
Xiaoshi Wu
Renrui Zhang
Dazhong Shen
Zhuofan Zong
Yu Liu
Hongsheng Li
VLM
28
20
0
04 Apr 2024
Diverse and Tailored Image Generation for Zero-shot Multi-label Classification
Kai Zhang
Zhixiang Yuan
Tao Huang
VLM
26
4
0
04 Apr 2024
Faster Diffusion via Temporal Attention Decomposition
Haozhe Liu
Wentian Zhang
Jinheng Xie
Francesco Faccio
Mengmeng Xu
Tao Xiang
Mike Zheng Shou
Juan-Manuel Perez-Rua
Jürgen Schmidhuber
DiffM
64
19
0
03 Apr 2024
CosmicMan: A Text-to-Image Foundation Model for Humans
Shikai Li
Jianglin Fu
Kaiyuan Liu
Wentao Wang
Kwan-Yee Lin
Wayne Wu
DiffM
35
18
0
01 Apr 2024
Getting it Right: Improving Spatial Consistency in Text-to-Image Models
Agneet Chatterjee
Gabriela Ben-Melech Stan
Estelle Aflalo
Sayak Paul
Dhruba Ghosh
...
Ludwig Schmidt
Hanna Hajishirzi
Vasudev Lal
Chitta Baral
Yezhou Yang
EGVM
VLM
57
14
0
01 Apr 2024
Relation Rectification in Diffusion Model
Yinwei Wu
Xingyi Yang
Xinchao Wang
28
6
0
29 Mar 2024
Motion Inversion for Video Customization
Luozhou Wang
Guibao Shen
Yixun Liang
Xin Tao
Pengfei Wan
Di Zhang
Yijun Li
Yingcong Chen
VGen
DiffM
32
7
0
29 Mar 2024
CLoRA: A Contrastive Approach to Compose Multiple LoRA Models
Tuna Han Salih Meral
Enis Simsar
Federico Tombari
Pinar Yanardag
MoMe
21
0
0
28 Mar 2024
FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image Editing
Trong-Tung Nguyen
Duc A. Nguyen
Anh Tran
Cuong Pham
DiffM
29
7
0
27 Mar 2024
Attention Calibration for Disentangled Text-to-Image Personalization
Yanbing Zhang
Mengping Yang
Qin Zhou
Zhe Wang
20
15
0
27 Mar 2024
ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis
Muhammad Hamza Mughal
Rishabh Dabral
I. Habibie
Lucia Donatelli
Marc Habermann
Christian Theobalt
SLR
30
14
0
26 Mar 2024
Improving Text-to-Image Consistency via Automatic Prompt Optimization
Oscar Manas
Pietro Astolfi
Melissa Hall
Candace Ross
Jack Urbanek
Adina Williams
Aishwarya Agrawal
Adriana Romero Soriano
M. Drozdzal
29
26
0
26 Mar 2024
Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation
Omer Dahary
Or Patashnik
Kfir Aberman
Daniel Cohen-Or
DiffM
19
27
0
25 Mar 2024
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions
S. A. Baumann
Felix Krause
Michael Neumayr
Nick Stracke
Vincent Tao Hu
Björn Ommer
DiffM
LM&Ro
66
11
0
25 Mar 2024
EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing
Xiangpeng Yang
Linchao Zhu
Hehe Fan
Yi Yang
DiffM
VGen
12
9
0
24 Mar 2024
Selectively Informative Description can Reduce Undesired Embedding Entanglements in Text-to-Image Personalization
Jimyeong Kim
Jungwon Park
Wonjong Rhee
DiffM
17
5
0
22 Mar 2024
DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing
Yueru Jia
Yuhui Yuan
Aosong Cheng
Chuke Wang
Ji Li
Huizhu Jia
Shanghang Zhang
DiffM
23
7
0
21 Mar 2024
Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models
Pablo Marcos-Manchón
Roberto Alcover-Couso
Juan C. Sanmiguel
Jose M. Martínez
VLM
37
18
0
21 Mar 2024
VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis
Yumeng Li
William H. Beluch
M. Keuper
Dan Zhang
Anna Khoreva
DiffM
VGen
71
5
0
20 Mar 2024
Tuning-Free Image Customization with Image and Text Guidance
Pengzhi Li
Qiang Nie
Ying Chen
Xi Jiang
Kai Wu
Yuhuan Lin
Yong-Jin Liu
Jinlong Peng
Chengjie Wang
Feng Zheng
DiffM
17
19
0
19 Mar 2024
One-Step Image Translation with Text-to-Image Models
Gaurav Parmar
Taesung Park
Srinivasa Narasimhan
Jun-Yan Zhu
32
44
0
18 Mar 2024
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models
Yang Yang
Wen Wang
Liang Peng
Chaotian Song
Yao Chen
...
Xiaolong Yang
Qinglin Lu
Deng Cai
Boxi Wu
Wei Liu
MoMe
60
24
0
18 Mar 2024
Unveiling and Mitigating Memorization in Text-to-image Diffusion Models through Cross Attention
Jie Ren
Yaxin Li
Shenglai Zeng
Han Xu
Lingjuan Lyu
Yue Xing
Jiliang Tang
25
25
0
17 Mar 2024
Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing
Wonjun Kang
Kevin Galim
Hyung Il Koo
DiffM
26
5
0
14 Mar 2024
iCONTRA: Toward Thematic Collection Design Via Interactive Concept Transfer
Dinh-Khoi Vo
Duy-Nam Ly
Khanh-Duy Le
Tam V. Nguyen
Minh-Triet Tran
Trung-Truc Huynh-Le
32
0
0
13 Mar 2024
Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model
Yuxuan Zhang
Lifu Wei
Qing Zhang
Yiren Song
DiffM
28
12
0
12 Mar 2024
DivCon: Divide and Conquer for Progressive Text-to-Image Generation
Yuhao Jia
Wenhan Tan
DiffM
31
1
0
11 Mar 2024
Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models
Yang Zhang
Teoh Tze Tzun
Lim Wei Hern
Tiviatis Sim
Kenji Kawaguchi
DiffM
24
9
0
11 Mar 2024
MACE: Mass Concept Erasure in Diffusion Models
Shilin Lu
Zilan Wang
Leyang Li
Yanzhu Liu
A. Kong
DiffM
25
75
0
10 Mar 2024
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Xiwei Hu
Rui Wang
Yixiao Fang
Bin-Bin Fu
Pei Cheng
Gang Yu
VLM
57
39
0
08 Mar 2024
PrimeComposer: Faster Progressively Combined Diffusion for Image Composition with Attention Steering
Yibin Wang
Weizhong Zhang
Jianwei Zheng
Cheng Jin
DiffM
66
9
0
08 Mar 2024
Discriminative Probing and Tuning for Text-to-Image Generation
Leigang Qu
Wenjie Wang
Yongqi Li
Hanwang Zhang
Liqiang Nie
Tat-Seng Chua
31
7
0
07 Mar 2024
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Pu Cao
Feng Zhou
Qing-Huang Song
Lu Yang
67
35
0
07 Mar 2024
Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset
Ander Salaberria
Gorka Azkune
Oier López de Lacalle
A. Soroa
Eneko Agirre
Frank Keller
EGVM
19
2
0
01 Mar 2024
RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization
Mengqi Huang
Zhendong Mao
Mingcong Liu
Qian He
Yongdong Zhang
DiffM
33
21
0
01 Mar 2024
LoMOE: Localized Multi-Object Editing via Multi-Diffusion
Goirik Chakrabarty
Aditya Chandrasekar
Ramya Hebbalaguppe
AP Prathosh
DiffM
43
6
0
01 Mar 2024
Box It to Bind It: Unified Layout Control and Attribute Binding in T2I Diffusion Models
Ashkan Taghipour
Morteza Ghahremani
Bennamoun
Aref Miri Rekavandi
Hamid Laga
F. Boussaïd
DiffM
22
5
0
27 Feb 2024
CustomSketching: Sketch Concept Extraction for Sketch-based Image Synthesis and Editing
Chufeng Xiao
Hongbo Fu
DiffM
23
3
0
27 Feb 2024
Referee Can Play: An Alternative Approach to Conditional Generation via Model Inversion
Xuantong Liu
Tianyang Hu
Wenjia Wang
Kenji Kawaguchi
Yuan Yao
DiffM
44
3
0
26 Feb 2024
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition
Chun-Hsiao Yeh
Ta-Ying Cheng
He-Yen Hsieh
Chuan-En Lin
Yi Ma
Andrew Markham
Niki Trigoni
H. T. Kung
Yubei Chen
DiffM
19
3
0
23 Feb 2024
Consolidating Attention Features for Multi-view Image Editing
Or Patashnik
Rinon Gal
Daniel Cohen-Or
Jun-Yan Zhu
Fernando de la Torre
32
6
0
22 Feb 2024
Layout-to-Image Generation with Localized Descriptions using ControlNet with Cross-Attention Control
Denis Lukovnikov
Asja Fischer
DiffM
22
3
0
20 Feb 2024
RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models
Xinchen Zhang
Ling Yang
Yaqi Cai
Zhaochen Yu
Kai-Ni Wang
...
Ye Tian
Minkai Xu
Yong Tang
Yujiu Yang
Bin Cui
DiffM
27
5
0
20 Feb 2024
ComFusion: Personalized Subject Generation in Multiple Specific Scenes From Single Image
Yan Hong
Jianfu Zhang
DiffM
18
3
0
19 Feb 2024
Visual Concept-driven Image Generation with Text-to-Image Diffusion Model
Tanzila Rahman
Shweta Mahajan
Hsin-Ying Lee
Jian Ren
Sergey Tulyakov
Leonid Sigal
72
4
0
18 Feb 2024
Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation
Junjie Shentu
Matthew Watson
Noura Al Moubayed
15
0
0
15 Feb 2024
L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects
Yutaro Yamada
Khyathi Raghavi Chandu
Yuchen Lin
Jack Hessel
Ilker Yildirim
Yejin Choi
AI4CE
12
12
0
14 Feb 2024
MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis
Dewei Zhou
You Li
Fan Ma
Zongxin Yang
Yi Yang
DiffM
18
57
0
08 Feb 2024
Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models
Senmao Li
J. Weijer
Taihang Hu
Fahad Shahbaz Khan
Qibin Hou
Yaxing Wang
Jian Yang
DiffM
38
27
0
08 Feb 2024