ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.15518
  4. Cited By
ReCo: Region-Controlled Text-to-Image Generation

ReCo: Region-Controlled Text-to-Image Generation

23 November 2022
Zhengyuan Yang
Jianfeng Wang
Zhe Gan
Linjie Li
Kevin Qinghong Lin
Chenfei Wu
Nan Duan
Zicheng Liu
Ce Liu
Michael Zeng
Lijuan Wang
    DiffM
ArXivPDFHTML

Papers citing "ReCo: Region-Controlled Text-to-Image Generation"

50 / 125 papers shown
Title
Lay-Your-Scene: Natural Scene Layout Generation with Diffusion Transformers
Lay-Your-Scene: Natural Scene Layout Generation with Diffusion Transformers
Divyansh Srivastava
Xiang Zhang
He Wen
Chenru Wen
Zhuowen Tu
DiffM
26
0
0
07 May 2025
MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation
MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation
Mingcheng Li
Xiaolu Hou
Ziyang Liu
Dingkang Yang
Ziyun Qian
Jiawei Chen
Jinjie Wei
Y. Jiang
Qingyao Xu
L. Zhang
DiffM
46
0
0
05 May 2025
DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks
DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks
Yinqi Li
Hong Chang
Ruibing Hou
Shiguang Shan
Xilin Chen
DiffM
50
0
0
24 Apr 2025
Hierarchical and Step-Layer-Wise Tuning of Attention Specialty for Multi-Instance Synthesis in Diffusion Transformers
Hierarchical and Step-Layer-Wise Tuning of Attention Specialty for Multi-Instance Synthesis in Diffusion Transformers
Chunyang Zhang
Zhenhong Sun
Zhicheng Zhang
Junyan Wang
Yu Zhang
Dong Gong
H. Mo
Daoyi Dong
33
0
0
14 Apr 2025
Marmot: Multi-Agent Reasoning for Multi-Object Self-Correcting in Improving Image-Text Alignment
Marmot: Multi-Agent Reasoning for Multi-Object Self-Correcting in Improving Image-Text Alignment
Jiayang Sun
H. Wang
Jie Cao
Huaibo Huang
R. He
DiffM
68
0
0
10 Apr 2025
BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation
BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation
Yuyang Peng
Shishi Xiao
Keming Wu
Qisheng Liao
Bohan Chen
Kevin Lin
Danqing Huang
Ji Li
Yuhui Yuan
DiffM
66
1
0
26 Mar 2025
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
Dewei Zhou
Mingwei Li
Zongxin Yang
Yi Yang
87
0
0
17 Mar 2025
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
Tsu-jui Fu
Yusu Qian
Chen Chen
Wenze Hu
Zhe Gan
Y. Yang
85
1
0
16 Mar 2025
Investigating and Improving Counter-Stereotypical Action Relation in Text-to-Image Diffusion Models
Sina Malakouti
Adriana Kovashka
EGVM
62
0
0
13 Mar 2025
Consistent Image Layout Editing with Diffusion Models
Tao Xia
Yudi Zhang
Ting Liu Lei Zhang
DiffM
49
1
0
09 Mar 2025
PixelPonder: Dynamic Patch Adaptation for Enhanced Multi-Conditional Text-to-Image Generation
Yanjie Pan
Q. He
Zhengkai Jiang
P. Xu
Chaoyi Wang
...
Yun Cao
Zhenye Gan
M. Chi
Bo Peng
Y. Wang
DiffM
61
0
0
09 Mar 2025
Fine-Grained Alignment and Noise Refinement for Compositional Text-to-Image Generation
Amir Mohammad Izadi
Seyed Mohsen Hosseini
Soroush Vafaie Tabar
Ali Abdollahi
Armin Saghafian
M. Baghshah
EGVM
37
0
0
09 Mar 2025
Replicating Human Social Perception in Generative AI: Evaluating the Valence-Dominance Model
Necdet Gurkan
Kimathi Njoki
Jordan W. Suchow
41
0
0
05 Mar 2025
StageDesigner: Artistic Stage Generation for Scenography via Theater Scripts
Zhaoxing Gan
Mengtian Li
Ruhua Chen
Zhongxia Ji
Sichen Guo
Huanling Hu
Guangnan Ye
Zuo Hu
DiffM
VGen
80
0
0
04 Mar 2025
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation
Yifan Pu
Yiming Zhao
Zhicong Tang
Ruihong Yin
Haoxing Ye
...
Ji Li
Xiu Li
Z. Lian
Gao Huang
Baining Guo
DiffM
62
1
0
25 Feb 2025
ComposeAnyone: Controllable Layout-to-Human Generation with Decoupled Multimodal Conditions
ComposeAnyone: Controllable Layout-to-Human Generation with Decoupled Multimodal Conditions
Shiyue Zhang
Zheng Chong
Xi Lu
Wenqing Zhang
Haoxiang Li
Xujie Zhang
Jiehui Huang
Xiao Dong
Xiaodan Liang
DiffM
40
0
0
21 Jan 2025
Grounding Text-to-Image Diffusion Models for Controlled High-Quality Image Generation
Grounding Text-to-Image Diffusion Models for Controlled High-Quality Image Generation
Ahmad Süleyman
Göksel Biricik
41
2
0
15 Jan 2025
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance
Dongmin Park
Sebin Kim
Taehong Moon
Minkyu Kim
Kangwook Lee
Jaewoong Cho
DiffM
CoGe
62
2
0
08 Jan 2025
MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing
MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing
Chuang Yang
Bingxuan Zhao
Qing Zhou
Qi Wang
76
1
0
18 Dec 2024
Mojito: Motion Trajectory and Intensity Control for Video Generation
Mojito: Motion Trajectory and Intensity Control for Video Generation
Xuehai He
Shuohang Wang
Jianwei Yang
Xiaoxia Wu
Y. Wang
Kuan-Chieh Jackson Wang
Z. Zhan
Olatunji Ruwase
Yelong Shen
X. Wang
VGen
83
1
0
12 Dec 2024
DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Q. He
Jinlong Peng
P. Xu
Boyuan Jiang
Xiaobin Hu
...
Y. Liu
Y. Wang
Chengjie Wang
X. Li
J. Zhang
DiffM
120
1
0
04 Dec 2024
SimuScope: Realistic Endoscopic Synthetic Dataset Generation through
  Surgical Simulation and Diffusion Models
SimuScope: Realistic Endoscopic Synthetic Dataset Generation through Surgical Simulation and Diffusion Models
Sabina Martyniak
Joanna Kaleta
Diego DallÁlba
Michał Naskręt
Szymon Płotka
Przemysław Korzeniowski
MedIm
66
0
0
03 Dec 2024
Imagine and Seek: Improving Composed Image Retrieval with an Imagined
  Proxy
Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy
Y. Li
Fan Ma
Yi Yang
133
2
0
24 Nov 2024
AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation
AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation
Datao Tang
Xiangyong Cao
Xuan Wu
Jialin Li
Jing Yao
Xueru Bai
Deyu Meng
Yin Li
Deyu Meng
DiffM
72
5
0
23 Nov 2024
TKG-DM: Training-free Chroma Key Content Generation Diffusion Model
TKG-DM: Training-free Chroma Key Content Generation Diffusion Model
Ryugo Morita
Stanislav Frolov
Brian B. Moser
Takahiro Shirakawa
Ko Watanabe
Andreas Dengel
Jinjia Zhou
DiffM
77
0
0
23 Nov 2024
Generating Compositional Scenes via Text-to-image RGBA Instance Generation
Alessandro Fontanella
Petru-Daniel Tudosiu
Yongxin Yang
Shifeng Zhang
Sarah Parisot
21
1
0
16 Nov 2024
Boundary Attention Constrained Zero-Shot Layout-To-Image Generation
Boundary Attention Constrained Zero-Shot Layout-To-Image Generation
Huancheng Chen
Jingtao Li
Weiming Zhuang
H. Vikalo
Lingjuan Lyu
DiffM
24
0
0
15 Nov 2024
Token Merging for Training-Free Semantic Binding in Text-to-Image
  Synthesis
Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis
Taihang Hu
Linxuan Li
Joost van de Weijer
Hongcheng Gao
Fahad Shahbaz Khan
Jian Yang
Ming-Ming Cheng
Kai Wang
Yaxing Wang
DiffM
43
4
0
11 Nov 2024
Layout Control and Semantic Guidance with Attention Loss Backward for
  T2I Diffusion Model
Layout Control and Semantic Guidance with Attention Loss Backward for T2I Diffusion Model
Guandong Li
DiffM
20
0
0
11 Nov 2024
Region-Aware Text-to-Image Generation via Hard Binding and Soft
  Refinement
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement
Zhennan Chen
Yajie Li
Haofan Wang
Z. Chen
Zhengkai Jiang
Jun Yu Li
Qian Wang
Jian Yang
Ying Tai
DiffM
47
8
0
10 Nov 2024
Attention Overlap Is Responsible for The Entity Missing Problem in Text-to-image Diffusion Models!
Attention Overlap Is Responsible for The Entity Missing Problem in Text-to-image Diffusion Models!
Arash Marioriyad
Mohammadali Banayeeanzade
Reza Abbasi
M. Rohban
M. Baghshah
DiffM
63
3
0
28 Oct 2024
3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image
  Generation
3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation
Dewei Zhou
Ji Xie
Zongxin Yang
Yi Yang
DiffM
62
6
0
16 Oct 2024
SceneCraft: Layout-Guided 3D Scene Generation
SceneCraft: Layout-Guided 3D Scene Generation
Xiuyu Yang
Yunze Man
Jun-Kun Chen
Yu-Xiong Wang
3DV
74
8
0
11 Oct 2024
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
Xinchen Zhang
Ling Yang
G. Li
Yaqi Cai
Jiake Xie
Yong Tang
Yujiu Yang
Mengdi Wang
Bin Cui
EGVM
CoGe
28
5
0
09 Oct 2024
Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image
  Generation
Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation
Abdelrahman Eldesokey
Peter Wonka
DiffM
25
1
0
27 Aug 2024
Draw Like an Artist: Complex Scene Generation with Diffusion Model via
  Composition, Painting, and Retouching
Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching
Minghao Liu
Le Zhang
Yingjie Tian
Xiaochao Qu
Luoqi Liu
Ting Liu
DiffM
CoGe
18
2
0
25 Aug 2024
Scene123: One Prompt to 3D Scene Generation via Video-Assisted and
  Consistency-Enhanced MAE
Scene123: One Prompt to 3D Scene Generation via Video-Assisted and Consistency-Enhanced MAE
Yiying Yang
Fukun Yin
Jiayuan Fan
Xin Chen
Wanzhang Li
Gang Yu
VGen
44
0
0
10 Aug 2024
REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language
  Models
REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models
Agneet Chatterjee
Yiran Luo
Tejas Gokhale
Yezhou Yang
Chitta Baral
LRM
22
5
0
05 Aug 2024
Bayesian Active Learning for Semantic Segmentation
Bayesian Active Learning for Semantic Segmentation
Sima Didari
Wenjun Hu
Jae Oh Woo
Heng Hao
Hankyu Moon
Seungjai Min
31
0
0
03 Aug 2024
EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos
EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos
Aashish Rai
Srinath Sridhar
DiffM
34
4
0
30 Jul 2024
DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion
  Models
DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models
Jing Yang
Runping Xi
Yingxin Lai
Xun Lin
Zitong Yu
DiffM
24
1
0
29 Jul 2024
LSReGen: Large-Scale Regional Generator via Backward Guidance Framework
LSReGen: Large-Scale Regional Generator via Backward Guidance Framework
Bowen Zhang
Cheng Yang
Xuanhui Liu
DiffM
14
0
0
21 Jul 2024
The Fabrication of Reality and Fantasy: Scene Generation with
  LLM-Assisted Prompt Interpretation
The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation
Yi Yao
Chan-Feng Hsu
Jhe-Hao Lin
Hongxia Xie
Terence Lin
Yi-Ning Huang
Hong-Han Shuai
Wen-Huang Cheng
DiffM
21
4
0
17 Jul 2024
PartCraft: Crafting Creative Objects by Parts
PartCraft: Crafting Creative Objects by Parts
Kam Woh Ng
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
27
6
0
05 Jul 2024
Boosting Consistency in Story Visualization with Rich-Contextual
  Conditional Diffusion Models
Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models
Fei Shen
Hu Ye
Sibo Liu
Jun Zhang
Cong Wang
Xiao Han
Wei Yang
85
33
0
02 Jul 2024
GVDIFF: Grounded Text-to-Video Generation with Diffusion Models
GVDIFF: Grounded Text-to-Video Generation with Diffusion Models
Huanzhang Dou
Ruixiang Li
Wei Su
Xi Li
DiffM
21
1
0
02 Jul 2024
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
Dewei Zhou
Y. Li
Fan Ma
Zongxin Yang
Y. Yang
85
11
0
02 Jul 2024
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
Yicheng Chen
Xiangtai Li
Yining Li
Yanhong Zeng
Jianzong Wu
Xiangyu Zhao
Kai Chen
VLM
DiffM
52
3
0
28 Jun 2024
AnyControl: Create Your Artwork with Versatile Control on Text-to-Image
  Generation
AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation
Yanan Sun
Yanchen Liu
Yinhao Tang
Wenjie Pei
Kai Chen
DiffM
19
1
0
27 Jun 2024
Poetry2Image: An Iterative Correction Framework for Images Generated
  from Chinese Classical Poetry
Poetry2Image: An Iterative Correction Framework for Images Generated from Chinese Classical Poetry
Jing Jiang
Yiran Ling
Binzhu Li
Pengxiang Li
Junming Piao
Yu Zhang
EGVM
DiffM
27
1
0
15 Jun 2024
123
Next