ResearchTrend.AI

arXiv: 2211.15518
ReCo: Region-Controlled Text-to-Image Generation

23 November 2022
Zhengyuan Yang
Jianfeng Wang
Zhe Gan
Linjie Li
Kevin Qinghong Lin
Chenfei Wu
Nan Duan
Zicheng Liu
Ce Liu
Michael Zeng
Lijuan Wang
    DiffM

Papers citing "ReCo: Region-Controlled Text-to-Image Generation"

25 / 125 papers shown
Title
VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning
Han Lin
Abhaysinh Zala
Jaemin Cho
Mohit Bansal
LM&Ro
VGen
DiffM
26
74
0
26 Sep 2023
DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection
Manlin Zhang
Jie Wu
Yuxi Ren
Ming Li
Jie Qin
Xuefeng Xiao
Wei Liu
Rui Wang
Min Zheng
Andy J. Ma
DiffM
15
20
0
07 Sep 2023
A Survey of Diffusion Based Image Generation Models: Issues and Their Solutions
Tianyi Zhang
Zheng Wang
Jin Huang
M. M. Tasnim
Wei Shi
VLM
11
21
0
25 Aug 2023
SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-form Layout-to-Image Generation
Chengyou Jia
Minnan Luo
Zhuohang Dang
Guangwen Dai
Xiaojun Chang
Mengmeng Wang
Jingdong Wang
DiffM
31
13
0
20 Aug 2023
Edit Temporal-Consistent Videos with Image Diffusion Model
Yuan-Zheng Wang
Yong Li
Xiaoya Zhang
Xin Liu
Anbo Dai
Antoni B. Chan
Zhen Cui
DiffM
17
5
0
17 Aug 2023
LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
Binbin Yang
Yinzheng Luo
Ziliang Chen
Guangrun Wang
Xiaodan Liang
Liang Lin
DiffM
11
12
0
13 Aug 2023
BEVControl: Accurately Controlling Street-view Elements with Multi-perspective Consistency via BEV Sketch Layout
Kairui Yang
Enhui Ma
Jibing Peng
Qing-Wu Guo
Di Lin
Kaicheng Yu
DiffM
11
57
0
03 Aug 2023
Visual Instruction Inversion: Image Editing via Visual Prompting
Thao Nguyen
Yuheng Li
Utkarsh Ojha
Yong Jae Lee
DiffM
19
22
0
26 Jul 2023
BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
Jinheng Xie
Yuexiang Li
Yawen Huang
Haozhe Liu
Wentian Zhang
Yefeng Zheng
Mike Zheng Shou
DiffM
20
192
0
20 Jul 2023
Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback
Jaskirat Singh
Liang Zheng
18
18
0
10 Jul 2023
Automating Computational Design with Generative AI
J. Ploennigs
Markus Berger
AI4CE
DiffM
31
2
0
05 Jul 2023
Generate Anything Anywhere in Any Scene
Yuheng Li
Haotian Liu
Yangming Wen
Yong Jae Lee
DiffM
27
12
0
29 Jun 2023
A-STAR: Test-time Attention Segregation and Retention for Text-to-image Synthesis
Aishwarya Agarwal
Srikrishna Karanam
K. J. Joseph
Apoorv Saxena
Koustava Goswami
Balaji Vasan Srinivasan
VLM
DiffM
11
46
0
26 Jun 2023
GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data Generation
Kai Chen
Enze Xie
Zhe Chen
Yibo Wang
Lanqing Hong
Zhenguo Li
Dit-Yan Yeung
DiffM
20
21
0
07 Jun 2023
Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Jinbo Xing
Menghan Xia
Yuxin Liu
Yuechen Zhang
Yong Zhang
...
Haoxin Chen
Xiaodong Cun
Xintao Wang
Ying Shan
T. Wong
VGen
DiffM
28
83
0
01 Jun 2023
Photoswap: Personalized Subject Swapping in Images
Jing Gu
Yilin Wang
Nanxuan Zhao
Tsu-jui Fu
Wei Xiong
...
Zhifei Zhang
He Zhang
Jianming Zhang
HyunJoon Jung
Xin Eric Wang
DiffM
21
37
0
29 May 2023
Towards Language-guided Interactive 3D Generation: LLMs as Layout Interpreter with Generative Feedback
Yiqi Lin
Hao Wu
Ruichen Wang
H. Lu
Xiaodong Lin
Hui Xiong
Lin Wang
3DV
19
12
0
25 May 2023
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models
Weixi Feng
Wanrong Zhu
Tsu-jui Fu
Varun Jampani
Arjun Reddy Akula
Xuehai He
Sugato Basu
X. Wang
William Yang Wang
MLLM
20
160
0
24 May 2023
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models
Long Lian
Boyi Li
Adam Yala
Trevor Darrell
17
150
0
23 May 2023
Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation
Jaemin Cho
Linjie Li
Zhengyuan Yang
Zhe Gan
Lijuan Wang
Mohit Bansal
EGVM
6
5
0
13 Apr 2023
Training-Free Layout Control with Cross-Attention Guidance
Minghao Chen
Iro Laina
Andrea Vedaldi
DiffM
124
217
0
06 Apr 2023
GLIGEN: Open-Set Grounded Text-to-Image Generation
Yuheng Li
Haotian Liu
Qingyang Wu
Fangzhou Mu
Jianwei Yang
Jianfeng Gao
Chunyuan Li
Yong Jae Lee
VLM
44
567
1
17 Jan 2023
Pix2seq: A Language Modeling Framework for Object Detection
Ting Chen
Saurabh Saxena
Lala Li
David J. Fleet
Geoffrey E. Hinton
MLLM
ViT
VLM
233
341
0
22 Sep 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,735
0
24 Feb 2021
Image Generation from Scene Graphs
Justin Johnson
Agrim Gupta
Li Fei-Fei
GNN
211
809
0
04 Apr 2018