MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis

8 February 2024
Dewei Zhou
You Li
Fan Ma
Zongxin Yang
Yi Yang
    DiffM

Papers citing "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis"

44 / 44 papers shown
Title
DreamingComics: A Story Visualization Pipeline via Subject and Layout Customized Generation using Video Models
Patrick Kwon
Chen Chen
DiffM, AI4TS, VGen
96
0
0
01 Dec 2025
A Training-Free Approach for Multi-ID Customization via Attention Adjustment and Spatial Control
Jiawei Lin
Guanlong Jiao
Jianjin Xu
259
0
0
25 Nov 2025
Are Image-to-Video Models Good Zero-Shot Image Editors?
Zechuan Zhang
Zhenyuan Chen
Zongxin Yang
Yi Yang
DiffM, VGen
513
0
0
24 Nov 2025
BideDPO: Conditional Image Generation with Simultaneous Text and Condition Alignment
Dewei Zhou
Mingwei Li
Zongxin Yang
Yu Lu
Yunqiu Xu
Zhizhong Wang
Zeyi Huang
Yi Yang
DiffM, EGVM
148
0
0
24 Nov 2025
ConsistCompose: Unified Multimodal Layout Control for Image Composition
Xuanke Shi
B. Li
Xiaoyang Han
Zhongang Cai
Lei Yang
Dahua Lin
Quan-ding Wang
MLLM
302
0
0
23 Nov 2025
Compositional Image Synthesis with Inference-Time Scaling
Minsuk Ji
Sanghyeok Lee
Namhyuk Ahn
DiffM, MLLM, EGVM, VLM
242
0
0
28 Oct 2025
UltraHR-100K: Enhancing UHR Image Synthesis with A Large-Scale High-Quality Dataset
Chen Zhao
En Ci
Yunzhe Xu
Tiehan Fan
Shanyan Guan
Yanhao Ge
Jian Yang
Ying Tai
148
7
0
23 Oct 2025
ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generation
Ruihang Xu
Dewei Zhou
Fan Ma
Yi Yang
DiffM
108
2
0
13 Oct 2025
DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing
Zihan Zhou
Shilin Lu
Shuli Leng
Shaocong Zhang
Zhuming Lian
Xinlei Yu
A. Kong
DiffM
231
7
0
02 Oct 2025
Does FLUX Already Know How to Perform Physically Plausible Image Composition?
Shilin Lu
Zhuming Lian
Zihan Zhou
Shaocong Zhang
Chen Zhao
A. Kong
262
11
0
25 Sep 2025
OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps
Bingnan Li
Chen Wang
Haiyang Xu
Xiang Zhang
Ethan Armand
Divyansh Srivastava
Xiaojun Shan
Zeyuan Chen
Jianwen Xie
Zhuowen Tu
VLM
118
1
0
23 Sep 2025
InstanceAssemble: Layout-Aware Image Generation via Instance Assembling Attention
Qiang Xiang
Shuang Sun
Binglei Li
Dejia Song
Huaxia Li
Nemo Chen
Xu Tang
Yao Hu
Junping Zhang
DiffM
252
0
0
20 Sep 2025
Double Helix Diffusion for Cross-Domain Anomaly Image Generation
Linchun Wu
Qin Zou
Xianbiao Qi
Bo Du
Zhongyuan Wang
Qingquan Li
124
0
0
16 Sep 2025
MUSE: Multi-Subject Unified Synthesis via Explicit Layout Semantic Expansion
Fei Peng
Junqiang Wu
Yan Li
Tingting Gao
Di Zhang
Huiyuan Fu
DiffM
128
2
0
20 Aug 2025
LaRender: Training-Free Occlusion Control in Image Generation via Latent Rendering
Xiaohang Zhan
Dingming Liu
DiffM
100
2
0
11 Aug 2025
YOLO-Count: Differentiable Object Counting for Text-to-Image Generation
Guanning Zeng
Xiang Zhang
Zirui Wang
Haiyang Xu
Zeyuan Chen
Bingnan Li
Zhuowen Tu
138
5
0
01 Aug 2025
LLMControl: Grounded Control of Text-to-Image Diffusion-based Synthesis with Multimodal LLMs
Jiaze Wang
Rui Chen
Haowang Cui
145
0
0
26 Jul 2025
GenEscape: Hierarchical Multi-Agent Generation of Escape Room Puzzles
Mengyi Shan
Brian L. Curless
Ira Kemelmacher-Shlizerman
S. M. Seitz
153
0
0
27 Jun 2025
ControlThinker: Unveiling Latent Semantics for Controllable Image Generation through Visual Reasoning
Feng Han
Yang Jiao
Shaoxiang Chen
Junhao Xu
Yue Yu
Yu-Gang Jiang
DiffM, LRM
248
2
0
04 Jun 2025
Seg2Any: Open-set Segmentation-Mask-to-Image Generation with Precise Shape and Semantic Control
Danfeng Li
Hui Zhang
Sheng Wang
Jiacheng Li
Zuxuan Wu
DiffM, VLM
322
0
0
31 May 2025
ISAC: Training-Free Instance-to-Semantic Attention Control for Improving Multi-Instance Generation
Sanghyun Jo
Wooyeol Lee
Ziseok Lee
Kyungsu Kim
1.1K
0
0
27 May 2025
CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design
H. Zhang
Dexiang Hong
Maoke Yang
Yutao Chen
Zhao Zhang
Jie Shao
Xinglong Wu
Zuxuan Wu
Yu Jiang
DiffM, AI4CE
499
12
0
25 May 2025
Hierarchical and Step-Layer-Wise Tuning of Attention Specialty for Multi-Instance Synthesis in Diffusion Transformers
Chunyang Zhang
Zhenhong Sun
Zhicheng Zhang
Junyan Wang
Yu Zhang
Dong Gong
H. Mo
Daoyi Dong
369
1
0
14 Apr 2025
Marmot: Object-Level Self-Correction via Multi-Agent Reasoning
Jiayang Sun
Hongru Wang
Jie Cao
Huaibo Huang
Ran He
DiffM
373
0
0
10 Apr 2025
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes
Nikai Du
Zhennan Chen
Zheyu Chen
Shan Gao
Xi Chen
Zhengkai Jiang
Jian Yang
Ying Tai
DiffM
560
13
0
30 Mar 2025
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
Dewei Zhou
Mingwei Li
Zongxin Yang
Yi Yang
434
14
0
17 Mar 2025
PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models
Runze He
Bo Cheng
Yuhang Ma
Qingxiang Jia
Shanyuan Liu
Ao Ma
Xiaoyu Wu
Dawei Leng
Yuhui Yin
DiffM, VLM
462
6
0
13 Mar 2025
CE-SDWV: Effective and Efficient Concept Erasure for Text-to-Image Diffusion Models via a Semantic-Driven Word Vocabulary
Jiahang Tu
Qian Feng
Jiahua Dong
Hanbin Zhao
Chao Zhang
Hui Qian
DiffM
337
4
0
26 Jan 2025
DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Qu He
Jinlong Peng
P. Xu
Boyuan Jiang
Xiaobin Hu
...
Wenshu Fan
Yun Wang
Chengjie Wang
Xuelong Li
Jing Zhang
DiffM
525
3
0
04 Dec 2024
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Computer Vision and Pattern Recognition (CVPR), 2024
Qifan Yu
Wei Chow
Zhongqi Yue
Kaihang Pan
Yang Wu
Xiaoyang Wan
Juncheng Billy Li
Siliang Tang
Hao Zhang
Yueting Zhuang
DiffM
437
107
0
24 Nov 2024
AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Datao Tang
Xiangyong Cao
Xuan Wu
Jialin Li
Jing Yao
Xueru Bai
Yin Li
Deyu Meng
DiffM
485
34
0
23 Nov 2024
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
International Conference on Learning Representations (ICLR), 2024
Shilin Lu
Zihan Zhou
Jiayou Lu
Yuanzhi Zhu
A. Kong
WIGM
541
74
0
24 Oct 2024
The Scene Language: Representing Scenes with Programs, Words, and Embeddings
Computer Vision and Pattern Recognition (CVPR), 2024
Yunzhi Zhang
Zizhang Li
Mingyuan Zhou
Shangzhe Wu
Jiajun Wu
452
14
0
22 Oct 2024
Layout-your-3D: Controllable and Precise 3D Generation with 2D Blueprint
International Conference on Learning Representations (ICLR), 2024
Junwei Zhou
Xueting Li
Lu Qi
Ming-Hsuan Yang
DiffM
262
14
0
20 Oct 2024
3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation
Dewei Zhou
Ji Xie
Zongxin Yang
Yi Yang
DiffM
306
21
0
16 Oct 2024
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
International Conference on Learning Representations (ICLR), 2024
Xinchen Zhang
Ling Yang
Ge Li
Yaqi Cai
Jiake Xie
Yong Tang
Yujiu Yang
Mengdi Wang
Bin Cui
EGVM, CoGe
294
18
0
09 Oct 2024
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
Dewei Zhou
Yuchen Ren
Fan Ma
Zongxin Yang
Yue Yang
363
27
0
02 Jul 2024
Semantic-guided Adversarial Diffusion Model for Self-supervised Shadow Removal
Ziqi Zeng
Chen Zhao
Weiling Cai
Chenyu Dong
DiffM
225
5
0
01 Jul 2024
Prompt-Consistency Image Generation (PCIG): A Unified Framework Integrating LLMs, Knowledge Graphs, and Controllable Diffusion Models
Yichen Sun
Zhixuan Chu
Zhan Qin
Kui Ren
DiffM
211
2
0
24 Jun 2024
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
X. Wang
Siming Fu
Qihan Huang
Wanggui He
Hao Jiang
DiffM
482
97
0
11 Jun 2024
A Survey on Personalized Content Synthesis with Diffusion Models
Machine Intelligence Research (MIR), 2024
Xu-Lu Zhang
Xiao Wei
Wengyu Zhang
Jinlin Wu
Jiaxin Wu
Zhaoxiang Zhang
Zhen Lei
Qing Li
EGVM
508
30
0
09 May 2024
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
Zhen Zhou
Fan Ma
Hehe Fan
Yi Yang
3DGS
173
34
0
09 Feb 2024
Wavelet-based Fourier Information Interaction with Frequency Diffusion Adjustment for Underwater Image Restoration
Computer Vision and Pattern Recognition (CVPR), 2023
Chen Zhao
Weiling Cai
Chenyu Dong
Chengwei Hu
307
126
0
28 Nov 2023
LoCo: Locally Constrained Training-Free Layout-to-Image Synthesis
Peiang Zhao
Han Li
Ruiyang Jin
S. Kevin Zhou
DiffM
385
18
0
21 Nov 2023