MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis

8 February 2024
Dewei Zhou
You Li
Fan Ma
Zongxin Yang
Yi Yang
    DiffM

Papers citing "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis"

46 / 46 papers shown
DreamingComics: A Story Visualization Pipeline via Subject and Layout Customized Generation using Video Models
Patrick Kwon
Chen Chen
DiffM, AI4TS, VGen
161
0
0
01 Dec 2025
SAIDO: Generalizable Detection of AI-Generated Images via Scene-Aware and Importance-Guided Dynamic Optimization in Continual Learning
Yongkang Hu
Yu Cheng
Y. Zhang
Yuan Xie
Zhaoxia Yin
94
0
0
29 Nov 2025
A Training-Free Approach for Multi-ID Customization via Attention Adjustment and Spatial Control
Jiawei Lin
Guanlong Jiao
Jianjin Xu
290
0
0
25 Nov 2025
BideDPO: Conditional Image Generation with Simultaneous Text and Condition Alignment
Dewei Zhou
Mingwei Li
Zongxin Yang
Yu Lu
Yunqiu Xu
Zhizhong Wang
Zeyi Huang
Yi Yang
DiffM, EGVM
204
0
0
24 Nov 2025
Are Image-to-Video Models Good Zero-Shot Image Editors?
Zechuan Zhang
Zhenyuan Chen
Zongxin Yang
Yi Yang
DiffM, VGen
571
0
0
24 Nov 2025
DiP: Taming Diffusion Models in Pixel Space
Z. Chen
J. Zhu
Xu Chen
Jiangning Zhang
Xiaobin Hu
Hanzhen Zhao
C. Wang
Jian Yang
Ying Tai
306
1
0
24 Nov 2025
ConsistCompose: Unified Multimodal Layout Control for Image Composition
Xuanke Shi
B. Li
Xiaoyang Han
Zhongang Cai
Lei Yang
Dahua Lin
Quan-ding Wang
MLLM
391
0
0
23 Nov 2025
Compositional Image Synthesis with Inference-Time Scaling
Minsuk Ji
Sanghyeok Lee
Namhyuk Ahn
DiffM, MLLM, EGVM, VLM
267
0
0
28 Oct 2025
UltraHR-100K: Enhancing UHR Image Synthesis with A Large-Scale High-Quality Dataset
Chen Zhao
En Ci
Yunzhe Xu
Tiehan Fan
Shanyan Guan
Yanhao Ge
Jian Yang
Ying Tai
179
7
0
23 Oct 2025
ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generation
Ruihang Xu
Dewei Zhou
Fan Ma
Yi Yang
DiffM
201
2
0
13 Oct 2025
DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing
Zihan Zhou
Shilin Lu
Shuli Leng
Shaocong Zhang
Zhuming Lian
Xinlei Yu
A. Kong
DiffM
323
7
0
02 Oct 2025
Does FLUX Already Know How to Perform Physically Plausible Image Composition?
Shilin Lu
Zhuming Lian
Zihan Zhou
Shaocong Zhang
Chen Zhao
A. Kong
316
11
0
25 Sep 2025
OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps
Bingnan Li
Chen Wang
Haiyang Xu
Xiang Zhang
Ethan Armand
Divyansh Srivastava
Xiaojun Shan
Zeyuan Chen
Jianwen Xie
Zhuowen Tu
VLM
163
1
0
23 Sep 2025
InstanceAssemble: Layout-Aware Image Generation via Instance Assembling Attention
Qiang Xiang
Shuang Sun
Binglei Li
Dejia Song
Huaxia Li
Nemo Chen
Xu Tang
Yao Hu
Junping Zhang
DiffM
302
1
0
20 Sep 2025
Double Helix Diffusion for Cross-Domain Anomaly Image Generation
Linchun Wu
Qin Zou
Xianbiao Qi
Bo Du
Zhongyuan Wang
Qingquan Li
171
0
0
16 Sep 2025
MUSE: Multi-Subject Unified Synthesis via Explicit Layout Semantic Expansion
Fei Peng
Junqiang Wu
Yan Li
Tingting Gao
Di Zhang
Huiyuan Fu
DiffM
165
2
0
20 Aug 2025
LaRender: Training-Free Occlusion Control in Image Generation via Latent Rendering
Xiaohang Zhan
Dingming Liu
DiffM
140
2
0
11 Aug 2025
YOLO-Count: Differentiable Object Counting for Text-to-Image Generation
Guanning Zeng
Xiang Zhang
Zirui Wang
Haiyang Xu
Zeyuan Chen
Bingnan Li
Zhuowen Tu
174
6
0
01 Aug 2025
LLMControl: Grounded Control of Text-to-Image Diffusion-based Synthesis with Multimodal LLMs
Jiaze Wang
Rui Chen
Haowang Cui
181
0
0
26 Jul 2025
GenEscape: Hierarchical Multi-Agent Generation of Escape Room Puzzles
Mengyi Shan
Brian L. Curless
Ira Kemelmacher-Shlizerman
S. M. Seitz
177
0
0
27 Jun 2025
ControlThinker: Unveiling Latent Semantics for Controllable Image Generation through Visual Reasoning
Feng Han
Yang Jiao
Shaoxiang Chen
Junhao Xu
Yue Yu
Yu-Gang Jiang
DiffM, LRM
289
3
0
04 Jun 2025
Seg2Any: Open-set Segmentation-Mask-to-Image Generation with Precise Shape and Semantic Control
Danfeng Li
Hui Zhang
Sheng Wang
Jiacheng Li
Zuxuan Wu
DiffM, VLM
354
2
0
31 May 2025
ISAC: Training-Free Instance-to-Semantic Attention Control for Improving Multi-Instance Generation
Sanghyun Jo
Wooyeol Lee
Ziseok Lee
Kyungsu Kim
1.1K
0
0
27 May 2025
CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design
H. Zhang
Dexiang Hong
Maoke Yang
Yutao Chen
Zhao Zhang
Jie Shao
Xinglong Wu
Zuxuan Wu
Yu Jiang
DiffM, AI4CE
562
14
0
25 May 2025
Hierarchical and Step-Layer-Wise Tuning of Attention Specialty for Multi-Instance Synthesis in Diffusion Transformers
Chunyang Zhang
Zhenhong Sun
Zhicheng Zhang
Junyan Wang
Yu Zhang
Dong Gong
H. Mo
Daoyi Dong
418
1
0
14 Apr 2025
Marmot: Object-Level Self-Correction via Multi-Agent Reasoning
Jiayang Sun
Hongru Wang
Jie Cao
Huaibo Huang
Ran He
DiffM
435
0
0
10 Apr 2025
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes
Nikai Du
Zhennan Chen
Zheyu Chen
Shan Gao
Xi Chen
Zhengkai Jiang
Jian Yang
Ying Tai
DiffM
660
16
0
30 Mar 2025
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
Dewei Zhou
Mingwei Li
Zongxin Yang
Yi Yang
494
15
0
17 Mar 2025
PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models
Runze He
Bo Cheng
Yuhang Ma
Qingxiang Jia
Shanyuan Liu
Ao Ma
Xiaoyu Wu
Dawei Leng
Yuhui Yin
DiffM, VLM
532
7
0
13 Mar 2025
CE-SDWV: Effective and Efficient Concept Erasure for Text-to-Image Diffusion Models via a Semantic-Driven Word Vocabulary
Jiahang Tu
Qian Feng
Jiahua Dong
Hanbin Zhao
Chao Zhang
Hui Qian
DiffM
368
7
0
26 Jan 2025
DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Qu He
Jinlong Peng
P. Xu
Boyuan Jiang
Xiaobin Hu
...
Wenshu Fan
Yun Wang
Chengjie Wang
Xuelong Li
Jing Zhang
DiffM
618
3
0
04 Dec 2024
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Computer Vision and Pattern Recognition (CVPR), 2024
Qifan Yu
Wei Chow
Zhongqi Yue
Kaihang Pan
Yang Wu
Xiaoyang Wan
Juncheng Billy Li
Siliang Tang
Hao Zhang
Yueting Zhuang
DiffM
549
118
0
24 Nov 2024
AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Datao Tang
Xiangyong Cao
Xuan Wu
Jialin Li
Jing Yao
Xueru Bai
Deyu Meng
Yin Li
Deyu Meng
DiffM
564
35
0
23 Nov 2024
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
International Conference on Learning Representations (ICLR), 2024
Shilin Lu
Zihan Zhou
Jiayou Lu
Yuanzhi Zhu
A. Kong
WIGM
570
82
0
24 Oct 2024
The Scene Language: Representing Scenes with Programs, Words, and Embeddings
Computer Vision and Pattern Recognition (CVPR), 2024
Yunzhi Zhang
Zizhang Li
Mingyuan Zhou
Shangzhe Wu
Jiajun Wu
514
16
0
22 Oct 2024
Layout-your-3D: Controllable and Precise 3D Generation with 2D Blueprint
International Conference on Learning Representations (ICLR), 2024
Junwei Zhou
Xueting Li
Lu Qi
Ming-Hsuan Yang
DiffM
312
14
0
20 Oct 2024
3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation
Dewei Zhou
Ji Xie
Zongxin Yang
Yi Yang
DiffM
367
23
0
16 Oct 2024
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
International Conference on Learning Representations (ICLR), 2024
Xinchen Zhang
Ling Yang
Ge Li
Yaqi Cai
Jiake Xie
Yong Tang
Yujiu Yang
Mengdi Wang
Bin Cui
EGVM, CoGe
356
19
0
09 Oct 2024
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
Dewei Zhou
Yuchen Ren
Fan Ma
Zongxin Yang
Yue Yang
412
28
0
02 Jul 2024
Semantic-guided Adversarial Diffusion Model for Self-supervised Shadow Removal
Ziqi Zeng
Chen Zhao
Weiling Cai
Chenyu Dong
DiffM
255
5
0
01 Jul 2024
Prompt-Consistency Image Generation (PCIG): A Unified Framework Integrating LLMs, Knowledge Graphs, and Controllable Diffusion Models
Yichen Sun
Zhixuan Chu
Zhan Qin
Kui Ren
DiffM
246
2
0
24 Jun 2024
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
X. Wang
Siming Fu
Qihan Huang
Wanggui He
Hao Jiang
DiffM
555
104
0
11 Jun 2024
A Survey on Personalized Content Synthesis with Diffusion Models
Machine Intelligence Research (MIR), 2024
Xu-Lu Zhang
Xiao Wei
Wengyu Zhang
Jinlin Wu
Jiaxin Wu
Zhen Lei
Zhaoxiang Zhang
Zhen Lei
Qing Li
EGVM
600
31
0
09 May 2024
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
Zhen Zhou
Fan Ma
Hehe Fan
Yi Yang
3DGS
215
35
0
09 Feb 2024
Wavelet-based Fourier Information Interaction with Frequency Diffusion Adjustment for Underwater Image Restoration
Computer Vision and Pattern Recognition (CVPR), 2023
Chen Zhao
Weiling Cai
Chenyu Dong
Chengwei Hu
370
140
0
28 Nov 2023
LoCo: Locally Constrained Training-Free Layout-to-Image Synthesis
Peiang Zhao
Han Li
Ruiyang Jin
S. Kevin Zhou
DiffM
450
18
0
21 Nov 2023