ResearchTrend.AI

BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

20 July 2023
Jinheng Xie
Yuexiang Li
Yawen Huang
Haozhe Liu
Wentian Zhang
Yefeng Zheng
Mike Zheng Shou
    DiffM

Papers citing "BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion"

50 / 166 papers shown
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement
Zhennan Chen
Yajie Li
Haofan Wang
Z. Chen
Zhengkai Jiang
Jun Yu Li
Qian Wang
Jian Yang
Ying Tai
DiffM
47
8
0
10 Nov 2024
Towards Small Object Editing: A Benchmark Dataset and A Training-Free Approach
Qihe Pan
Zhen Zhao
Zicheng Wang
Sifan Long
Yiming Wu
Wei Ji
Haoran Liang
Ronghua Liang
19
0
0
03 Nov 2024
GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation
Phillip Y. Lee
Taehoon Yoon
Minhyuk Sung
42
1
1
27 Oct 2024
Layout-your-3D: Controllable and Precise 3D Generation with 2D Blueprint
Junwei Zhou
Xueting Li
Lu Qi
Ming Yang
DiffM
29
2
0
20 Oct 2024
SeaS: Few-shot Industrial Anomaly Image Generation with Separation and Sharing Fine-tuning
Zhewei Dai
Shilei Zeng
Haotian Liu
Xurui Li
Feng Xue
Yu Zhou
DiffM
24
1
0
19 Oct 2024
HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation
Bo Cheng
Yuhang Ma
Liebucha Wu
Shanyuan Liu
Ao Ma
Xiaoyu Wu
Dawei Leng
Yuhui Yin
DiffM
22
8
0
18 Oct 2024
3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation
Dewei Zhou
Ji Xie
Zongxin Yang
Yi Yang
DiffM
62
6
0
16 Oct 2024
Semantic Score Distillation Sampling for Compositional Text-to-3D Generation
L. Yang
Zixiang Zhang
Junlin Han
Bohan Zeng
Runjia Li
Philip Torr
Wentao Zhang
34
2
0
11 Oct 2024
Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis
Ahmed Abdullah
Nikolas Ebert
Oliver Wasenmüller
ObjD
25
1
0
09 Oct 2024
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
Xinchen Zhang
Ling Yang
G. Li
Yaqi Cai
Jiake Xie
Yong Tang
Yujiu Yang
Mengdi Wang
Bin Cui
EGVM
CoGe
36
5
0
09 Oct 2024
DiffGAD: A Diffusion-based Unsupervised Graph Anomaly Detector
Jinghan Li
Yuan Gao
Jinda Lu
Junfeng Fang
Congcong Wen
Hui Lin
Xiang Wang
45
0
0
09 Oct 2024
T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design
Jiachen Li
Qian Long
Jian Zheng
Xiaofeng Gao
Robinson Piramuthu
Wenhu Chen
William Yang Wang
VGen
25
22
0
08 Oct 2024
OmniBooth: Learning Latent Control for Image Synthesis with Multi-modal Instruction
Leheng Li
Weichao Qiu
Xu Yan
Jing He
Kaiqiang Zhou
Yingjie Cai
Qing Lian
Bingbing Liu
Ying-Cong Chen
SyDa
DiffM
39
1
0
07 Oct 2024
Event-Customized Image Generation
Zhen Wang
Yilei Jiang
Dong Zheng
Jun Xiao
Long Chen
DiffM
26
1
0
03 Oct 2024
Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function
Chenyi Zhuang
Ying Hu
Pan Gao
DiffM
VLM
42
12
0
30 Sep 2024
Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation
Jiaxin Cheng
Zixu Zhao
Tong He
Tianjun Xiao
Yicong Zhou
Zheng Zhang
DiffM
37
0
0
07 Sep 2024
SpotActor: Training-Free Layout-Controlled Consistent Image Generation
Jiahao Wang
Caixia Yan
Weizhan Zhang
Haonan Lin
Mengmeng Wang
Guang Dai
Tieliang Gong
Hao Sun
Jingdong Wang
DiffM
29
2
0
07 Sep 2024
Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation
Abdelrahman Eldesokey
Peter Wonka
DiffM
27
4
0
27 Aug 2024
Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching
Minghao Liu
Le Zhang
Yingjie Tian
Xiaochao Qu
Luoqi Liu
Ting Liu
DiffM
CoGe
27
2
0
25 Aug 2024
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation
Jinheng Xie
Weijia Mao
Zechen Bai
David Junhao Zhang
Weihao Wang
Kevin Qinghong Lin
Yuchao Gu
Zhijie Chen
Zhenheng Yang
Mike Zheng Shou
44
159
0
22 Aug 2024
TraDiffusion: Trajectory-Based Training-Free Image Generation
Mingrui Wu
Oucheng Huang
Jiayi Ji
Jiale Li
Xinyue Cai
Huafeng Kuang
Jianzhuang Liu
Xiaoshuai Sun
Rongrong Ji
40
3
0
19 Aug 2024
Concept Conductor: Orchestrating Multiple Personalized Concepts in Text-to-Image Synthesis
Zebin Yao
Fangxiang Feng
Ruifan Li
Xiaojie Wang
DiffM
31
1
0
07 Aug 2024
ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models
Ming-Kuan Wu
Xinyue Cai
Jiayi Ji
Jiale Li
Oucheng Huang
Gen Luo
Hao Fei
Xiaoshuai Sun
Rongrong Ji
MLLM
40
7
0
31 Jul 2024
FedDEO: Description-Enhanced One-Shot Federated Learning with Diffusion Models
Mingzhao Yang
Shangchao Su
Bin Li
Xiangyang Xue
19
5
0
29 Jul 2024
ReCorD: Reasoning and Correcting Diffusion for HOI Generation
Jian-Yu Jiang-Lin
Kang-Yang Huang
Ling Lo
Yi-Ning Huang
Terence Lin
Jhih-Ciang Wu
Hong-Han Shuai
Wen-Huang Cheng
DiffM
24
5
0
25 Jul 2024
DiffX: Guide Your Layout to Cross-Modal Generative Modeling
Zeyu Wang
Jingyu Lin
Yifei Qian
Yi Huang
Shicen Tian
...
Qu Yang
Lan Du
Cunjian Chen
Yufei Guo
Kejie Huang
DiffM
VLM
23
2
0
22 Jul 2024
LSReGen: Large-Scale Regional Generator via Backward Guidance Framework
Bowen Zhang
Cheng Yang
Xuanhui Liu
DiffM
14
0
0
21 Jul 2024
Training-free Composite Scene Generation for Layout-to-Image Synthesis
Jiaqi Liu
Tao Huang
Chang Xu
DiffM
30
5
0
18 Jul 2024
The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation
Yi Yao
Chan-Feng Hsu
Jhe-Hao Lin
Hongxia Xie
Terence Lin
Yi-Ning Huang
Hong-Han Shuai
Wen-Huang Cheng
DiffM
24
4
0
17 Jul 2024
AirSketch: Generative Motion to Sketch
Hui Xian Grace Lim
Xuanming Cui
Y. S. Rawat
Ser-Nam Lim
VGen
DiffM
24
0
0
12 Jul 2024
PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models
Jinhua Zhang
Hualian Sheng
Sijia Cai
Bing Deng
Qiao Liang
Wen Li
Ying Fu
Jieping Ye
Shuhang Gu
DiffM
32
2
0
08 Jul 2024
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
Zhenyu Wang
Aoxue Li
Zhenguo Li
Xihui Liu
MLLM
DiffM
41
25
0
08 Jul 2024
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
Zhaorun Chen
Yichao Du
Zichen Wen
Yiyang Zhou
Chenhang Cui
...
Jiawei Zhou
Zhuokai Zhao
Rafael Rafailov
Chelsea Finn
Huaxiu Yao
EGVM
MLLM
53
29
0
05 Jul 2024
PartCraft: Crafting Creative Objects by Parts
Kam Woh Ng
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
35
6
0
05 Jul 2024
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
Dewei Zhou
Y. Li
Fan Ma
Zongxin Yang
Y. Yang
88
11
0
02 Jul 2024
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
Yicheng Chen
Xiangtai Li
Yining Li
Yanhong Zeng
Jianzong Wu
Xiangyu Zhao
Kai Chen
VLM
DiffM
56
3
0
28 Jun 2024
AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation
Yanan Sun
Yanchen Liu
Yinhao Tang
Wenjie Pei
Kai Chen
DiffM
19
8
0
27 Jun 2024
MotionBooth: Motion-Aware Customized Text-to-Video Generation
Jianzong Wu
Xiangtai Li
Yanhong Zeng
J. J. Zhang
Qianyu Zhou
Yining Li
Yunhai Tong
Kai Chen
DiffM
VGen
70
40
0
25 Jun 2024
Prompt-Consistency Image Generation (PCIG): A Unified Framework Integrating LLMs, Knowledge Graphs, and Controllable Diffusion Models
Yichen Sun
Zhixuan Chu
Zhan Qin
Kui Ren
DiffM
30
0
0
24 Jun 2024
Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
Jiho Choi
Seonho Lee
Seungho Lee
Minhyun Lee
Hyunjung Shim
OCL
33
0
0
17 Jun 2024
Poetry2Image: An Iterative Correction Framework for Images Generated from Chinese Classical Poetry
Jing Jiang
Yiran Ling
Binzhu Li
Pengxiang Li
Junming Piao
Yu Zhang
EGVM
DiffM
35
1
0
15 Jun 2024
Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models
Ziyi Wu
Yulia Rubanova
Rishabh Kabra
Drew A. Hudson
Igor Gilitschenski
Yusuf Aytar
Sjoerd van Steenkiste
Kelsey R. Allen
Thomas Kipf
VGen
DiffM
36
10
0
13 Jun 2024
AttnDreamBooth: Towards Text-Aligned Personalized Text-to-Image Generation
Lianyu Pang
Jian Yin
Baoquan Zhao
Feize Wu
Fu Lee Wang
Qing Li
Xudong Mao
DiffM
37
1
0
07 Jun 2024
VideoTetris: Towards Compositional Text-to-Video Generation
Ye Tian
Ling Yang
Haotian Yang
Yuan Gao
Yufan Deng
...
Zhaochen Yu
Xin Tao
Pengfei Wan
Di Zhang
Bin Cui
DiffM
VGen
76
15
0
06 Jun 2024
ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models
Jingyuan Zhu
Shiyu Li
Yuxuan Liu
Ping-Chia Huang
Jiulong Shan
Huimin Ma
Jian Yuan
24
3
0
24 May 2024
FreeTuner: Any Subject in Any Style with Training-free Diffusion
Youcan Xu
Zhen Wang
Jun Xiao
Wei Liu
Long Chen
DiffM
27
9
0
23 May 2024
Enhancing Image Layout Control with Loss-Guided Diffusion Models
Zakaria Patel
Kirill Serkh
DiffM
25
3
0
23 May 2024
Robust Disaster Assessment from Aerial Imagery Using Text-to-Image Synthetic Data
Tarun Kalluri
Jihyeon Janel Lee
Kihyuk Sohn
Sahil Singla
Manmohan Chandraker
Joseph Z. Xu
Jeremiah Liu
31
1
0
22 May 2024
Bridging the Intent Gap: Knowledge-Enhanced Visual Generation
Yi Cheng
Ziwei Xu
Dongyun Lin
Harry Cheng
Yongkang Wong
Ying Sun
Joo Hwee Lim
Mohan S. Kankanhalli
31
0
0
21 May 2024
RATLIP: Generative Adversarial CLIP Text-to-Image Synthesis Based on Recurrent Affine Transformations
Chengde Lin
Xijun Lu
Guangxi Chen
33
0
0
13 May 2024