arXiv: 2302.08113
MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation
International Conference on Machine Learning (ICML), 2023
16 February 2023
Omer Bar-Tal
Lior Yariv
Y. Lipman
Tali Dekel
Papers citing "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation"
50 / 219 papers shown
MARBLE: Material Recomposition and Blending in CLIP-Space
Computer Vision and Pattern Recognition (CVPR), 2025
Ta-Ying Cheng
Prafull Sharma
Mark Boss
Varun Jampani
DiffM
254
4
0
05 Jun 2025
PixCell: A generative foundation model for digital histopathology images
Srikar Yellapragada
Alexandros Graikos
Zilinghan Li
Kostas Triaridis
Varun Belagali
...
Tahsin M. Kurc
Rajarsi R. Gupta
Ravi K. Madduri
Joel H. Saltz
Dimitris Samaras
DiffM
MedIm
347
2
0
05 Jun 2025
GP-MoLFormer-Sim: Test Time Molecular Optimization through Contextual Similarity Guidance
Jirí Navrátil
Jarret Ross
Payel Das
Youssef Mroueh
Samuel C. Hoffman
Vijil Chenthamarakshan
Brian M. Belgodere
204
0
0
05 Jun 2025
Facial Appearance Capture at Home with Patch-Level Reflectance Prior
ACM Transactions on Graphics (TOG), 2025
Yuxuan Han
Junfeng Lyu
Kuan Sheng
Minghao Que
Qixuan Zhang
Lan Xu
Feng Xu
DiffM
191
4
0
04 Jun 2025
Ultra-High-Resolution Image Synthesis: Data, Method and Evaluation
Jinjin Zhang
Qiuyu Huang
Junjie Liu
Xiefan Guo
Di Huang
229
2
0
02 Jun 2025
Image Generation from Contextually-Contradictory Prompts
Saar Huberman
Or Patashnik
Omer Dahary
Ron Mokady
Daniel Cohen-Or
DiffM
232
3
0
02 Jun 2025
MOVi: Training-free Text-conditioned Multi-Object Video Generation
Aimon Rahman
Jiang Liu
Ze Wang
Ximeng Sun
Jialian Wu
Xiaodong Yu
Yusheng Su
Vishal M. Patel
Zicheng Liu
Emad Barsoum
DiffM
VGen
275
1
0
29 May 2025
What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?
Jinhong Ni
Chang-Bin Zhang
Qiang Zhang
Jing Zhang
MDE
185
5
0
28 May 2025
ISAC: Training-Free Instance-to-Semantic Attention Control for Improving Multi-Instance Generation
Sanghyun Jo
Wooyeol Lee
Ziseok Lee
Kyungsu Kim
1.1K
0
0
27 May 2025
Be Decisive: Noise-Induced Layouts for Multi-Subject Generation
Omer Dahary
Yehonathan Cohen
Or Patashnik
Kfir Aberman
Daniel Cohen-Or
DiffM
286
6
0
27 May 2025
Conditional Panoramic Image Generation via Masked Autoregressive Modeling
Chaoyang Wang
Xiangtai Li
Lu Qi
X. Lin
Jinbin Bai
Qianyu Zhou
Yunhai Tong
DiffM
322
3
0
22 May 2025
Creatively Upscaling Images with Global-Regional Priors
International Journal of Computer Vision (IJCV), 2025
Yurui Qian
Qi Cai
Yingwei Pan
Ting Yao
Tao Mei
DiffM
383
0
0
22 May 2025
Aquarius: A Family of Industry-Level Video Generation Models for Marketing Scenarios
Huafeng Shi
Jianzhong Liang
Rongchang Xie
Xian Wu
Cheng Chen
Chang Liu
VGen
375
0
0
14 May 2025
HCMA: Hierarchical Cross-model Alignment for Grounded Text-to-Image Generation
Hang Wang
Zhi-Qi Cheng
Chenhao Lin
Chao Shen
Lei Zhang
DiffM
416
1
0
10 May 2025
Computationally Efficient Diffusion Models in Medical Imaging: A Comprehensive Review
Abdullah
Wei Chen
Ickjai Lee
Euijoon Ahn
MedIm
434
2
0
09 May 2025
InstanceGen: Image Generation with Instance-level Instructions
Etai Sella
Yanir Kleiman
Hadar Averbuch-Elor
422
4
0
08 May 2025
Improving Editability in Image Generation with Layer-wise Memory
Computer Vision and Pattern Recognition (CVPR), 2025
Daneul Kim
Jaeah Lee
Jaesik Park
DiffM
KELM
297
1
0
02 May 2025
JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers
Kwon Byung-Ki
Jingdong Sun
Lee Hyoseok
Chong Luo
Tae-Hyun Oh
654
4
0
01 May 2025
Generative Machine Learning in Adaptive Control of Dynamic Manufacturing Processes: A Review
Suk Ki Lee
Hyunwoong Ko
AI4CE
404
2
0
30 Apr 2025
DiTPainter: Efficient Video Inpainting with Diffusion Transformers
Xian Wu
Chang Liu
DiffM
363
2
0
22 Apr 2025
SphereDiff: Tuning-free 360° Static and Dynamic Panorama Generation via Spherical Latent Representation
Minho Park
Taewoong Kang
Jooyeol Yun
Sungwon Hwang
Jaegul Choo
VGen
MDE
412
4
0
19 Apr 2025
Hadamard product in deep learning: Introduction, Advances and Challenges
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Grigorios G. Chrysos
Yongtao Wu
Razvan Pascanu
Philip Torr
Volkan Cevher
AAML
348
15
0
17 Apr 2025
Omni²: Unifying Omnidirectional Image Generation and Editing in an Omni Model
Liu Yang
Huiyu Duan
Yucheng Zhu
Xiaohong Liu
Lu Liu
Zitong Xu
Guangji Ma
Xiongkuo Min
Guoquan Zheng
P. Callet
VLM
VGen
931
6
0
15 Apr 2025
Marmot: Object-Level Self-Correction via Multi-Agent Reasoning
Jiayang Sun
Hongru Wang
Jie Cao
Huaibo Huang
Ran He
DiffM
419
0
0
10 Apr 2025
Compass Control: Multi Object Orientation Control for Text-to-Image Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Rishubh Parihar
Vaibhav Agrawal
Sachidanand VS
R. V. Babu
DiffM
389
2
0
09 Apr 2025
HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance
Jiazi Bu
Pengyang Ling
Yujie Zhou
Pan Zhang
Tong Wu
Xiaoyi Dong
Yuhang Zang
Yuhang Cao
Dahua Lin
Jiaqi Wang
298
6
0
08 Apr 2025
Disentangling Instruction Influence in Diffusion Transformers for Parallel Multi-Instruction-Guided Image Editing
Hui Liu
Bin Zou
Suiyun Zhang
Kecheng Chen
Rui Liu
Haoliang Li
DiffM
242
0
0
07 Apr 2025
LV-MAE: Learning Long Video Representations through Masked-Embedding Autoencoders
Ilan Naiman
Emanuel Ben-Baruch
Oron Anschel
Alon Shoshan
Igor Kviatkovsky
Manoj Aggarwal
Gérard Medioni
298
0
0
04 Apr 2025
ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation
Yunhong Min
Daehyeon Choi
Kyeongmin Yeo
Jihyun Lee
Minhyuk Sung
478
1
0
28 Mar 2025
SyncSDE: A Probabilistic Framework for Diffusion Synchronization
Computer Vision and Pattern Recognition (CVPR), 2025
Hyunjun Lee
Hyunsoo Lee
Sookwan Han
DiffM
457
1
0
27 Mar 2025
Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models
Prin Phunyaphibarn
Phillip Y. Lee
Jaihoon Kim
Minhyuk Sung
DiffM
482
5
0
26 Mar 2025
ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2025
Fernando Julio Cendra
Kai Han
VLM
410
0
0
25 Mar 2025
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2025
Jinho Jeong
Sangmin Han
Jinwoo Kim
Seon Joo Kim
353
11
0
24 Mar 2025
Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2025
Jinjin Zhang
Qiuyu Huang
Junjie Liu
Xiefan Guo
Di Huang
365
26
0
24 Mar 2025
Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation
Ziming Wei
Bingqian Lin
Yunshuang Nie
Jiaqi Chen
Shikui Ma
Hang Xu
Xiaodan Liang
488
3
0
23 Mar 2025
MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space
Lixing Xiao
Shunlin Lu
Huaijin Pi
Bin Ji
Liang Pan
Yueer Zhou
Ziyong Feng
Xiaowei Zhou
Sida Peng
Jingbo Wang
DiffM
VGen
463
26
0
19 Mar 2025
MOSAIC: Generating Consistent, Privacy-Preserving Scenes from Multiple Depth Views in Multi-Room Environments
Zhixuan Liu
H. Zhu
R. Chen
Jonathan M Francis
Soonmin Hwang
Jiangning Zhang
Jean Oh
VGen
1.2K
2
0
18 Mar 2025
Exploring Position Encoding in Diffusion U-Net for Training-free High-resolution Image Generation
Feng Zhou
Pu Cao
Yiyang Ma
Pu Cao
Jianqin Yin
DiffM
273
3
0
12 Mar 2025
Consistent Image Layout Editing with Diffusion Models
Tao Xia
Yudi Zhang
Ting Liu
Lei Zhang
DiffM
291
1
0
09 Mar 2025
PixelPonder: Dynamic Patch Adaptation for Enhanced Multi-Conditional Text-to-Image Generation
Yanjie Pan
Qu He
Zhengkai Jiang
P. Xu
Chaoyi Wang
...
Yun Cao
Zhenye Gan
M. Chi
Bo Peng
Yun Wang
DiffM
350
5
0
09 Mar 2025
TextDoctor: Unified Document Image Inpainting via Patch Pyramid Diffusion Models
Wanglong Lu
Lingming Su
Jingjing Zheng
Vinícius Veloso de Melo
Farzaneh Shoeleh
J. Hawkin
T. Tricco
Hanli Zhao
Xianta Jiang
DiffM
279
2
0
06 Mar 2025
Attention Distillation: A Unified Approach to Visual Characteristics Transfer
Computer Vision and Pattern Recognition (CVPR), 2025
Yang Zhou
Xu Gao
Zichong Chen
Hui Huang
DiffM
274
21
0
27 Feb 2025
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Yifan Pu
Yiming Zhao
Zhicong Tang
Ruihong Yin
Haoxing Ye
...
Ji Li
Xiu Li
Zheng Lian
Gao Huang
Baining Guo
DiffM
406
20
0
25 Feb 2025
LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Shuai Yang
Jing Tan
Mengchen Zhang
Tong Wu
Yongqian Li
Gordon Wetzstein
Yu Qiao
Dahua Lin
MDE
VGen
362
34
0
24 Feb 2025
Spherical Dense Text-to-Image Synthesis
Timon Winter
Stanislav Frolov
Brian B. Moser
Andreas Dengel
MDE
DiffM
488
0
0
18 Feb 2025
SketchFlex: Facilitating Spatial-Semantic Coherence in Text-to-Image Generation with Region-Based Sketches
International Conference on Human Factors in Computing Systems (CHI), 2025
Haichuan Lin
Yilin Ye
Jiazhi Xia
Wei Zeng
DiffM
271
8
0
11 Feb 2025
Beyond and Free from Diffusion: Invertible Guided Consistency Training
Chia-Hong Hsu
Shiu-hong Kao
Randall Balestriero
3DV
369
1
0
08 Feb 2025
T-Stars-Poster: A Framework for Product-Centric Advertising Image Design
Hongyu Chen
Min Zhou
Jing Jiang
Jiale Chen
Yang Lu
Bo Xiao
Bangyu Xiang
Bo Zheng
DiffM
VLM
344
0
0
24 Jan 2025
PreciseCam: Precise Camera Control for Text-to-Image Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Edurne Bernal-Berdun
Ana Serrano
B. Masiá
Matheus Gadelha
Yannick Hold-Geoffroy
Xin Sun
Diego F. F. Gutierrez
DiffM
VGen
216
9
0
22 Jan 2025
Parallel Sequence Modeling via Generalized Spatial Propagation Network
Computer Vision and Pattern Recognition (CVPR), 2025
Hongjun Wang
Wonmin Byeon
Jiarui Xu
Liang Feng
Ka Chun Cheung
Xiaolong Wang
Kai Han
Jan Kautz
Sifei Liu
837
3
0
21 Jan 2025