2403.03206
Cited By
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
5 March 2024
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
Harry Saini
Yam Levi
Dominik Lorenz
Axel Sauer
Frederic Boesel
Dustin Podell
Tim Dockhorn
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
DiffM
Papers citing "Scaling Rectified Flow Transformers for High-Resolution Image Synthesis"
50 / 800 papers shown
TDSM: Triplet Diffusion for Skeleton-Text Matching in Zero-Shot Action Recognition
Jeonghyeok Do
Munchurl Kim
44
1
0
16 Nov 2024
FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Boyuan Jiang
Xiaobin Hu
Donghao Luo
Q. He
C. Xu
Jinlong Peng
J. Zhang
Chengjie Wang
Yunsheng Wu
Yanwei Fu
DiffM
28
6
0
15 Nov 2024
NeuralDEM -- Real-time Simulation of Industrial Particulate Flows
Benedikt Alkin
Tobias Kronlachner
Samuele Papa
Stefan Pirker
Thomas Lichtenegger
Johannes Brandstetter
PINN
AI4CE
38
1
1
14 Nov 2024
Latent Space Disentanglement in Diffusion Transformers Enables Precise Zero-shot Semantic Editing
Zitao Shuai
Chenwei Wu
Zhengxu Tang
Bowen Song
Liyue Shen
DiffM
50
0
0
12 Nov 2024
Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings
Aditya Sanghi
Aliasghar Khani
Pradyumna Reddy
Arianna Rampini
Derek Cheung
Kamal Rahimi Malekshan
Kanika Madan
Hooman Shayani
34
3
0
12 Nov 2024
FlowTS: Time Series Generation via Rectified Flow
Yang Hu
X. Wang
Lirong Wu
H. M. Zhang
Stan Z. Li
Sheng Wang
Tianlong Chen
Jiheng Zhang
Ziyun Li
AI4TS
26
0
0
12 Nov 2024
GaussianAnything: Interactive Point Cloud Flow Matching For 3D Object Generation
Yushi Lan
Shangchen Zhou
Zhaoyang Lyu
Fangzhou Hong
Shuai Yang
Bo Dai
Xingang Pan
Chen Change Loy
3DGS
53
0
0
12 Nov 2024
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models
Nvidia:
Yuval Atzmon
Maciej Bala
Yogesh Balaji
...
Ting-Chun Wang
Fangyin Wei
Xiaohui Zeng
Yu Zeng
Qinsheng Zhang
47
6
0
11 Nov 2024
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision
Cong Wei
Zheyang Xiong
Weiming Ren
Xinrun Du
Ge Zhang
Wenhu Chen
102
18
0
11 Nov 2024
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement
Zhennan Chen
Yajie Li
Haofan Wang
Z. Chen
Zhengkai Jiang
Jun Yu Li
Qian Wang
Jian Yang
Ying Tai
DiffM
47
8
0
10 Nov 2024
Autoregressive Models in Vision: A Survey
Jing Xiong
Gongye Liu
Lun Huang
Chengyue Wu
Taiqiang Wu
...
M. Zhang
Guillermo Sapiro
Jiebo Luo
Ping Luo
Ngai Wong
VGen
46
9
0
08 Nov 2024
AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation
Anil Kag
Huseyin Coskun
Jierun Chen
Junli Cao
Willi Menapace
Aliaksandr Siarohin
Sergey Tulyakov
Jian Ren
46
3
0
07 Nov 2024
Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion Inversion
Kaizhe Hu
Zihang Rui
Yao He
Yuyao Liu
Pu Hua
Huazhe Xu
33
1
0
07 Nov 2024
Boosting Latent Diffusion with Perceptual Objectives
Tariq Berrada
Pietro Astolfi
Jakob Verbeek
Melissa Hall
Marton Havasi
M. Drozdzal
Yohann Benchetrit
Adriana Romero Soriano
Karteek Alahari
43
0
0
06 Nov 2024
DiMSUM: Diffusion Mamba -- A Scalable and Unified Spatial-Frequency Method for Image Generation
Hao Phung
Quan Dao
T. Dao
Hoang Phan
Dimitris Metaxas
Anh Tran
Mamba
62
3
0
06 Nov 2024
On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models
Tariq Berrada Ifriqi
Pietro Astolfi
Melissa Hall
Reyhane Askari Hemmat
Yohann Benchetrit
...
Matthew Muckley
Karteek Alahari
Adriana Romero Soriano
Jakob Verbeek
M. Drozdzal
AI4CE
VLM
52
2
0
05 Nov 2024
Training-free Regional Prompting for Diffusion Transformers
Anthony Chen
Jianjin Xu
Wenzhao Zheng
Gaole Dai
Y. Wang
Renrui Zhang
Haofan Wang
Shanghang Zhang
VLM
40
2
0
04 Nov 2024
GenXD: Generating Any 3D and 4D Scenes
Yuyang Zhao
Chung-Ching Lin
Kevin Qinghong Lin
Zhiwen Yan
Linjie Li
Z. Yang
Jianfeng Wang
G. Lee
Lijuan Wang
VGen
43
14
0
04 Nov 2024
MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence
Fuming You
Minghui Fang
Li Tang
Rongjie Huang
Yongqi Wang
Zhou Zhao
18
2
0
04 Nov 2024
xDiT: an Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
Jiarui Fang
Jinzhe Pan
Xibo Sun
Aoyu Li
Jiannan Wang
54
4
0
04 Nov 2024
Randomized Autoregressive Visual Generation
Qihang Yu
Ju He
XueQing Deng
Xiaohui Shen
Liang-Chieh Chen
VGen
DiffM
52
30
1
01 Nov 2024
TextDestroyer: A Training- and Annotation-Free Diffusion Method for Destroying Anomal Text from Images
Mengcheng Li
Mingbao Lin
Fei Chao
Chia-Wen Lin
Rongrong Ji
DiffM
46
0
0
01 Nov 2024
Constant Acceleration Flow
Dogyun Park
Sojin Lee
S. Kim
Taehoon Lee
Youngjoon Hong
Hyunwoo J. Kim
50
2
0
01 Nov 2024
Scaling Concept With Text-Guided Diffusion Models
Chao Huang
Susan Liang
Yunlong Tang
Yapeng Tian
Anurag Kumar
Chenliang Xu
DiffM
43
5
0
31 Oct 2024
Redefining <Creative> in Dictionary: Towards an Enhanced Semantic Understanding of Creative Generation
Fu Feng
Yucheng Xie
Xu Yang
Jing Wang
Xin Geng
DiffM
26
0
0
31 Oct 2024
Controlling Language and Diffusion Models by Transporting Activations
P. Rodríguez
Arno Blaas
Michal Klein
Luca Zappella
N. Apostoloff
Marco Cuturi
Xavier Suau
LLMSV
35
4
0
30 Oct 2024
FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution
Shuai Wang
Zexian Li
Tianhui Song
Xubin Li
Tiezheng Ge
Bo Zheng
L. Wang
27
1
0
30 Oct 2024
Consistency Diffusion Bridge Models
Guande He
Kaiwen Zheng
Jianfei Chen
Fan Bao
Jun-Jie Zhu
DiffM
59
3
0
30 Oct 2024
Diffusion Beats Autoregressive: An Evaluation of Compositional Generation in Text-to-Image Models
Arash Marioriyad
Parham Rezaei
M. Baghshah
M. Rohban
CoGe
88
0
0
30 Oct 2024
ET-Flow: Equivariant Flow-Matching for Molecular Conformer Generation
Majdi Hassan
Nikhil Shenoy
Jungyoon Lee
Hannes Stärk
Stephan Thaler
Dominique Beaini
22
6
0
29 Oct 2024
GRADE: Quantifying Sample Diversity in Text-to-Image Models
Royi Rassin
Aviv Slobodkin
Shauli Ravfogel
Yanai Elazar
Yoav Goldberg
58
1
0
29 Oct 2024
AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models
Yaopei Zeng
Yuanpu Cao
Bochuan Cao
Yurui Chang
Jinghui Chen
Lu Lin
DiffM
29
3
0
28 Oct 2024
Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models
Wenda Li
Huijie Zhang
Qing Qu
WIGM
41
1
0
28 Oct 2024
Diff-Instruct*: Towards Human-Preferred One-step Text-to-image Generative Models
Weijian Luo
C. Zhang
Debing Zhang
Zhengyang Geng
26
3
0
28 Oct 2024
GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation
Phillip Y. Lee
Taehoon Yoon
Minhyuk Sung
42
4
1
27 Oct 2024
An Efficient Watermarking Method for Latent Diffusion Models via Low-Rank Adaptation
Dongdong Lin
Yue Li
B. Tondi
Bin Li
Mauro Barni
WIGM
26
1
0
26 Oct 2024
Flow Generator Matching
Zemin Huang
Zhengyang Geng
Weijian Luo
Guo-jun Qi
33
8
0
25 Oct 2024
Towards Visual Text Design Transfer Across Languages
Yejin Choi
Jiwan Chung
Sumin Shim
Giyeong Oh
Youngjae Yu
VLM
DiffM
22
1
0
24 Oct 2024
Rectified Diffusion Guidance for Conditional Generation
Mengfei Xia
Nan Xue
Yujun Shen
Ran Yi
Tieliang Gong
Y. Liu
DiffM
20
3
0
24 Oct 2024
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
Yuang Ai
Xiaoqiang Zhou
Huaibo Huang
Xiaotian Han
Zhengyu Chen
Quanzeng You
Hongxia Yang
42
8
0
24 Oct 2024
FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling
Zhengqiang Zhang
Ruihuang Li
Lei Zhang
33
2
0
24 Oct 2024
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
Shilin Lu
Zihan Zhou
Jiayou Lu
Yuanzhi Zhu
A. Kong
WIGM
80
10
0
24 Oct 2024
Scalable Ranked Preference Optimization for Text-to-Image Generation
Shyamgopal Karthik
Huseyin Coskun
Zeynep Akata
Sergey Tulyakov
J. Ren
Anil Kag
EGVM
52
4
0
23 Oct 2024
Training Free Guided Flow Matching with Optimal Control
Luran Wang
Chaoran Cheng
Yizhen Liao
Yanru Qu
Ge Liu
31
1
0
23 Oct 2024
VistaDream: Sampling multiview consistent images for single-view scene reconstruction
Haiping Wang
Yuan-Bin Liu
Ziwei Liu
Wenping Wang
Z. Dong
Bisheng Yang
45
11
0
22 Oct 2024
One-Step Diffusion Distillation through Score Implicit Matching
Weijian Luo
Zemin Huang
Zhengyang Geng
J. Zico Kolter
Guo-jun Qi
DiffM
24
14
0
22 Oct 2024
Efficient Antibody Structure Refinement Using Energy-Guided SE(3) Flow Matching
Jiying Zhang
Zijing Liu
Shengyuan Bai
He Cao
Yu Li
Lei Zhang
DiffM
35
1
0
22 Oct 2024
Progressive Compositionality in Text-to-Image Generative Models
Xu Han
Linghao Jin
Xiaofeng Liu
Paul Pu Liang
CoGe
96
2
0
22 Oct 2024
Elucidating the design space of language models for image generation
Xuantong Liu
Shaozhe Hao
Xianbiao Qi
Tianyang Hu
Jun Wang
Rong Xiao
Yuan Yao
VLM
30
3
0
21 Oct 2024
Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models
Giannis Daras
Weili Nie
Karsten Kreis
A. Dimakis
Morteza Mardani
Nikola B. Kovachki
Arash Vahdat
DiffM
33
5
0
21 Oct 2024