ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.03206
  4. Cited By
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

5 March 2024
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
Harry Saini
Yam Levi
Dominik Lorenz
Axel Sauer
Frederic Boesel
Dustin Podell
Tim Dockhorn
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
    DiffM
ArXiv (abs)PDFHTMLHuggingFace (68 upvotes)

Papers citing "Scaling Rectified Flow Transformers for High-Resolution Image Synthesis"

50 / 1,247 papers shown
Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning
Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning
Hao Tan
Jun Lan
Zichang Tan
Ajian Liu
Chuanbiao Song
Senyuan Shi
Huijia Zhu
Weiqiang Wang
Jun Wan
Zhen Lei
245
4
0
28 Aug 2025
Breaking Diffusion with Cache: Exploiting Approximate Caches in Diffusion Models
Breaking Diffusion with Cache: Exploiting Approximate Caches in Diffusion Models
Desen Sun
Shuncheng Jie
Sihang Liu
DiffM
132
0
0
28 Aug 2025
Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation
Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation
Xiaochuan Li
Guoguang Du
Runze Zhang
Liang Jin
Qi Jia
...
Tianqi Wang
Changsheng Li
Xiaoli Gong
Rengang Li
Baoyu Fan
VGen
113
0
0
28 Aug 2025
Mixture of Contexts for Long Video Generation
Mixture of Contexts for Long Video Generation
S. Cai
Ceyuan Yang
Lvmin Zhang
Yuwei Guo
Junfei Xiao
...
Alan Yuille
Leonidas Guibas
Maneesh Agrawala
Lu Jiang
Gordon Wetzstein
VLM
208
27
0
28 Aug 2025
Describe, Don't Dictate: Semantic Image Editing with Natural Language Intent
Describe, Don't Dictate: Semantic Image Editing with Natural Language Intent
En Ci
Shanyan Guan
Yanhao Ge
Yilin Zhang
Wei-Jang Li
Zhenyu Zhang
Jian Yang
Ying Tai
DiffM
97
2
0
28 Aug 2025
Phased One-Step Adversarial Equilibrium for Video Diffusion Models
Phased One-Step Adversarial Equilibrium for Video Diffusion Models
Jiaxiang Cheng
Bing Ma
Xuhua Ren
Hongyi Jin
Kai Yu
Peng Zhang
Wenyue Li
Yuan Zhou
Tianxiang Zheng
Qinglin Lu
DiffMVGen
173
3
0
28 Aug 2025
PHD: Personalized 3D Human Body Fitting with Point Diffusion
PHD: Personalized 3D Human Body Fitting with Point Diffusion
Hsuan-I Ho
Chen Guo
Po-Chen Wu
Ivan Shugurov
Chengcheng Tang
Abhay Mittal
Sizhe An
Manuel Kaufmann
Linguang Zhang
3DH
220
0
0
28 Aug 2025
Train-Once Plan-Anywhere Kinodynamic Motion Planning via Diffusion Trees
Train-Once Plan-Anywhere Kinodynamic Motion Planning via Diffusion Trees
Yaniv Hassidof
Tom Jurgenson
Kiril Solovey
183
1
0
28 Aug 2025
MotionFlux: Efficient Text-Guided Motion Generation through Rectified Flow Matching and Preference Alignment
MotionFlux: Efficient Text-Guided Motion Generation through Rectified Flow Matching and Preference Alignment
Zhiting Gao
Dan Song
Diqiong Jiang
Chao Xue
An-an Liu
VGen
165
0
0
27 Aug 2025
AudioStory: Generating Long-Form Narrative Audio with Large Language Models
AudioStory: Generating Long-Form Narrative Audio with Large Language Models
Yuxin Guo
Teng Wang
Yuying Ge
Shijie Ma
Yixiao Ge
Wei Zou
Mingyu Ding
DiffMAuLLM
147
3
0
27 Aug 2025
ERTACache: Error Rectification and Timesteps Adjustment for Efficient Diffusion
ERTACache: Error Rectification and Timesteps Adjustment for Efficient Diffusion
Xurui Peng
Hong Liu
Chenqian Yan
Rui Ma
Fangmin Chen
X. Wang
Zhihua Wu
Songwei Liu
Mingbao Lin
DiffM
201
1
0
27 Aug 2025
ROSE: Remove Objects with Side Effects in Videos
ROSE: Remove Objects with Side Effects in Videos
Chenxuan Miao
Yutong Feng
Jianshu Zeng
Zixiang Gao
Hantang Liu
Yunfeng Yan
Donglian Qi
Xi Chen
Bin Wang
Hengshuang Zhao
DiffMVGen
202
4
0
26 Aug 2025
VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space
VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space
Lin Li
Zehuan Huang
Haoran Feng
Gengxiong Zhuang
Rui Chen
Chunchao Guo
Lu Sheng
DiffMVGen
158
11
0
26 Aug 2025
The Mind's Eye: A Multi-Faceted Reward Framework for Guiding Visual Metaphor Generation
The Mind's Eye: A Multi-Faceted Reward Framework for Guiding Visual Metaphor Generation
Girish A. Koushik
Fatemeh Nazarieh
Katherine Birch
Shenbin Qian
Diptesh Kanojia
EGVM
118
0
0
26 Aug 2025
PRISM: Robust VLM Alignment with Principled Reasoning for Integrated Safety in Multimodality
PRISM: Robust VLM Alignment with Principled Reasoning for Integrated Safety in Multimodality
Nanxi Li
Zhengyue Zhao
Chaowei Xiao
LRM
69
0
0
26 Aug 2025
Preference Trajectory Modeling via Flow Matching for Sequential Recommendation
Preference Trajectory Modeling via Flow Matching for Sequential Recommendation
Li Li
Mingyue Cheng
Yuyang Ye
Zhiding Liu
Tong Xu
DiffMAI4TS
146
1
0
25 Aug 2025
Follow My Hold: Hand-Object Interaction Reconstruction through Geometric Guidance
Follow My Hold: Hand-Object Interaction Reconstruction through Geometric Guidance
Ayce Idil Aytekin
Helge Rhodin
Rishabh Dabral
Christian Theobalt
DiffM
129
2
0
25 Aug 2025
SuperGen: An Efficient Ultra-high-resolution Video Generation System with Sketching and Tiling
SuperGen: An Efficient Ultra-high-resolution Video Generation System with Sketching and Tiling
Fanjiang Ye
Zepeng Zhao
Yi Mu
Jucheng Shen
Renjie Li
...
Triston Cao
Aditya Akella
Arvind Krishnamurthy
T.S. Eugene Ng
Zhengzhong Tu
DiffMVGen
137
0
0
25 Aug 2025
JCo-MVTON: Jointly Controllable Multi-Modal Diffusion Transformer for Mask-Free Virtual Try-on
JCo-MVTON: Jointly Controllable Multi-Modal Diffusion Transformer for Mask-Free Virtual Try-on
Aowen Wang
Wei Li
Hao Luo
Mengxing Ao
Chenyu Zhu
Xinyang Li
Fan Wang
DiffM
104
0
0
25 Aug 2025
DiCache: Let Diffusion Model Determine Its Own Cache
DiCache: Let Diffusion Model Determine Its Own Cache
Jiazi Bu
Pengyang Ling
Yujie Zhou
Yibin Wang
Yuhang Zang
Tong Wu
Dahua Lin
DiffM
302
1
0
24 Aug 2025
Condition Weaving Meets Expert Modulation: Towards Universal and Controllable Image Generation
Condition Weaving Meets Expert Modulation: Towards Universal and Controllable Image Generation
Guoqing Zhang
Xingtong Ge
Lu Shi
Xin Zhang
Muqing Xue
Wanru Xu
Yigang Cen
J. Zhang
DiffM
216
0
0
24 Aug 2025
T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation
T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation
Kaiyue Sun
Rongyao Fang
Chengqi Duan
Xian Liu
Xihui Liu
167
14
0
24 Aug 2025
HiCache: Training-free Acceleration of Diffusion Models via Hermite Polynomial-based Feature Caching
HiCache: Training-free Acceleration of Diffusion Models via Hermite Polynomial-based Feature Caching
Liang Feng
Shikang Zheng
Jiacheng Liu
Yuqi Lin
Qinming Zhou
...
Xinyu Wang
Junjie Chen
Chang Zou
Yue Ma
Linfeng Zhang
DiffM
139
3
0
23 Aug 2025
On the Collapse Errors Induced by the Deterministic Sampler for Diffusion Models
On the Collapse Errors Induced by the Deterministic Sampler for Diffusion Models
Yi Zhang
Zhenyu Liao
Jingfeng Wu
Difan Zou
DiffM
186
1
0
22 Aug 2025
Scaling Group Inference for Diverse and High-Quality Generation
Scaling Group Inference for Diverse and High-Quality Generation
Gaurav Parmar
Or Patashnik
Daniil Ostashev
Kuan-Chieh Wang
Kfir Aberman
Srinivasa Narasimhan
Jun-Yan Zhu
177
2
0
21 Aug 2025
Efficient Identification of Critical Transitions via Flow Matching: A Scalable Generative Approach for Many-Body Systems
Efficient Identification of Critical Transitions via Flow Matching: A Scalable Generative Approach for Many-Body Systems
Qian-Rui Lee
Daw-Wei Wang
273
0
0
21 Aug 2025
MUSE: Multi-Subject Unified Synthesis via Explicit Layout Semantic Expansion
MUSE: Multi-Subject Unified Synthesis via Explicit Layout Semantic Expansion
Fei Peng
Junqiang Wu
Yan Li
Tingting Gao
Di Zhang
Huiyuan Fu
DiffM
164
2
0
20 Aug 2025
CurveFlow: Curvature-Guided Flow Matching for Image Generation
CurveFlow: Curvature-Guided Flow Matching for Image Generation
Yan Luo
Drake Du
Niraj Pudasaini
Yi Fang
Mengyu Wang
241
3
0
20 Aug 2025
CTA-Flux: Integrating Chinese Cultural Semantics into High-Quality English Text-to-Image Communities
CTA-Flux: Integrating Chinese Cultural Semantics into High-Quality English Text-to-Image Communities
Yue Gong
Shanyuan Liu
Liuzhuozheng Li
Jian Zhu
Bo Cheng
Liebucha Wu
Xiaoyu Wu
Yuhang Ma
Dawei Leng
Yuhui Yin
DiffM
233
0
0
20 Aug 2025
Ouroboros: Single-step Diffusion Models for Cycle-consistent Forward and Inverse Rendering
Ouroboros: Single-step Diffusion Models for Cycle-consistent Forward and Inverse Rendering
Shanlin Sun
Yifan Wang
Hanwen Zhang
Yifeng Xiong
Qin Ren
Ruogu Fang
Xiaohui Xie
Chenyu You
170
3
0
20 Aug 2025
Tinker: Diffusion's Gift to 3D--Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization
Tinker: Diffusion's Gift to 3D--Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization
Canyu Zhao
Xiaoman Li
Tianjian Feng
Zhiyue Zhao
Hao Chen
Chunhua Shen
DiffMVGen
183
2
0
20 Aug 2025
Disentanglement in T-space for Faster and Distributed Training of Diffusion Models with Fewer Latent-states
Disentanglement in T-space for Faster and Distributed Training of Diffusion Models with Fewer Latent-states
Samarth Gupta
Raghudeep Gadde
Rui Chen
Aleix M. Martinez
130
0
0
20 Aug 2025
Compute-Optimal Scaling for Value-Based Deep RL
Compute-Optimal Scaling for Value-Based Deep RL
Preston Fu
Oleh Rybkin
Zhiyuan Zhou
Michal Nauman
Pieter Abbeel
Sergey Levine
Aviral Kumar
OffRL
185
2
0
20 Aug 2025
OmniTry: Virtual Try-On Anything without Masks
OmniTry: Virtual Try-On Anything without Masks
Yutong Feng
Linlin Zhang
H. Cao
Yiming Chen
Xiaoduan Feng
Jian Cao
Yuxiong Wu
Bin Wang
115
2
0
19 Aug 2025
SAGA: Learning Signal-Aligned Distributions for Improved Text-to-Image Generation
SAGA: Learning Signal-Aligned Distributions for Improved Text-to-Image Generation
P. Grimal
Michael Soumm
Hervé Le Borgne
Olivier Ferret
Akihiro Sugimoto
DiffM
169
0
0
19 Aug 2025
Revisiting Diffusion Q-Learning: From Iterative Denoising to One-Step Action Generation
Revisiting Diffusion Q-Learning: From Iterative Denoising to One-Step Action Generation
Thanh Nguyen
Chang D. Yoo
192
1
0
19 Aug 2025
CTFlow: Video-Inspired Latent Flow Matching for 3D CT Synthesis
CTFlow: Video-Inspired Latent Flow Matching for 3D CT Synthesis
Jiayi Wang
Hadrien Reynaud
Franciskus Xaverius Erick
Bernhard Kainz
DiffMMedImVGen
108
0
0
18 Aug 2025
ID-Card Synthetic Generation: Toward a Simulated Bona fide Dataset
ID-Card Synthetic Generation: Toward a Simulated Bona fide Dataset
Qingwen Zeng
Juan E. Tapia
Izan Garcia
Juan M. Espin
Christoph Busch
AAML
146
0
0
18 Aug 2025
S$^2$-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models
S2^22-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models
Chubin Chen
Jiashu Zhu
Xiaokun Feng
Nisha Huang
Meiqi Wu
Fangyuan Mao
Jiahong Wu
Xiangxiang Chu
Xiu Li
249
1
0
18 Aug 2025
Next Visual Granularity Generation
Next Visual Granularity Generation
Yikai Wang
Zhouxia Wang
Zhonghua Wu
Qingyi Tao
Kang Liao
Chen Change Loy
147
1
0
18 Aug 2025
EgoTwin: Dreaming Body and View in First Person
EgoTwin: Dreaming Body and View in First Person
Jingqiao Xiu
Fangzhou Hong
Yicong Li
Mengze Li
Wentao Wang
Sirui Han
Liang Pan
Ziwei Liu
DiffMVGen
154
4
0
18 Aug 2025
LoRAtorio: An intrinsic approach to LoRA Skill Composition
LoRAtorio: An intrinsic approach to LoRA Skill Composition
Niki Foteinopoulou
Ignas Budvytis
Stephan Liwicki
MoMe
157
0
0
15 Aug 2025
A Survey on Diffusion Language Models
A Survey on Diffusion Language Models
Tianyi Li
Mingda Chen
Bowei Guo
Zhiqiang Shen
315
30
0
14 Aug 2025
Towards Spatially Consistent Image Generation: On Incorporating Intrinsic Scene Properties into Diffusion Models
Towards Spatially Consistent Image Generation: On Incorporating Intrinsic Scene Properties into Diffusion Models
H. J. Lee
Suhyung Choi
Byoung-Tak Zhang
Inwoo Hwang
189
0
0
14 Aug 2025
Increasing the Utility of Synthetic Images through Chamfer Guidance
Increasing the Utility of Synthetic Images through Chamfer Guidance
Nicola DallÁsen
Xiaofeng Zhang
Reyhane Askari Hemmat
Melissa Hall
Jakob Verbeek
Adriana Romero Soriano
M. Drozdzal
215
1
0
14 Aug 2025
MDNS: Masked Diffusion Neural Sampler via Stochastic Optimal Control
MDNS: Masked Diffusion Neural Sampler via Stochastic Optimal Control
Yuchen Zhu
Wei Guo
Jaemoo Choi
Guan-Horng Liu
Yongxin Chen
Molei Tao
203
9
0
14 Aug 2025
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale
NextStep Team
Chunrui Han
Guopeng Li
J. Wu
Quan Sun
...
Ziyang Meng
Binxing Jiao
Daxin Jiang
X. Zhang
Yibo Zhu
DiffM
201
22
0
14 Aug 2025
BLADE: Block-Sparse Attention Meets Step Distillation for Efficient Video Generation
BLADE: Block-Sparse Attention Meets Step Distillation for Efficient Video Generation
Youping Gu
Xiaolong Li
Yuhao Hu
Minqi Chen
Bohan Zhuang
VGen
179
0
0
14 Aug 2025
ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing
ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing
Lingen Li
Guangzhi Wang
Zhaoyang Zhang
Yaowei Li
Xiaoyu Li
Qi Dou
Jinwei Gu
Tianfan Xue
Mingyu Ding
VGen
177
2
0
14 Aug 2025
Ultra-High-Definition Reference-Based Landmark Image Super-Resolution with Generative Diffusion Prior
Ultra-High-Definition Reference-Based Landmark Image Super-Resolution with Generative Diffusion Prior
Zhenning Shi
Zizheng Yan
Yuhang Yu
Clara Xue
Jingyu Zhuang
Qi Zhang
Jinwei Chen
Tao Li
Qingnan Fan
128
0
0
14 Aug 2025
Previous
123...8910...232425
Next