Scaling Rectified Flow Transformers for High-Resolution Image Synthesis


5 March 2024
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
Harry Saini
Yam Levi
Dominik Lorenz
Axel Sauer
Frederic Boesel
Dustin Podell
Tim Dockhorn
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
    DiffM
ArXiv (abs) | PDF | HTML | HuggingFace (68 upvotes)

Papers citing "Scaling Rectified Flow Transformers for High-Resolution Image Synthesis"

50 / 1,247 papers shown
A Unified Multi-Agent Framework for Universal Multimodal Understanding and Generation
Jiulin Li
Ping Huang
Yexin Li
Shuo Chen
Juewen Hu
Ye Tian
106
1
0
14 Aug 2025
Gen-AFFECT: Generation of Avatar Fine-grained Facial Expressions with Consistent identiTy
Hao Yu
Rupayan Mallick
Margrit Betke
Sarah Adel Bargal
DiffM
83
0
0
13 Aug 2025
MangaDiT: Reference-Guided Line Art Colorization with Hierarchical Attention in Diffusion Transformers
Qianru Qiu
Jiafeng Mao
Kento Masui
Xueting Wang
DiffM
90
0
0
13 Aug 2025
A Survey on 3D Gaussian Splatting Applications: Segmentation, Editing, and Generation
Shuting He
Peilin Ji
Yitong Yang
Changshuo Wang
Jiayi Ji
Yinglin Wang
Henghui Ding
3DGS
293
9
0
13 Aug 2025
OneVAE: Joint Discrete and Continuous Optimization Helps Discrete Video VAE Train Better
Yupeng Zhou
Zhen Li
Ziheng Ouyang
Yuming Chen
Ruoyi Du
...
Bin Fu
Yihao Liu
Peng Gao
Ming-Ming Cheng
Qibin Hou
210
1
0
13 Aug 2025
Edge General Intelligence Through World Models and Agentic AI: Fundamentals, Solutions, and Challenges
Changyuan Zhao
Guangyuan Liu
Ruichen Zhang
Yinqiu Liu
Jiacheng Wang
...
Shen
Zhu Han
Sumei Sun
Chau Yuen
Dong In Kim
209
5
0
13 Aug 2025
Lay2Story: Extending Diffusion Transformers for Layout-Togglable Story Generation
Ao Ma
Jiasong Feng
Ke Cao
Jing Wang
Yun Wang
Quanwei Zhang
Zhanjie Zhang
DiffM, VGen
162
5
0
12 Aug 2025
Per-Query Visual Concept Learning
Ori Malca
Dvir Samuel
Gal Chechik
DiffM, VLM
114
0
0
12 Aug 2025
Stand-In: A Lightweight and Plug-and-Play Identity Control for Video Generation
Bowen Xue
Zheng-Peng Duan
Qixin Yan
Wenjing Wang
Hao Liu
Chun-Le Guo
Chongyi Li
Chen Li
Jing Lyu
DiffM, VGen
179
5
0
11 Aug 2025
Generative Video Matting
Yongtao Ge
Kangyang Xie
Guangkai Xu
Mingyu Liu
Li Ke
Longtao Huang
Hui Xue
Hao Chen
Chunhua Shen
DiffM, VGen
104
2
0
11 Aug 2025
Enhancing Small-Scale Dataset Expansion with Triplet-Connection-based Sample Re-Weighting
Ting Xiang
Changjian Chen
Zhuo Tang
Qifeng Zhang
Fei Lyu
Li Yang
Jiapeng Zhang
KenLi Li
MedIm
139
0
0
11 Aug 2025
VSF: Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flip
Wenqi Guo
Shan Du
DiffM
452
1
0
11 Aug 2025
Learning User Preferences for Image Generation Model
Wenyi Mo
Ying Ba
Tianyu Zhang
Yalong Bai
Biye Li
DiffM
88
2
0
11 Aug 2025
TBAC-UniImage: Unified Understanding and Generation by Ladder-Side Diffusion Tuning
Junzhe Xu
Yuyang Yin
Xi Chen
229
5
0
11 Aug 2025
OMGSR: You Only Need One Mid-timestep Guidance for Real-World Image Super-Resolution
Zhiqiang Wu
Zhaomang Sun
Tong Zhou
Bingtao Fu
Ji Cong
Yitong Dong
Huaqi Zhang
Xuan Tang
Xiao He
Xian Wei
DiffM
116
1
0
11 Aug 2025
X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning
Jian Ma
Xujie Zhu
Zihao Pan
Qirong Peng
Xu Guo
Chen Chen
H. Lu
156
5
0
11 Aug 2025
Score Augmentation for Diffusion Models
Liang Hou
Yuan Gao
Boyuan Jiang
Xin Tao
Qi Yan
Renjie Liao
Pengfei Wan
Di Zhang
Kun Gai
DiffM
129
0
0
11 Aug 2025
Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing
Joonghyuk Shin
Alchan Hwang
Yujin Kim
Daneul Kim
Jaesik Park
DiffM
122
4
0
11 Aug 2025
Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers
Xin Ma
Yaohui Wang
Genyun Jia
Xinyuan Chen
Tien-Tsin Wong
C. L. P. Chen
VGen
160
0
0
10 Aug 2025
DCoAR: Deep Concept Injection into Unified Autoregressive Models for Personalized Text-to-Image Generation
Fangtai Wu
Mushui Liu
Weijie He
Wanggui He
Hao Jiang
DiffM
134
0
0
10 Aug 2025
HiMat: DiT-based Ultra-High Resolution SVBRDF Generation
Zixiong Wang
Jian Yang
Yiwei Hu
Milos Hasan
Beibei Wang
227
0
0
09 Aug 2025
MultiRef: Controllable Image Generation with Multiple Visual References
Ruoxi Chen
Dongping Chen
Siyuan Wu
Sinan Wang
Shiyun Lang
Petr Sushko
Gaoyang Jiang
Yao Wan
Ranjay Krishna
DiffM
288
2
0
09 Aug 2025
CannyEdit: Selective Canny Control and Dual-Prompt Guidance for Training-Free Image Editing
Weiyan Xie
Han Gao
Didan Deng
Kaican Li
April Hua Liu
Yongxiang Huang
Nevin L. Zhang
DiffM
203
0
0
09 Aug 2025
Towards High-Order Mean Flow Generative Models: Feasibility, Expressivity, and Provably Efficient Criteria
Yang Cao
Yubin Chen
Zhao Song
Jiahao Zhang
174
7
0
09 Aug 2025
SwiftVideo: A Unified Framework for Few-Step Video Generation through Trajectory-Distribution Alignment
Yanxiao Sun
Jiafu Wu
Yun Cao
C. Xu
Yabiao Wang
Weijian Cao
Donghao Luo
Chengjie Wang
Yanwei Fu
DiffM, VGen
161
3
0
08 Aug 2025
Diffusion LLMs Can Do Faster-Than-AR Inference via Discrete Diffusion Forcing
Xu Wang
Chenkai Xu
Yijie Jin
Jiachun Jin
Hao Zhang
Zhijie Deng
AI4CE
170
31
0
08 Aug 2025
MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows
Xiquan Li
Junxi Liu
Yuzhe Liang
Zhikang Niu
Wenxi Chen
Xie Chen
259
2
0
08 Aug 2025
WeTok: Powerful Discrete Tokenization for High-Fidelity Visual Reconstruction
Shaobin Zhuang
Yiwei Guo
Canmiao Fu
Z. Huang
Zeyue Tian
Ying Zhang
Ying Zhang
Chen Li
Yali Wang
ViT
224
2
0
07 Aug 2025
MAISI-v2: Accelerated 3D High-Resolution Medical Image Synthesis with Rectified Flow and Region-specific Contrastive Loss
Can Zhao
Pengfei Guo
Dong Yang
Yucheng Tang
Yufan He
Benjamin D. Simon
Mason J Belue
Stephanie Harmon
Baris Turkbey
Daguang Xu
MedIm
82
3
0
07 Aug 2025
DualMat: PBR Material Estimation via Coherent Dual-Path Diffusion
Yifeng Huang
Zhang Chen
Yi Tian Xu
Minh Hoai
Zhong Li
DiffM
113
1
0
07 Aug 2025
UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation
Wonjun Kang
Byeongkeun Ahn
Minjae Lee
Kevin Galim
Seunghyuk Oh
Hyung Il Koo
N. Cho
DiffM
180
0
0
07 Aug 2025
Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off
Seungyong Lee
Jeong-gi Kwak
DiffM
237
1
0
06 Aug 2025
HierarchicalPrune: Position-Aware Compression for Large-Scale Diffusion Models
Young D. Kwon
Rui Li
Sijia Li
Da Li
S. Bhattacharya
Stylianos I. Venieris
VLM
168
2
0
06 Aug 2025
TempFlow-GRPO: When Timing Matters for GRPO in Flow Models
Xiaoxuan He
Siming Fu
Yuke Zhao
W. Li
Zhiqiang Wang
Dacheng Yin
Fengyun Rao
Bo Zhang
AI4CE
342
25
0
06 Aug 2025
SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering
J. Melechovský
Ambuj Mehrish
Abhinaba Roy
Dorien Herremans
185
2
0
05 Aug 2025
Injecting Measurement Information Yields a Fast and Noise-Robust Diffusion-Based Inverse Problem Solver
J. Patsenker
Henry Li
Myeongseob Ko
Ruoxi Jia
Y. Kluger
DiffM
332
0
0
05 Aug 2025
RAAG: Ratio Aware Adaptive Guidance
Shangwen Zhu
Qianyu Peng
Yuting Hu
Zhantao Yang
Han Zhang
Zhao Pu
Andy Zheng
Zhilei Shu
Ruili Feng
Fan Cheng
AI4TS
229
1
0
05 Aug 2025
Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models
Hyungjin Kim
Seokho Ahn
Young-Duk Seo
DiffM
130
1
0
05 Aug 2025
LORE: Latent Optimization for Precise Semantic Control in Rectified Flow-based Image Editing
Liangyang Ouyang
Jiafeng Mao
DiffM
204
1
0
05 Aug 2025
Likelihood Matching for Diffusion Models
Lei Qian
Wu Su
Yanqi Huang
Song Xi Chen
DiffM
158
0
0
05 Aug 2025
READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation
Haotian Wang
Yuzhe Weng
Jun Du
Haoran Xu
X. Wu
Shan He
Bing Yin
Cong Liu
J. Gao
Qingfeng Liu
DiffM, VGen
297
1
0
05 Aug 2025
UniEdit-I: Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verifying
Chengyu Bai
Jintao Chen
Xiang Bai
Yilong Chen
Qi She
Ming Lu
Shanghang Zhang
185
1
0
05 Aug 2025
DreamPainter: Image Background Inpainting for E-commerce Scenarios
Sijie Zhao
Jing Cheng
Yaoyao Wu
Hao Xu
Shaohui Jiao
DiffM
114
0
0
04 Aug 2025
StrandDesigner: Towards Practical Strand Generation with Sketch Guidance
Na Zhang
Moran Li
Chengming Xu
Han Feng
Xiaobin Hu
Jiangning Zhang
Weijian Cao
Chengjie Wang
Yanwei Fu
DiffM
92
0
0
03 Aug 2025
The Promise of RL for Autoregressive Image Editing
Saba Ahmadi
Rabiul Awal
Ankur Sikarwar
Amirhossein Kazemnejad
Ge Ya Luo
...
Sai Rajeswar
Siva Reddy
C. Pal
Benno Krojer
Aishwarya Agrawal
OffRL, KELM
271
2
0
01 Aug 2025
AudioGen-Omni: A Unified Multimodal Diffusion Transformer for Video-Synchronized Audio, Speech, and Song Generation
L. Wang
Jun Wang
Feng Deng
Feng Deng
Chen Zhang
Di Zhang
Kun Gai
DiffM, VGen
757
8
0
01 Aug 2025
DC-AE 1.5: Accelerating Diffusion Model Convergence with Structured Latent Space
Junyu Chen
Dongyun Zou
Wenkun He
Junsong Chen
Enze Xie
Song Han
Han Cai
185
16
0
01 Aug 2025
SDMatte: Grafting Diffusion Models for Interactive Matting
Daigang Xu
Yu Liang
H. Zhang
Jinwei Chen
Wei Dong
L. Chen
Wanyu Liu
Bo Li
P. Jiang
DiffM
225
2
0
01 Aug 2025
FMPlug: Plug-In Foundation Flow-Matching Priors for Inverse Problems
Yuxiang Wan
Ryan Devera
Wenjie Zhang
Ju Sun
AI4CE
175
1
0
01 Aug 2025
SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation
K. T. Pham
Yingqing He
Yazhou Xing
Qifeng Chen
L. Chen
DiffM, VGen
1.1K
1
0
01 Aug 2025