ResearchTrend.AI
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

5 March 2024
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
Harry Saini
Yam Levi
Dominik Lorenz
Axel Sauer
Frederic Boesel
Dustin Podell
Tim Dockhorn
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
    DiffM
ArXiv (abs) | PDF | HTML | HuggingFace (68 upvotes)

Papers citing "Scaling Rectified Flow Transformers for High-Resolution Image Synthesis"

50 / 1,247 papers shown
MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices
Shuai Zhang
Bao Tang
Siyuan Yu
Yueting Zhu
Jingfeng Yao
Ya Zou
Shanglin Yuan
Li Yu
Wenyu Liu
Xinggang Wang
DiffM, VGen
204
0
0
26 Nov 2025
ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding
Byeongjun Park
Byung-Hoon Kim
Hyungjin Chung
Jong Chul Ye
VGen
190
0
0
25 Nov 2025
CREward: A Type-Specific Creativity Reward Model
Jiyeon Han
Ali Mahdavi-Amiri
Hao Zhang
Haedong Jeong
105
0
0
25 Nov 2025
Restora-Flow: Mask-Guided Image Restoration with Flow Matching
Arnela Hadzic
Franz Thaler
Lea Bogensperger
Simon Johannes Joham
M. Urschler
DiffM
550
0
0
25 Nov 2025
PromptMoG: Enhancing Diversity in Long-Prompt Image Generation via Prompt Embedding Mixture-of-Gaussian Sampling
Bo-Kai Ruan
Teng-Fang Hsiao
Ling Lo
Yi-Lun Wu
Hong-Han Shuai
DiffM, VLM
185
0
0
25 Nov 2025
HiCoGen: Hierarchical Compositional Text-to-Image Generation in Diffusion Models via Reinforcement Learning
Hongji Yang
Yucheng Zhou
Wencheng Han
Runzhou Tao
Zhongying Qiu
Jianfei Yang
Jianbing Shen
DiffM, EGVM
348
0
0
25 Nov 2025
DINO-Tok: Adapting DINO for Visual Tokenizers
Mingkai Jia
Mingxiao Li
Liaoyuan Fan
Tianxing Shi
Jiaxin Guo
...
Xiaoyang Guo
Xiao-Xiao Long
Qian Zhang
P. Tan
Wei Yin
ViT
192
0
0
25 Nov 2025
Training-Free Generation of Diverse and High-Fidelity Images via Prompt Semantic Space Optimization
Debin Meng
Chen Jin
Zheng Gao
Yanran Li
Ioannis Patras
Georgios Tzimiropoulos
DiffM
267
0
0
25 Nov 2025
A Training-Free Approach for Multi-ID Customization via Attention Adjustment and Spatial Control
Jiawei Lin
Guanlong Jiao
Jianjin Xu
272
0
0
25 Nov 2025
The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation
Weijia Mao
Hao Chen
Zhenheng Yang
Mike Zheng Shou
EGVM
272
0
0
25 Nov 2025
HBridge: H-Shape Bridging of Heterogeneous Experts for Unified Multimodal Understanding and Generation
Xiang Wang
Zhifei Zhang
Chentao Song
Zhe Lin
Yuqian Zhou
...
Haitian Zheng
Jason Kuen
Yuehuan Wang
Changxin Gao
Nong Sang
MoE
172
0
0
25 Nov 2025
EmoFeedback$^2$: Reinforcement of Continuous Emotional Image Generation via LVLM-based Reward and Textual Feedback
Jingyang Jia
Kai Shu
Gang Yang
Long Xing
Xun Chen
Aiping Liu
EGVM
395
1
0
25 Nov 2025
Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning
Guanjie Chen
Shirui Huang
Kai Liu
J. Zhu
Xiaoye Qu
Peng Chen
Yu Cheng
Yifu Sun
189
1
0
25 Nov 2025
The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment
Ziheng Ouyang
Yiren Song
Y. Liu
Shihao Zhu
Qibin Hou
Ming-Ming Cheng
Mike Zheng Shou
128
0
0
25 Nov 2025
STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows
Jiatao Gu
Ying Shen
Tianrong Chen
Laurent Dinh
Y. Wang
Miguel Angel Bautista
David Berthelot
Josh Susskind
Shuangfei Zhai
DiffM, VGen
302
3
0
25 Nov 2025
Block Cascading: Training Free Acceleration of Block-Causal Video Models
Hmrishav Bandyopadhyay
Nikhil Pinnaparaju
Rahim Entezari
Jim Scott
Yi-Zhe Song
Varun Jampani
VGen
100
1
0
25 Nov 2025
SONIC: Spectral Optimization of Noise for Inpainting with Consistency
Seungyeon Baek
Erqun Dong
Shadan Namazifard
Mark J. Matthews
Kwang Moo Yi
145
1
0
25 Nov 2025
RubricRL: Simple Generalizable Rewards for Text-to-Image Generation
Xuelu Feng
Yunsheng Li
Ziyu Wan
Zixuan Gao
Junsong Yuan
Dongdong Chen
Chunming Qiao
EGVM
274
0
0
25 Nov 2025
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation
Zhoujie Fu
Xianfang Zeng
Jinghong Lan
Xinyao Liao
Cheng Chen
...
Wei Cheng
Shiyu Liu
Y. Chen
Gang Yu
Guosheng Lin
DiffM, VGen
344
1
0
25 Nov 2025
One Attention, One Scale: Phase-Aligned Rotary Positional Embeddings for Mixed-Resolution Diffusion Transformer
Haoyu Wu
Jingyi Xu
Qiaomu Miao
Dimitris Samaras
H. Le
89
0
0
24 Nov 2025
Large Language Models for the Summarization of Czech Documents: From History to the Present
Václav Tran
Jakub Šmíd
Ladislav Lenc
Jean-Pierre Salmon
Pavel Král
83
0
0
24 Nov 2025
HunyuanVideo 1.5 Technical Report
Bing Wu
Chang Zou
Changlin Li
Duojun Huang
Fang Yang
...
Zhihe Yang
Zilin Yang
Z. Lu
Zixiang Zhou
Zhao Zhong
DiffM, VGen
328
4
0
24 Nov 2025
Dynamic Granularity Matters: Rethinking Vision Transformers Beyond Fixed Patch Splitting
Qiyang Yu
Yu Fang
Tianrui Li
Xuemei Cao
Yan Chen
Jianghao Li
Fan Min
ViT
125
0
0
24 Nov 2025
Beyond Reward Margin: Rethinking and Resolving Likelihood Displacement in Diffusion Models via Video Generation
Ruojun Xu
Yu Kai
Xuhua Ren
Jiaxiang Cheng
Bing Ma
Tianxiang Zheng
Qinhlin Lu
EGVM
159
0
0
24 Nov 2025
Test-Time Preference Optimization for Image Restoration
Bingchen Li
Xin Li
Jiaqi Xu
Jiaming Guo
Wenbo Li
Renjing Pei
Zhibo Chen
125
0
0
24 Nov 2025
ProxT2I: Efficient Reward-Guided Text-to-Image Generation via Proximal Diffusion
Zhenghan Fang
Jian Zheng
Qiaozi Gao
Xiaofeng Gao
Jeremias Sulam
212
0
0
24 Nov 2025
Terminal Velocity Matching
Linqi Zhou
Mathias Parger
Ayaan Haque
Jiaming Song
70
0
0
24 Nov 2025
DiP: Taming Diffusion Models in Pixel Space
Z. Chen
J. Zhu
Xu Chen
Jiangning Zhang
Xiaobin Hu
Hanzhen Zhao
C. Wang
Jian Yang
Ying Tai
283
0
0
24 Nov 2025
One4D: Unified 4D Generation and Reconstruction via Decoupled LoRA Control
Zhenxing Mi
Yuxin Wang
Dan Xu
VGen
164
0
0
24 Nov 2025
BideDPO: Conditional Image Generation with Simultaneous Text and Condition Alignment
Dewei Zhou
Mingwei Li
Zongxin Yang
Yu Lu
Yunqiu Xu
Zhizhong Wang
Zeyi Huang
Yi Yang
DiffM, EGVM
196
0
0
24 Nov 2025
Are Image-to-Video Models Good Zero-Shot Image Editors?
Zechuan Zhang
Zhenyuan Chen
Zongxin Yang
Yi Yang
DiffM, VGen
557
0
0
24 Nov 2025
Beyond Words and Pixels: A Benchmark for Implicit World Knowledge Reasoning in Generative Models
Tianyang Han
Junhao Su
J. Hu
Peizhen Yang
Hengyu Shi
Junfeng Luo
Jialin Gao
EGVM, VGen
480
0
0
23 Nov 2025
ConsistCompose: Unified Multimodal Layout Control for Image Composition
Xuanke Shi
B. Li
Xiaoyang Han
Zhongang Cai
Lei Yang
Dahua Lin
Quan-ding Wang
MLLM
385
0
0
23 Nov 2025
Zero-Shot Video Deraining with Video Diffusion Models
Tuomas Varanka
Juan Luis Gonzalez
Hyeongwoo Kim
Pablo Garrido
Xu Yao
DiffM, VGen
148
0
0
23 Nov 2025
CADTrack: Learning Contextual Aggregation with Deformable Alignment for Robust RGBT Tracking
Hao Li
Yuhao Wang
X. Hu
Wenning Hao
P. Zhang
D. Wang
Huchuan Lu
124
0
0
22 Nov 2025
Plan-X: Instruct Video Generation via Semantic Planning
Lun Huang
You Xie
Hongyi Xu
Tianpei Gu
Chenxu Zhang
Guoxian Song
Zenan Li
Xiaochen Zhao
Linjie Luo
Guillermo Sapiro
DiffM, VGen
93
0
0
22 Nov 2025
UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios
Tian Ye
Song Fei
Lei Zhu
92
0
0
22 Nov 2025
Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation
Chuancheng Shi
Shangze Li
Shiming Guo
Simiao Xie
Wenhua Wu
...
Canran Xiao
Cong Wang
Zifeng Cheng
Fei Shen
Tat-Seng Chua
VLM
225
0
0
21 Nov 2025
Align & Invert: Solving Inverse Problems with Diffusion and Flow-based Models via Representational Alignment
Loukas Sfountouris
Giannis Daras
Paris Giampouras
DiffM
106
0
0
21 Nov 2025
Energy Scaling Laws for Diffusion Models: Quantifying Compute and Carbon Emissions in Image Generation
Aniketh Iyengar
Jiaqi Han
Boris Ruf
Vincent Grari
Marcin Detyniecki
Stefano Ermon
DiffM
192
0
0
21 Nov 2025
Designing and Generating Diverse, Equitable Face Image Datasets for Face Verification Tasks
Georgia Baltsou
Ioannis Sarridis
C. Koutlis
Symeon Papadopoulos
160
0
0
21 Nov 2025
Diversity Has Always Been There in Your Visual Autoregressive Models
Tong Wang
Guanyu Yang
Nian Liu
Kai Wang
Yaxing Wang
Abdelrahman M. Shaker
Salman Khan
Fahad Shahbaz Khan
S. Li
136
0
0
21 Nov 2025
RoomPlanner: Explicit Layout Planner for Easier LLM-Driven 3D Room Generation
Wenzhuo Sun
Mingjian Liang
Wenxuan Song
Xuelian Cheng
Zongyuan Ge
3DV
222
0
0
21 Nov 2025
EvDiff: High Quality Video with an Event Camera
Weilun Li
Lei-huan Sun
Ruixi Gao
Qi Jiang
Yuqin Ma
Kaiwei Wang
M. Yang
Luc Van Gool
D. Paudel
DiffM, VGen
184
0
0
21 Nov 2025
Loomis Painter: Reconstructing the Painting Process
Markus Pobitzer
Chang Liu
Chenyi Zhuang
Teng Long
Bin Ren
Nicu Sebe
DiffM
235
0
0
21 Nov 2025
SPIDER: Spatial Image CorresponDence Estimator for Robust Calibration
Zhimin Shao
Abhay Kumar Yadav
Rama Chellappa
Cheng-Fang Peng
81
0
0
21 Nov 2025
Saving Foundation Flow-Matching Priors for Inverse Problems
Yuxiang Wan
Ryan Devera
Wenjie Zhang
Ju Sun
AI4CE
175
0
0
20 Nov 2025
TRIM: Scalable 3D Gaussian Diffusion Inference with Temporal and Spatial Trimming
Zeyuan Yin
Xiaoming Liu
3DGS
92
1
0
20 Nov 2025
Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers
Jian Ma
Qirong Peng
Xujie Zhu
Peixing Xie
Chen Chen
H. Lu
134
0
0
20 Nov 2025
Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual Generation
Ziyu Guo
Renrui Zhang
Hongyu Li
M. Zhang
Xinyan Chen
Sifan Wang
Yan Feng
Peng Pei
Pheng-Ann Heng
245
4
0
20 Nov 2025