ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.03206
  4. Cited By
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

5 March 2024
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
Harry Saini
Yam Levi
Dominik Lorenz
Axel Sauer
Frederic Boesel
Dustin Podell
Tim Dockhorn
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
    DiffM
ArXiv (abs)PDFHTMLHuggingFace (68 upvotes)

Papers citing "Scaling Rectified Flow Transformers for High-Resolution Image Synthesis"

50 / 1,247 papers shown
LAMIC: Layout-Aware Multi-Image Composition via Scalability of Multimodal Diffusion Transformer
LAMIC: Layout-Aware Multi-Image Composition via Scalability of Multimodal Diffusion Transformer
Yuzhuo Chen
Zehua Ma
Jianhua Wang
Kai Kang
Shunyu Yao
Weiming Zhang
VLM
166
2
0
24 Dec 2025
Efficient Generative Transformer Operators For Million-Point PDEs
Efficient Generative Transformer Operators For Million-Point PDEs
Armand K. Koupai
Lise Le Boudec
Patrick Gallinari
61
0
0
04 Dec 2025
VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory
VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory
Yifei Yu
Xiaoshan Wu
Xinting Hu
Tao Hu
Yangtian Sun
...
Bo Wang
Lin Ma
Yuewen Ma
Zhongrui Wang
Xiaojuan Qi
DiffMVGen
173
1
0
04 Dec 2025
SA-IQA: Redefining Image Quality Assessment for Spatial Aesthetics with Multi-Dimensional Rewards
SA-IQA: Redefining Image Quality Assessment for Spatial Aesthetics with Multi-Dimensional Rewards
Yuan Gao
Jin Song
EGVM
134
0
0
04 Dec 2025
Refaçade: Editing Object with Given Reference Texture
Refaçade: Editing Object with Given Reference Texture
Youze Huang
Penghui Ruan
Bojia Zi
Xianbiao Qi
Jianan Wang
Rong Xiao
DiffM
172
0
0
04 Dec 2025
Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation
Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation
Hang Xu
Linjiang Huang
Feng Zhao
DiffM
120
0
0
03 Dec 2025
UniLight: A Unified Representation for Lighting
UniLight: A Unified Representation for Lighting
Zitian Zhang
Iliyan Georgiev
Michael Fischer
Yannick Hold-Geoffroy
Jean-François Lalonde
Valentin Deschaintre
60
0
0
03 Dec 2025
WeMMU: Enhanced Bridging of Vision-Language Models and Diffusion Models via Noisy Query Tokens
WeMMU: Enhanced Bridging of Vision-Language Models and Diffusion Models via Noisy Query Tokens
Jian Yang
Dacheng Yin
Xiaoxuan He
Y. Li
Fengyun Rao
Jing Lyu
Wei-dong Zhai
Yang Cao
Zheng-Jun Zha
VLM
239
0
0
02 Dec 2025
YingVideo-MV: Music-Driven Multi-Stage Video Generation
YingVideo-MV: Music-Driven Multi-Stage Video Generation
Jiahui Chen
Weida Wang
Runhua Shi
Huan Yang
Chaofan Ding
Zihao Chen
DiffMVGen
237
0
0
02 Dec 2025
Taming Camera-Controlled Video Generation with Verifiable Geometry Reward
Taming Camera-Controlled Video Generation with Verifiable Geometry Reward
Zhaoqing Wang
Xiaobo Xia
Zhuolin Bie
Jinlin Liu
Dongdong Yu
Jia-Wang Bian
Changhu Wang
EGVMVGen
154
0
0
02 Dec 2025
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework
Qinghe Wang
Xiaoyu Shi
Baolu Li
Weikang Bian
Quande Liu
Huchuan Lu
Xintao Wang
Pengfei Wan
Kun Gai
Xu Jia
VGen
208
2
0
02 Dec 2025
PGP-DiffSR: Phase-Guided Progressive Pruning for Efficient Diffusion-based Image Super-Resolution
PGP-DiffSR: Phase-Guided Progressive Pruning for Efficient Diffusion-based Image Super-Resolution
Zhongbao Yang
Jiangxin Dong
Yazhou Yao
Jinhui Tang
Jinshan Pan
166
0
0
02 Dec 2025
Hear What Matters! Text-conditioned Selective Video-to-Audio Generation
Hear What Matters! Text-conditioned Selective Video-to-Audio Generation
Junwon Lee
Juhan Nam
Jiyoung Lee
DiffMVGen
109
0
0
02 Dec 2025
Spatiotemporal Pyramid Flow Matching for Climate Emulation
Spatiotemporal Pyramid Flow Matching for Climate Emulation
Jeremy Irvin
Jiaqi Han
Z. Wang
Abdulaziz Alharbi
Yufei Zhao
Nomin-Erdene Bayarsaikhan
Daniele Visioni
A. Ng
Duncan Watson-Parris
AI4TS
85
0
0
01 Dec 2025
DreamingComics: A Story Visualization Pipeline via Subject and Layout Customized Generation using Video Models
Patrick Kwon
Chen Chen
DiffMAI4TSVGen
147
0
0
01 Dec 2025
FineGRAIN: Evaluating Failure Modes of Text-to-Image Models with Vision Language Model Judges
FineGRAIN: Evaluating Failure Modes of Text-to-Image Models with Vision Language Model Judges
Kevin David Hayes
Micah Goldblum
Vikash Sehwag
Gowthami Somepalli
Ashwinee Panda
Tom Goldstein
MLLMEGVM
240
0
0
01 Dec 2025
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models
Zhiheng Liu
Weiming Ren
Haozhe Liu
Zijian Zhou
S. Chen
...
Ping Luo
Wei Liu
Tao Xiang
Jonas Schult
Yuren Cong
155
0
0
01 Dec 2025
Reversible Inversion for Training-Free Exemplar-guided Image Editing
Yuke Li
Lianli Gao
Ji Zhang
Pengpeng Zeng
Lichuan Xiang
Hongkai Wen
Heng Tao Shen
Jingkuan Song
DiffM
129
0
0
01 Dec 2025
Generative Video Motion Editing with 3D Point Tracks
Yao-Chih Lee
Zhoutong Zhang
Jiahui Huang
Jui-Hsien Wang
Joon-Young Lee
Jia-Bin Huang
Eli Shechtman
Zhengqi Li
DiffMVGen3DPC
262
0
0
01 Dec 2025
FreqEdit: Preserving High-Frequency Features for Robust Multi-Turn Image Editing
Yucheng Liao
Jiajun Liang
Kaiqian Cui
Baoquan Zhao
Haoran Xie
Wei Liu
Qing Li
Xudong Mao
126
0
0
01 Dec 2025
FRAMER: Frequency-Aligned Self-Distillation with Adaptive Modulation Leveraging Diffusion Priors for Real-World Image Super-Resolution
Seungho Choi
Jeahun Sung
Jihyong Oh
DiffM
157
0
0
01 Dec 2025
ResDiT: Evoking the Intrinsic Resolution Scalability in Diffusion Transformers
ResDiT: Evoking the Intrinsic Resolution Scalability in Diffusion Transformers
Yiyang Ma
Feng Zhou
Xuedan Yin
Pu Cao
Yonghao Dang
Jianqin Yin
95
0
0
01 Dec 2025
Dynamic-eDiTor: Training-Free Text-Driven 4D Scene Editing with Multimodal Diffusion Transformer
Dong In Lee
Hyungjun Doh
Seunggeun Chi
Runlin Duan
Sangpil Kim
K. Ramani
DiffM3DGSVGen
145
0
0
30 Nov 2025
Multi-GRPO: Multi-Group Advantage Estimation for Text-to-Image Generation with Tree-Based Trajectories and Multiple Rewards
Multi-GRPO: Multi-Group Advantage Estimation for Text-to-Image Generation with Tree-Based Trajectories and Multiple Rewards
Qiang Lyu
Z. Chen
C. Wang
Haolin Shi
Shibo Gao
...
Jianlou Si
Fei Ding
Jing Li
Chun Pong Lau
Weiqiang Wang
EGVM
128
1
0
30 Nov 2025
Assimilation Matters: Model-level Backdoor Detection in Vision-Language Pretrained Models
Assimilation Matters: Model-level Backdoor Detection in Vision-Language Pretrained Models
Z. Wang
Jie M. Zhang
Shiguang Shan
Xilin Chen
AAML
372
0
0
29 Nov 2025
SAIDO: Generalizable Detection of AI-Generated Images via Scene-Aware and Importance-Guided Dynamic Optimization in Continual Learning
Yongkang Hu
Yu Cheng
Y. Zhang
Yuan Xie
Zhaoxia Yin
88
0
0
29 Nov 2025
RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards
RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards
Junyan Ye
Leiqi Zhu
Yuncheng Guo
Dongzhi Jiang
Zilong Huang
Yifan Zhang
Zhiyuan Yan
Haohuan Fu
Conghui He
Weijia Li
EGVM
116
0
0
29 Nov 2025
Guiding Visual Autoregressive Models through Spectrum Weakening
Guiding Visual Autoregressive Models through Spectrum Weakening
Chaoyang Wang
Tianmeng Yang
Jingdong Wang
Yunhai Tong
DiffM
168
0
0
28 Nov 2025
Visual Generation Tuning
Visual Generation Tuning
Jiahao Guo
Sinan Du
J. Yao
Wenyu Liu
Bo Li
Haoxiang Cao
Kun Gai
C. Yuan
Kai Wu
Xinggang Wang
VLM
302
0
0
28 Nov 2025
VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction
VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction
Sinan Du
Jiahao Guo
Bo Li
Shuhao Cui
Zhengzhuo Xu
...
Yongxian Wei
Kun Gai
X. Wang
Kai Wu
C. Yuan
213
0
0
28 Nov 2025
One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer
One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer
S. Shi
Jing Xu
Zhihang Li
Chunli Peng
Xiaoda Yang
Lijing Lu
Kai Hu
Jiangning Zhang
DiffM
119
0
0
28 Nov 2025
Vision Bridge Transformer at Scale
Vision Bridge Transformer at Scale
Zhenxiong Tan
Zeqing Wang
Xingyi Yang
Songhua Liu
Xinchao Wang
DiffM
100
0
0
28 Nov 2025
REVEAL: Reasoning-enhanced Forensic Evidence Analysis for Explainable AI-generated Image Detection
REVEAL: Reasoning-enhanced Forensic Evidence Analysis for Explainable AI-generated Image Detection
Huangsen Cao
Qin Mei
Zhiheng Li
Yuxi Li
Ying Zhang
...
Zhimeng Zhang
Xin Ding
Yongwei Wang
Jing Lyu
Fei Wu
131
0
0
28 Nov 2025
Ovis-Image Technical Report
Ovis-Image Technical Report
Guo-Hua Wang
Liangfu Cao
Tianyu Cui
Minghao Fu
Xiaohao Chen
...
Jianshan Zhao
Lan Li
Bowen Fu
Jiaqi Liu
Qing-Guo Chen
VLM
531
0
0
28 Nov 2025
GOATex: Geometry & Occlusion-Aware Texturing
GOATex: Geometry & Occlusion-Aware Texturing
Hyunjin Kim
Kunho Kim
Adam Lee
Wonkwang Lee
DiffM
101
0
0
28 Nov 2025
Semantic Anchoring for Robust Personalization in Text-to-Image Diffusion Models
Semantic Anchoring for Robust Personalization in Text-to-Image Diffusion Models
Seoyun Yang
Gihoon Kim
Taesup Kim
81
0
0
27 Nov 2025
Fast3Dcache: Training-free 3D Geometry Synthesis Acceleration
Fast3Dcache: Training-free 3D Geometry Synthesis Acceleration
M. Yang
Yanming Yang
Chenyi Xu
Chenxi Song
Yufan Zuo
Tong Zhao
Ruibo Li
Chi Zhang
DiffM
129
0
0
27 Nov 2025
Generative Anchored Fields: Controlled Data Generation via Emergent Velocity Fields and Transport Algebra
Generative Anchored Fields: Controlled Data Generation via Emergent Velocity Fields and Transport Algebra
Deressa Wodajo Deressa
Hannes Mareen
Peter Lambert
Glenn Van Wallendael
64
0
0
27 Nov 2025
Designing Instance-Level Sampling Schedules via REINFORCE with James-Stein Shrinkage
Designing Instance-Level Sampling Schedules via REINFORCE with James-Stein Shrinkage
Peiyu Yu
Suraj Kothawade
Sirui Xie
Ying Nian Wu
Hongliang Fei
114
0
0
27 Nov 2025
PROMPTMINER: Black-Box Prompt Stealing against Text-to-Image Generative Models via Reinforcement Learning and Fuzz Optimization
PROMPTMINER: Black-Box Prompt Stealing against Text-to-Image Generative Models via Reinforcement Learning and Fuzz Optimization
Mingzhe Li
Renhao Zhang
Zhiyang Wen
Siqi Pan
Bruno Castro da Silva
Juan Zhai
Shiqing Ma
65
0
0
27 Nov 2025
StreamFlow: Theory, Algorithm, and Implementation for High-Efficiency Rectified Flow Generation
StreamFlow: Theory, Algorithm, and Implementation for High-Efficiency Rectified Flow Generation
Sen Fang
Hongbin Zhong
Yalin Feng
Dimitris N. Metaxas
Dimitris N. Metaxas
154
1
0
27 Nov 2025
Ar2Can: An Architect and an Artist Leveraging a Canvas for Multi-Human Generation
Ar2Can: An Architect and an Artist Leveraging a Canvas for Multi-Human Generation
Shubhankar Borse
Phuc Pham
Farzad Farhadzadeh
Seokeon Choi
P. Nguyen
Anh Tran
Sungrack Yun
Munawar Hayat
Fatih Porikli
78
0
0
27 Nov 2025
Adversarial Flow Models
Adversarial Flow Models
Shanchuan Lin
Ceyuan Yang
Zhijie Lin
Hao Chen
Haoqi Fan
GAN
149
0
0
27 Nov 2025
Inversion-Free Style Transfer with Dual Rectified Flows
Inversion-Free Style Transfer with Dual Rectified Flows
Yingying Deng
Xiangyu He
Fan Tang
Weiming Dong
Xucheng Yin
DiffM
245
0
0
26 Nov 2025
Progress by Pieces: Test-Time Scaling for Autoregressive Image Generation
Progress by Pieces: Test-Time Scaling for Autoregressive Image Generation
Joonhyung Park
Hyeongwon Jang
Joowon Kim
Eunho Yang
VLM
156
0
0
26 Nov 2025
MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices
MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices
Shuai Zhang
Bao Tang
Siyuan Yu
Yueting Zhu
Jingfeng Yao
Ya Zou
Shanglin Yuan
Li Yu
Wenyu Liu
Xinggang Wang
DiffMVGen
204
0
0
26 Nov 2025
MUSE: Manipulating Unified Framework for Synthesizing Emotions in Images via Test-Time Optimization
MUSE: Manipulating Unified Framework for Synthesizing Emotions in Images via Test-Time Optimization
Yingjie Xia
X. Wang
Jinglei Shi
Vicky Kalogeiton
Jian Yang
EGVMVGen
546
0
0
26 Nov 2025
3MDiT: Unified Tri-Modal Diffusion Transformer for Text-Driven Synchronized Audio-Video Generation
3MDiT: Unified Tri-Modal Diffusion Transformer for Text-Driven Synchronized Audio-Video Generation
Y. Li
Heyu Si
Federico Landi
Pilar Oplustil Gallegos
Ioannis Koutsoumpas
...
Ruiju Fu
Qi Guo
Xin Jin
Shunyu Liu
Mingli Song
DiffMVGen
192
0
0
26 Nov 2025
Deep Parameter Interpolation for Scalar Conditioning
Deep Parameter Interpolation for Scalar Conditioning
Chicago Y. Park
Michael T. McCann
Cristina Garcia-Cardona
B. Wohlberg
Ulugbek S. Kamilov
AI4CE
277
0
0
26 Nov 2025
FlowerDance: MeanFlow for Efficient and Refined 3D Dance Generation
FlowerDance: MeanFlow for Efficient and Refined 3D Dance Generation
Kaixing Yang
Xulong Tang
Ziqiao Peng
X. Zhang
Puwei Wang
Jun He
Hongyan Liu
194
1
0
26 Nov 2025
1234...232425
Next