ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.03206
  4. Cited By
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

5 March 2024
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
Harry Saini
Yam Levi
Dominik Lorenz
Axel Sauer
Frederic Boesel
Dustin Podell
Tim Dockhorn
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
    DiffM
ArXiv (abs)PDFHTMLHuggingFace (68 upvotes)

Papers citing "Scaling Rectified Flow Transformers for High-Resolution Image Synthesis"

50 / 1,245 papers shown
LAMIC: Layout-Aware Multi-Image Composition via Scalability of Multimodal Diffusion Transformer
LAMIC: Layout-Aware Multi-Image Composition via Scalability of Multimodal Diffusion Transformer
Yuzhuo Chen
Zehua Ma
Jianhua Wang
Kai Kang
Shunyu Yao
Weiming Zhang
VLM
161
2
0
24 Dec 2025
SA-IQA: Redefining Image Quality Assessment for Spatial Aesthetics with Multi-Dimensional Rewards
SA-IQA: Redefining Image Quality Assessment for Spatial Aesthetics with Multi-Dimensional Rewards
Yuan Gao
Jin Song
EGVM
132
0
0
04 Dec 2025
Refaçade: Editing Object with Given Reference Texture
Refaçade: Editing Object with Given Reference Texture
Youze Huang
Penghui Ruan
Bojia Zi
Xianbiao Qi
Jianan Wang
Rong Xiao
DiffM
162
0
0
04 Dec 2025
VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory
VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory
Yifei Yu
Xiaoshan Wu
Xinting Hu
Tao Hu
Yangtian Sun
...
Bo Wang
Lin Ma
Yuewen Ma
Zhongrui Wang
Xiaojuan Qi
DiffMVGen
168
1
0
04 Dec 2025
Efficient Generative Transformer Operators For Million-Point PDEs
Efficient Generative Transformer Operators For Million-Point PDEs
Armand K. Koupai
Lise Le Boudec
Patrick Gallinari
54
0
0
04 Dec 2025
Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation
Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation
Hang Xu
Linjiang Huang
Feng Zhao
DiffM
115
0
0
03 Dec 2025
UniLight: A Unified Representation for Lighting
UniLight: A Unified Representation for Lighting
Zitian Zhang
Iliyan Georgiev
Michael Fischer
Yannick Hold-Geoffroy
Jean-François Lalonde
Valentin Deschaintre
59
0
0
03 Dec 2025
WeMMU: Enhanced Bridging of Vision-Language Models and Diffusion Models via Noisy Query Tokens
WeMMU: Enhanced Bridging of Vision-Language Models and Diffusion Models via Noisy Query Tokens
Jian Yang
Dacheng Yin
Xiaoxuan He
Y. Li
Fengyun Rao
Jing Lyu
Wei-dong Zhai
Yang Cao
Zheng-Jun Zha
VLM
232
0
0
02 Dec 2025
PGP-DiffSR: Phase-Guided Progressive Pruning for Efficient Diffusion-based Image Super-Resolution
PGP-DiffSR: Phase-Guided Progressive Pruning for Efficient Diffusion-based Image Super-Resolution
Zhongbao Yang
Jiangxin Dong
Yazhou Yao
Jinhui Tang
Jinshan Pan
165
0
0
02 Dec 2025
Taming Camera-Controlled Video Generation with Verifiable Geometry Reward
Taming Camera-Controlled Video Generation with Verifiable Geometry Reward
Zhaoqing Wang
Xiaobo Xia
Zhuolin Bie
Jinlin Liu
Dongdong Yu
Jia-Wang Bian
Changhu Wang
EGVMVGen
153
0
0
02 Dec 2025
YingVideo-MV: Music-Driven Multi-Stage Video Generation
YingVideo-MV: Music-Driven Multi-Stage Video Generation
Jiahui Chen
Weida Wang
Runhua Shi
Huan Yang
Chaofan Ding
Zihao Chen
DiffMVGen
229
0
0
02 Dec 2025
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework
Qinghe Wang
Xiaoyu Shi
Baolu Li
Weikang Bian
Quande Liu
Huchuan Lu
Xintao Wang
Pengfei Wan
Kun Gai
Xu Jia
VGen
202
1
0
02 Dec 2025
FineGRAIN: Evaluating Failure Modes of Text-to-Image Models with Vision Language Model Judges
FineGRAIN: Evaluating Failure Modes of Text-to-Image Models with Vision Language Model Judges
Kevin David Hayes
Micah Goldblum
Vikash Sehwag
Gowthami Somepalli
Ashwinee Panda
Tom Goldstein
MLLMEGVM
240
0
0
01 Dec 2025
DreamingComics: A Story Visualization Pipeline via Subject and Layout Customized Generation using Video Models
Patrick Kwon
Chen Chen
DiffMAI4TSVGen
145
0
0
01 Dec 2025
Spatiotemporal Pyramid Flow Matching for Climate Emulation
Spatiotemporal Pyramid Flow Matching for Climate Emulation
Jeremy Irvin
Jiaqi Han
Z. Wang
Abdulaziz Alharbi
Yufei Zhao
Nomin-Erdene Bayarsaikhan
Daniele Visioni
A. Ng
Duncan Watson-Parris
AI4TS
84
0
0
01 Dec 2025
Reversible Inversion for Training-Free Exemplar-guided Image Editing
Yuke Li
Lianli Gao
Ji Zhang
Pengpeng Zeng
Lichuan Xiang
Hongkai Wen
Heng Tao Shen
Jingkuan Song
DiffM
124
0
0
01 Dec 2025
FreqEdit: Preserving High-Frequency Features for Robust Multi-Turn Image Editing
Yucheng Liao
Jiajun Liang
Kaiqian Cui
Baoquan Zhao
Haoran Xie
Wei Liu
Qing Li
Xudong Mao
124
0
0
01 Dec 2025
Generative Video Motion Editing with 3D Point Tracks
Yao-Chih Lee
Zhoutong Zhang
Jiahui Huang
Jui-Hsien Wang
Joon-Young Lee
Jia-Bin Huang
Eli Shechtman
Zhengqi Li
DiffMVGen3DPC
259
0
0
01 Dec 2025
FRAMER: Frequency-Aligned Self-Distillation with Adaptive Modulation Leveraging Diffusion Priors for Real-World Image Super-Resolution
Seungho Choi
Jeahun Sung
Jihyong Oh
DiffM
152
0
0
01 Dec 2025
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models
Zhiheng Liu
Weiming Ren
Haozhe Liu
Zijian Zhou
S. Chen
...
Ping Luo
Wei Liu
Tao Xiang
Jonas Schult
Yuren Cong
152
0
0
01 Dec 2025
ResDiT: Evoking the Intrinsic Resolution Scalability in Diffusion Transformers
ResDiT: Evoking the Intrinsic Resolution Scalability in Diffusion Transformers
Yiyang Ma
Feng Zhou
Xuedan Yin
Pu Cao
Yonghao Dang
Jianqin Yin
88
0
0
01 Dec 2025
Dynamic-eDiTor: Training-Free Text-Driven 4D Scene Editing with Multimodal Diffusion Transformer
Dong In Lee
Hyungjun Doh
Seunggeun Chi
Runlin Duan
Sangpil Kim
K. Ramani
DiffM3DGSVGen
137
0
0
30 Nov 2025
Multi-GRPO: Multi-Group Advantage Estimation for Text-to-Image Generation with Tree-Based Trajectories and Multiple Rewards
Multi-GRPO: Multi-Group Advantage Estimation for Text-to-Image Generation with Tree-Based Trajectories and Multiple Rewards
Qiang Lyu
Z. Chen
C. Wang
Haolin Shi
Shibo Gao
...
Jianlou Si
Fei Ding
Jing Li
Chun Pong Lau
Weiqiang Wang
EGVM
121
0
0
30 Nov 2025
Assimilation Matters: Model-level Backdoor Detection in Vision-Language Pretrained Models
Assimilation Matters: Model-level Backdoor Detection in Vision-Language Pretrained Models
Z. Wang
Jie M. Zhang
Shiguang Shan
Xilin Chen
AAML
352
0
0
29 Nov 2025
RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards
RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards
Junyan Ye
Leiqi Zhu
Yuncheng Guo
Dongzhi Jiang
Zilong Huang
Yifan Zhang
Zhiyuan Yan
Haohuan Fu
Conghui He
Weijia Li
EGVM
112
0
0
29 Nov 2025
REVEAL: Reasoning-enhanced Forensic Evidence Analysis for Explainable AI-generated Image Detection
REVEAL: Reasoning-enhanced Forensic Evidence Analysis for Explainable AI-generated Image Detection
Huangsen Cao
Qin Mei
Zhiheng Li
Yuxi Li
Ying Zhang
...
Zhimeng Zhang
Xin Ding
Yongwei Wang
Jing Lyu
Fei Wu
127
0
0
28 Nov 2025
GOATex: Geometry & Occlusion-Aware Texturing
GOATex: Geometry & Occlusion-Aware Texturing
Hyunjin Kim
Kunho Kim
Adam Lee
Wonkwang Lee
DiffM
96
0
0
28 Nov 2025
One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer
One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer
S. Shi
Jing Xu
Zhihang Li
Chunli Peng
Xiaoda Yang
Lijing Lu
Kai Hu
Jiangning Zhang
DiffM
118
0
0
28 Nov 2025
Vision Bridge Transformer at Scale
Vision Bridge Transformer at Scale
Zhenxiong Tan
Zeqing Wang
Xingyi Yang
Songhua Liu
Xinchao Wang
DiffM
96
0
0
28 Nov 2025
VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction
VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction
Sinan Du
Jiahao Guo
Bo Li
Shuhao Cui
Zhengzhuo Xu
...
Yongxian Wei
Kun Gai
X. Wang
Kai Wu
C. Yuan
213
0
0
28 Nov 2025
Ovis-Image Technical Report
Ovis-Image Technical Report
Guo-Hua Wang
Liangfu Cao
Tianyu Cui
Minghao Fu
Xiaohao Chen
...
Jianshan Zhao
Lan Li
Bowen Fu
Jiaqi Liu
Qing-Guo Chen
VLM
528
0
0
28 Nov 2025
Visual Generation Tuning
Visual Generation Tuning
Jiahao Guo
Sinan Du
J. Yao
Wenyu Liu
Bo Li
Haoxiang Cao
Kun Gai
C. Yuan
Kai Wu
Xinggang Wang
VLM
301
0
0
28 Nov 2025
Guiding Visual Autoregressive Models through Spectrum Weakening
Guiding Visual Autoregressive Models through Spectrum Weakening
Chaoyang Wang
Tianmeng Yang
Jingdong Wang
Yunhai Tong
DiffM
167
0
0
28 Nov 2025
Semantic Anchoring for Robust Personalization in Text-to-Image Diffusion Models
Semantic Anchoring for Robust Personalization in Text-to-Image Diffusion Models
Seoyun Yang
Gihoon Kim
Taesup Kim
80
0
0
27 Nov 2025
StreamFlow: Theory, Algorithm, and Implementation for High-Efficiency Rectified Flow Generation
StreamFlow: Theory, Algorithm, and Implementation for High-Efficiency Rectified Flow Generation
Sen Fang
Hongbin Zhong
Yalin Feng
Dimitris N. Metaxas
Dimitris N. Metaxas
154
1
0
27 Nov 2025
Ar2Can: An Architect and an Artist Leveraging a Canvas for Multi-Human Generation
Ar2Can: An Architect and an Artist Leveraging a Canvas for Multi-Human Generation
Shubhankar Borse
Phuc Pham
Farzad Farhadzadeh
Seokeon Choi
P. Nguyen
Anh Tran
Sungrack Yun
Munawar Hayat
Fatih Porikli
76
0
0
27 Nov 2025
PROMPTMINER: Black-Box Prompt Stealing against Text-to-Image Generative Models via Reinforcement Learning and Fuzz Optimization
PROMPTMINER: Black-Box Prompt Stealing against Text-to-Image Generative Models via Reinforcement Learning and Fuzz Optimization
Mingzhe Li
Renhao Zhang
Zhiyang Wen
Siqi Pan
Bruno Castro da Silva
Juan Zhai
Shiqing Ma
60
0
0
27 Nov 2025
Fast3Dcache: Training-free 3D Geometry Synthesis Acceleration
Fast3Dcache: Training-free 3D Geometry Synthesis Acceleration
M. Yang
Yanming Yang
Chenyi Xu
Chenxi Song
Yufan Zuo
Tong Zhao
Ruibo Li
Chi Zhang
DiffM
128
0
0
27 Nov 2025
Generative Anchored Fields: Controlled Data Generation via Emergent Velocity Fields and Transport Algebra
Generative Anchored Fields: Controlled Data Generation via Emergent Velocity Fields and Transport Algebra
Deressa Wodajo Deressa
Hannes Mareen
Peter Lambert
Glenn Van Wallendael
64
0
0
27 Nov 2025
Adversarial Flow Models
Adversarial Flow Models
Shanchuan Lin
Ceyuan Yang
Zhijie Lin
Hao Chen
Haoqi Fan
GAN
144
0
0
27 Nov 2025
Designing Instance-Level Sampling Schedules via REINFORCE with James-Stein Shrinkage
Designing Instance-Level Sampling Schedules via REINFORCE with James-Stein Shrinkage
Peiyu Yu
Suraj Kothawade
Sirui Xie
Ying Nian Wu
Hongliang Fei
113
0
0
27 Nov 2025
Progress by Pieces: Test-Time Scaling for Autoregressive Image Generation
Progress by Pieces: Test-Time Scaling for Autoregressive Image Generation
Joonhyung Park
Hyeongwon Jang
Joowon Kim
Eunho Yang
VLM
144
0
0
26 Nov 2025
Deep Parameter Interpolation for Scalar Conditioning
Deep Parameter Interpolation for Scalar Conditioning
Chicago Y. Park
Michael T. McCann
Cristina Garcia-Cardona
B. Wohlberg
Ulugbek S. Kamilov
AI4CE
277
0
0
26 Nov 2025
MUSE: Manipulating Unified Framework for Synthesizing Emotions in Images via Test-Time Optimization
MUSE: Manipulating Unified Framework for Synthesizing Emotions in Images via Test-Time Optimization
Yingjie Xia
X. Wang
Jinglei Shi
Vicky Kalogeiton
Jian Yang
EGVMVGen
542
0
0
26 Nov 2025
3MDiT: Unified Tri-Modal Diffusion Transformer for Text-Driven Synchronized Audio-Video Generation
3MDiT: Unified Tri-Modal Diffusion Transformer for Text-Driven Synchronized Audio-Video Generation
Y. Li
Heyu Si
Federico Landi
Pilar Oplustil Gallegos
Ioannis Koutsoumpas
...
Ruiju Fu
Qi Guo
Xin Jin
Shunyu Liu
Mingli Song
DiffMVGen
192
0
0
26 Nov 2025
Inversion-Free Style Transfer with Dual Rectified Flows
Inversion-Free Style Transfer with Dual Rectified Flows
Yingying Deng
Xiangyu He
Fan Tang
Weiming Dong
Xucheng Yin
DiffM
245
0
0
26 Nov 2025
FlowerDance: MeanFlow for Efficient and Refined 3D Dance Generation
FlowerDance: MeanFlow for Efficient and Refined 3D Dance Generation
Kaixing Yang
Xulong Tang
Ziqiao Peng
X. Zhang
Puwei Wang
Jun He
Hongyan Liu
188
1
0
26 Nov 2025
MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices
MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices
Shuai Zhang
Bao Tang
Siyuan Yu
Yueting Zhu
Jingfeng Yao
Ya Zou
Shanglin Yuan
Li Yu
Wenyu Liu
Xinggang Wang
DiffMVGen
201
0
0
26 Nov 2025
CaliTex: Geometry-Calibrated Attention for View-Coherent 3D Texture Generation
CaliTex: Geometry-Calibrated Attention for View-Coherent 3D Texture Generation
Chenyu Liu
Hongze Chen
Jingzhi Bao
Lingting Zhu
Runze Zhang
Weikai Chen
Zeyu Hu
Yingda Yin
Keyang Luo
Xin Wang
DiffM
244
0
0
26 Nov 2025
Training-Free Generation of Diverse and High-Fidelity Images via Prompt Semantic Space Optimization
Training-Free Generation of Diverse and High-Fidelity Images via Prompt Semantic Space Optimization
Debin Meng
Chen Jin
Zheng Gao
Yanran Li
Ioannis Patras
Georgios Tzimiropoulos
DiffM
264
0
0
25 Nov 2025
1234...232425
Next