ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.03206
  4. Cited By
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

5 March 2024
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
Harry Saini
Yam Levi
Dominik Lorenz
Axel Sauer
Frederic Boesel
Dustin Podell
Tim Dockhorn
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
    DiffM
ArXiv (abs)PDFHTMLHuggingFace (68 upvotes)

Papers citing "Scaling Rectified Flow Transformers for High-Resolution Image Synthesis"

50 / 1,247 papers shown
USB: A Comprehensive and Unified Safety Evaluation Benchmark for Multimodal Large Language Models
USB: A Comprehensive and Unified Safety Evaluation Benchmark for Multimodal Large Language Models
Baolin Zheng
Guanlin Chen
Hongqiong Zhong
Qingyang Teng
Yingshui Tan
...
Jincheng Wei
Yuchi Xu
Xiaoyong Zhu
Bo Zheng
Kaifu Zhang
ELM
167
4
0
26 May 2025
What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models
What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models
Lorenzo Baraldi
Davide Bucciarelli
Federico Betti
Marcella Cornia
Lorenzo Baraldi
Andrii Zadaianchuk
Rita Cucchiara
390
2
0
26 May 2025
Decision Flow Policy Optimization
Decision Flow Policy Optimization
Jifeng Hu
Sili Huang
Siyuan Guo
Zhaogeng Liu
Li Shen
Lichao Sun
Hechang Chen
Yi-Ju Chang
Dacheng Tao
333
0
0
26 May 2025
VisRet: Visualization Improves Knowledge-Intensive Text-to-Image Retrieval
VisRet: Visualization Improves Knowledge-Intensive Text-to-Image Retrieval
Di Wu
Yixin Wan
Kai-Wei Chang
309
1
0
26 May 2025
HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters
HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters
Yi Chen
Sen Liang
Zixiang Zhou
Ziyao Huang
Yifeng Ma
Junshu Tang
Qin Lin
Yuan Zhou
Qinglin Lu
VGen
304
29
0
26 May 2025
In-Context Brush: Zero-shot Customized Subject Insertion with Context-Aware Latent Space Manipulation
In-Context Brush: Zero-shot Customized Subject Insertion with Context-Aware Latent Space Manipulation
Yu Xu
Fan Tang
You Wu
Lin Gao
Oliver Deussen
Hongbin Yan
Jintao Li
Juan Cao
Tong-Yee Lee
DiffM
209
2
0
26 May 2025
FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities
FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities
Jin Wang
Yao Lai
Aoxue Li
Shifeng Zhang
Jiacheng Sun
Ning Kang
Chengyue Wu
Zhenguo Li
Ping Luo
396
19
0
26 May 2025
STRICT: Stress Test of Rendering Images Containing Text
STRICT: Stress Test of Rendering Images Containing Text
Tianyu Zhang
Xinyu Wang
Zhenghan Tai
Lu Li
Jijun Chi
Jingrui Tian
Hailin He
Suyuchen Wang
297
1
0
25 May 2025
CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design
CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design
H. Zhang
Dexiang Hong
Maoke Yang
Yutao Chen
Zhao Zhang
Jie Shao
Xinglong Wu
Zuxuan Wu
Yu Jiang
DiffMAI4CE
527
12
0
25 May 2025
Enhancing Text-to-Image Diffusion Transformer via Split-Text Conditioning
Enhancing Text-to-Image Diffusion Transformer via Split-Text Conditioning
Yu Zhang
Jialei Zhou
Xinchen Li
Tao Gui
Zhongwei Wan
Tianyu Wang
Duoqian Miao
Changwei Wang
LongBing Cao
DiffM
274
6
0
25 May 2025
Querying Kernel Methods Suffices for Reconstructing their Training Data
Querying Kernel Methods Suffices for Reconstructing their Training Data
Daniel Barzilai
Yuval Margalit
Eitan Gronich
Gilad Yehudai
Meirav Galun
Ronen Basri
217
0
0
25 May 2025
Training-free Stylized Text-to-Image Generation with Fast Inference
Training-free Stylized Text-to-Image Generation with Fast Inference
Xianzheng Ma
Yaohui Wang
Xinyuan Chen
Tien-Tsin Wong
C. L. P. Chen
1.2K
1
0
25 May 2025
Fast Kernel-Space Diffusion for Remote Sensing Pansharpening
Fast Kernel-Space Diffusion for Remote Sensing Pansharpening
Hancong Jin
Zihan Cao
Liangjian Deng
Jingjing Li
DiffM
392
0
0
25 May 2025
So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection
So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection
Zhenglin Huang
Tianxiao Li
Xiangtai Li
Haiquan Wen
Yiwei He
...
Hao Fei
Xi Yang
Xiaowei Huang
Bei Peng
Guangliang Cheng
706
6
0
24 May 2025
Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image Generation
Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image Generation
Wenchao Zhang
Jiahe Tian
Runze He
Jizhong Han
Jiao Dai
Miaomiao Feng
Wei Mi
Xiaodan Zhang
273
0
0
24 May 2025
Localizing Knowledge in Diffusion Transformers
Localizing Knowledge in Diffusion Transformers
Arman Zarei
Samyadeep Basu
Keivan Rezaei
Zihao Lin
Sayan Nag
Soheil Feizi
320
1
0
24 May 2025
OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks
OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks
Jiayu Wang
Yang Jiao
Yue Yu
Tianwen Qian
Shaoxiang Chen
Yue Yu
Yu Jiang
MLLMLM&MAELM
259
0
0
24 May 2025
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data
Yiren Song
Cheng Liu
Mike Zheng Shou
DiffM
411
10
0
24 May 2025
T2VUnlearning: A Concept Erasing Method for Text-to-Video Diffusion Models
T2VUnlearning: A Concept Erasing Method for Text-to-Video Diffusion Models
Xiaoyu Ye
Songjie Cheng
Yongtao Wang
Yajiao Xiong
Yishen Li
DiffM
427
3
0
23 May 2025
FLEX: A Backbone for Diffusion-Based Modeling of Spatio-temporal Physical Systems
FLEX: A Backbone for Diffusion-Based Modeling of Spatio-temporal Physical Systems
N. Benjamin Erichson
Vinicius Mikuni
Dongwei Lyu
Yang Gao
Omri Azencot
Soon Hoe Lim
Michael W. Mahoney
AI4CE
1.0K
6
0
23 May 2025
Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention
Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention
Shuang Wu
Youtian Lin
Feihu Zhang
Yifei Zeng
Yikang Yang
...
Jiachen Qian
Siyu Zhu
Xun Cao
Juil Sock
Yao Yao
3DGS
337
33
0
23 May 2025
Scaling Image and Video Generation via Test-Time Evolutionary Search
Haoran He
Jiajun Liang
X. Wang
Pengfei Wan
Di Zhang
Kun Gai
Ling Pan
DiffM
402
9
0
23 May 2025
Co-Reinforcement Learning for Unified Multimodal Understanding and Generation
Co-Reinforcement Learning for Unified Multimodal Understanding and Generation
Jingjing Jiang
Chongjie Si
Jun Luo
Hanwang Zhang
Chao Ma
780
5
0
23 May 2025
Diffusion Classifiers Understand Compositionality, but Conditions Apply
Diffusion Classifiers Understand Compositionality, but Conditions Apply
Yujin Jeong
Arnas Uselis
Seong Joon Oh
Anna Rohrbach
DiffMCoGe
1.3K
3
3
23 May 2025
InfLVG: Reinforce Inference-Time Consistent Long Video Generation with GRPO
Xueji Fang
Liyuan Ma
Zhiyang Chen
Mingyuan Zhou
Guo-Jun Qi
VGen
563
7
0
23 May 2025
ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback
ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback
Litao Guo
Xinli Xu
Luozhou Wang
Jiantao Lin
Jinsong Zhou
Zixin Zhang
Bolan Su
Ying-Cong Chen
LLMAGLRM
209
6
0
23 May 2025
Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO
Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO
Chengzhuo Tong
Ziyu Guo
Renrui Zhang
Wenyu Shan
Xinyu Wei
Zhenghao Xing
Jiaming Song
Pheng-Ann Heng
EGVMOffRLLRM
432
23
0
22 May 2025
Flow Matching based Sequential Recommender Model
Flow Matching based Sequential Recommender ModelInternational Joint Conference on Artificial Intelligence (IJCAI), 2025
Feng Liu
Lixin Zou
Xiangyu Zhao
Min Tang
Liming Dong
Dan Luo
Xiangyang Luo
Chenliang Li
DiffM
274
0
0
22 May 2025
DetailMaster: Can Your Text-to-Image Model Handle Long Prompts?
DetailMaster: Can Your Text-to-Image Model Handle Long Prompts?
Qirui Jiao
Daoyuan Chen
Yilun Huang
Xika Lin
Ying Shen
Yaliang Li
VLM
231
2
0
22 May 2025
Creatively Upscaling Images with Global-Regional Priors
Creatively Upscaling Images with Global-Regional PriorsInternational Journal of Computer Vision (IJCV), 2025
Yurui Qian
Qi Cai
Yingwei Pan
Ting Yao
Tao Mei
DiffM
383
0
0
22 May 2025
Training-Free Efficient Video Generation via Dynamic Token Carving
Training-Free Efficient Video Generation via Dynamic Token Carving
Yuechen Zhang
Jinbo Xing
Bin Xia
Shaoteng Liu
Bohao Peng
Xin Tao
Pengfei Wan
Eric Lo
Jiaya Jia
DiffMVGen
435
15
0
22 May 2025
Conditional Panoramic Image Generation via Masked Autoregressive Modeling
Conditional Panoramic Image Generation via Masked Autoregressive Modeling
Chaoyang Wang
Xiangtai Li
Lu Qi
X. Lin
Jinbin Bai
Qianyu Zhou
Yunhai Tong
DiffM
327
3
0
22 May 2025
dKV-Cache: The Cache for Diffusion Language Models
dKV-Cache: The Cache for Diffusion Language Models
Xinyin Ma
Runpeng Yu
Gongfan Fang
Xinchao Wang
DiffM
424
65
0
21 May 2025
My Face Is Mine, Not Yours: Facial Protection Against Diffusion Model Face Swapping
My Face Is Mine, Not Yours: Facial Protection Against Diffusion Model Face Swapping
Hon Ming Yam
Zhongliang Guo
Chun Pong Lau
DiffMAAML
239
2
0
21 May 2025
Angle Domain Guidance: Latent Diffusion Requires Rotation Rather Than Extrapolation
Angle Domain Guidance: Latent Diffusion Requires Rotation Rather Than Extrapolation
Cheng Jin
Zhenyu Xiao
Chutao Liu
Yuantao Gu
DiffM
213
4
0
21 May 2025
Harnessing Caption Detailness for Data-Efficient Text-to-Image Generation
Harnessing Caption Detailness for Data-Efficient Text-to-Image Generation
Xinran Wang
Muxi Diao
Yuanzhi Liu
Chunyu Wang
Kongming Liang
Zhanyu Ma
Jun Guo
301
1
0
21 May 2025
Riemannian Flow Matching for Brain Connectivity Matrices via Pullback Geometry
Riemannian Flow Matching for Brain Connectivity Matrices via Pullback Geometry
Antoine Collas
Ce Ju
Nicolas Salvy
Bertrand Thirion
218
2
0
20 May 2025
RoPECraft: Training-Free Motion Transfer with Trajectory-Guided RoPE Optimization on Diffusion Transformers
RoPECraft: Training-Free Motion Transfer with Trajectory-Guided RoPE Optimization on Diffusion Transformers
Ahmet Berke Gokmen
Yigit Ekin
Bahri Batuhan Bilecen
Aysegül Dündar
818
3
0
19 May 2025
PiT: Progressive Diffusion Transformer
PiT: Progressive Diffusion Transformer
Jiafu Wu
Yabiao Wang
Jian Li
Jinlong Peng
Yun Cao
Chengjie Wang
Jiangning Zhang
616
0
0
19 May 2025
PhySense: Sensor Placement Optimization for Accurate Physics Sensing
PhySense: Sensor Placement Optimization for Accurate Physics Sensing
Yuezhou Ma
Haixu Wu
Hang Zhou
Huikun Weng
Chao Guo
Mingsheng Long
DiffM
511
0
0
19 May 2025
Fine-tuning Quantized Neural Networks with Zeroth-order Optimization
Fine-tuning Quantized Neural Networks with Zeroth-order Optimization
Sifeng Shang
Jiayi Zhou
Chenyu Lin
Minxian Li
Kaiyang Zhou
MQ
354
1
0
19 May 2025
Synthetic History: Evaluating Visual Representations of the Past in Diffusion Models
Synthetic History: Evaluating Visual Representations of the Past in Diffusion Models
Maria-Teresa De Rosa Palmini
Eva Cetinic
263
0
0
18 May 2025
Is Artificial Intelligence Generated Image Detection a Solved Problem?
Is Artificial Intelligence Generated Image Detection a Solved Problem?
Wandi Qiao
Jiazhen Yan
Ziwen He
Kai Zeng
Weiwei Jiang
Lizhi Xiong
Zhangjie Fu
AAML
280
15
0
18 May 2025
Video-GPT via Next Clip Diffusion
Video-GPT via Next Clip Diffusion
Shaobin Zhuang
Zhipeng Huang
Ying Zhang
Fangyikang Wang
Canmiao Fu
Binxin Yang
Chong Sun
Chen Li
Yali Wang
DiffMVGen
629
5
0
18 May 2025
Towards Self-Improvement of Diffusion Models via Group Preference Optimization
Towards Self-Improvement of Diffusion Models via Group Preference Optimization
Renjie Chen
Wenfeng Lin
Yichen Zhang
Jiangchuan Wei
Boyuan Liu
Chao Feng
Jiao Ran
Mingyu Guo
327
3
0
16 May 2025
Attend to Not Attended: Structure-then-Detail Token Merging for Post-training DiT Acceleration
Attend to Not Attended: Structure-then-Detail Token Merging for Post-training DiT AccelerationComputer Vision and Pattern Recognition (CVPR), 2025
Haipeng Fang
Sheng Tang
Juan Cao
Enshuo Zhang
Fan Tang
Tong-Yee Lee
317
4
0
16 May 2025
DRAGON: A Large-Scale Dataset of Realistic Images Generated by Diffusion Models
DRAGON: A Large-Scale Dataset of Realistic Images Generated by Diffusion Models
Giulia Bertazzini
Daniele Baracchi
Dasara Shullani
Isao Echizen
Alessandro Piva
403
0
0
16 May 2025
DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling
DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling
Yuang Ai
Qihang Fan
Xuefeng Hu
Zhenheng Yang
Xiao-Yu Zhang
Huaibo Huang
DiffM
363
1
0
16 May 2025
PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment
PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment
Dingbang Huang
Wenbo Li
Yifei Zhao
Xinyu Pan
Yanhong Zeng
Bo Dai
Bo Dai
DiffM
305
9
0
16 May 2025
CompAlign: Improving Compositional Text-to-Image Generation with a Complex Benchmark and Fine-Grained Feedback
CompAlign: Improving Compositional Text-to-Image Generation with a Complex Benchmark and Fine-Grained Feedback
Yixin Wan
Kai-Wei Chang
EGVMCoGe
289
3
0
16 May 2025
Previous
123...141516...232425
Next
Page 15 of 25
Pageof 25