arXiv: 2403.03206
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

5 March 2024
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
Harry Saini
Yam Levi
Dominik Lorenz
Axel Sauer
Frederic Boesel
Dustin Podell
Tim Dockhorn
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
    DiffM
ArXiv (abs) | PDF | HTML | HuggingFace (68 upvotes)

Papers citing "Scaling Rectified Flow Transformers for High-Resolution Image Synthesis"

50 / 1,247 papers shown
QR-LoRA: Efficient and Disentangled Fine-tuning via QR Decomposition for Customized Generation
Jiahui Yang
Yongjia Ma
Donglin Di
Hao Li
Wei Chen
Yan Xie
Jianxun Cui
Xun Yang
W. Zuo
MoMe
291
1
0
07 Jul 2025
Effort-Optimized, Accuracy-Driven Labelling and Validation of Test Inputs for DL Systems: A Mixed-Integer Linear Programming Approach
Mohammad Hossein Amini
M. Sabetzadeh
S. Nejati
VLM
184
0
0
07 Jul 2025
ICAS: Detecting Training Data from Autoregressive Image Generative Models
Hongyao Yu
Yixiang Qiu
Y. Yang
Hao Fang
Tianqu Zhuang
Jiaxin Hong
Bin Chen
Hao Wu
Shu-Tao Xia
136
5
0
07 Jul 2025
LACONIC: A 3D Layout Adapter for Controllable Image Creation
Léopold Maillard
Tom Durand
Adrien Ramanana Rahary
Maks Ovsjanikov
DiffM
209
0
0
04 Jul 2025
MoDA: Multi-modal Diffusion Architecture for Talking Head Generation
Xinyang Li
Gen Li
Zhihui Lin
Yichen Qian
Gongxin Yao
Weinan Jia
Aowen Wang
Weihua Chen
Fan Wang
DiffM, VGen
283
0
0
04 Jul 2025
Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation
François Rozet
Ruben Ohana
Michael McCabe
Gilles Louppe
F. Lanusse
S. Ho
DiffM
250
7
0
03 Jul 2025
RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation
Liheng Zhang
Lexi Pang
Hang Ye
Xiaoxuan Ma
Yizhou Wang
DiffM
331
0
0
03 Jul 2025
Visual Contextual Attack: Jailbreaking MLLMs with Image-Driven Context Injection
Ziqi Miao
Yi Ding
Lijun Li
Jing Shao
AAML
279
8
0
03 Jul 2025
IC-Custom: Diverse Image Customization via In-Context Learning
Yaowei Li
Xiaoyu Li
Zhaoyang Zhang
Yuxuan Bian
Gan Liu
...
Lingen Li
Jing Cai
Y. Zou
Yancheng He
Mingyu Ding
185
2
0
02 Jul 2025
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models
Gaojie Lin
Jianwen Jiang
Jiaqi Yang
Zerong Zheng
Chao Liang
DiffM, VGen
1.3K
86
0
01 Jul 2025
Parameter-aware high-fidelity microstructure generation using stable diffusion
Advanced Engineering Informatics (AEI), 2025
Hoang Cuong Phan
Minh Tien Tran
Chihun Lee
Hoheok Kim
Sehyeok Oh
Dong-Kyu Kim
Ho Won Lee
DiffM
142
0
0
01 Jul 2025
Seedance 1.0: Exploring the Boundaries of Video Generation Models
Yu Gao
Haoyuan Guo
Tuyen Hoang
Weilin Huang
Lu Jiang
...
Yang Zhao
Xiaozheng Zheng
Peihao Zhu
Jiaxin Zou
Feilong Zuo
DiffM, VGen, VLM
246
104
0
01 Jul 2025
Towards foundational LiDAR world models with efficient latent flow matching
Tianran Liu
Shengwen Zhao
Nicholas Rhinehart
226
4
0
30 Jun 2025
ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing
Huadai Liu
Kaicheng Luo
Jialei Wang
Wen Wang
Qian Chen
Zhou Zhao
Wei Xue
VGen, LRM
441
16
0
26 Jun 2025
BitMark: Watermarking Bitwise Autoregressive Image Generative Models
Louis Kerner
Michel Meintz
Bihe Zhao
Franziska Boenisch
Adam Dziedzic
WIGM
474
1
0
26 Jun 2025
TADA: Improved Diffusion Sampling with Training-free Augmented Dynamics
Tianrong Chen
Huangjie Zheng
David Berthelot
Jiatao Gu
J. Susskind
Shuangfei Zhai
DiffM
182
1
0
26 Jun 2025
ODE$_t$(ODE$_l$): Shortcutting the Time and the Length in Diffusion and Flow Models for Faster Sampling
Denis A. Gudovskiy
Wenzhao Zheng
Tomoyuki Okuno
Yohei Nakata
Kurt Keutzer
223
0
0
26 Jun 2025
Step-by-Step Video-to-Audio Synthesis via Negative Audio Guidance
Akio Hayakawa
Masato Ishii
Takashi Shibuya
Yuki Mitsufuji
DiffM, VGen
316
1
0
26 Jun 2025
From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios
Changliang Xia
Chengyou Jia
Zhuohang Dang
Minnan Luo
Zhihui Li
Xiaojun Chang
DiffM, OffRL
259
1
0
25 Jun 2025
Orthogonal Finetuning Made Scalable
Zeju Qiu
Weiyang Liu
Adrian Weller
Bernhard Schölkopf
221
1
0
24 Jun 2025
SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution
Liangbin Xie
Yu Li
Shian Du
Menghan Xia
Xintao Wang
Fanghua Yu
Ziyan Chen
Pengfei Wan
Jiantao Zhou
Chao Dong
DiffM, VGen, SupR
413
1
0
24 Jun 2025
OmniGen2: Exploration to Advanced Multimodal Generation
Chenyuan Wu
PengFei Zheng
Ruiran Yan
Shitao Xiao
Xin Luo
...
Defu Lian
X. Wang
Zhongyuan Wang
Tiejun Huang
Zheng Liu
MLLM, SyDa, VLM
304
169
0
23 Jun 2025
Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition
Jiaqi Li
Junshu Tang
Zhiyong Xu
Longhuang Wu
Yuan Zhou
Shuai Shao
Tianbao Yu
Zhiguo Cao
Qinglin Lu
DiffM, VGen
208
24
0
20 Jun 2025
Fast and Stable Diffusion Planning through Variational Adaptive Weighting
Zhiying Qiu
Tao Lin
DiffM, OffRL
192
0
0
20 Jun 2025
DreamCube: 3D Panorama Generation via Multi-plane Synchronization
Yukun Huang
Yanning Zhou
Jianan Wang
Kaiyi Huang
Xihui Liu
165
6
0
20 Jun 2025
How to Train your Text-to-Image Model: Evaluating Design Choices for Synthetic Training Captions
Manuel Brack
Sudeep Katakol
Felix Friedrich
P. Schramowski
Hareesh Ravi
Kristian Kersting
Ajinkya Kale
178
1
0
20 Jun 2025
UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation
Teng Li
Quanfeng Lu
Lirui Zhao
Hao Li
X. Zhu
Yu Qiao
Jun Zhang
Wenqi Shao
228
4
0
20 Jun 2025
Emergent Temporal Correspondences from Video Diffusion Transformers
Jisu Nam
Soowon Son
Dahyun Chung
Jiyoung Kim
Siyoon Jin
Junhwa Hur
Seungryong Kim
VGen
346
10
0
20 Jun 2025
The Hidden Cost of an Image: Quantifying the Energy Consumption of AI Image Generation
Giulia Bertazzini
Chiara Albisani
Daniele Baracchi
Dasara Shullani
Roberto Verdecchia
198
2
0
20 Jun 2025
FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation
Computer Vision and Pattern Recognition (CVPR), 2025
Sen Wang
Le Wang
Sanping Zhou
Jingyi Tian
Jiayi Li
Haowen Sun
Wei Tang
202
7
0
19 Jun 2025
DT-UFC: Universal Large Model Feature Coding via Peaky-to-Balanced Distribution Transformation
Changsheng Gao
Zijie Liu
L. Li
Dong Liu
Xiaoyan Sun
Weisi Lin
OffRL
149
1
0
19 Jun 2025
Improving Rectified Flow with Boundary Conditions
Xixi Hu
Runlong Liao
Keyang Xu
B. Liu
Yeqing Li
Eugene Ie
Hongliang Fei
Qiang Liu
224
1
0
18 Jun 2025
Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model
Anirud Aggarwal
Abhinav Shrivastava
M. Gwilliam
417
0
0
18 Jun 2025
Show-o2: Improved Native Unified Multimodal Models
Jinheng Xie
Zhenheng Yang
Mike Zheng Shou
VGen
477
90
0
18 Jun 2025
Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
Team Hunyuan3D
Shuhui Yang
M. Yang
Yifei Feng
Xin Huang
...
Yuhong Liu
Linus
Jie Jiang
J. Huang
Chunchao Guo
3DH
257
47
0
18 Jun 2025
FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space
Black Forest Labs
Stephen Batifol
A. Blattmann
Frederic Boesel
Saksham Consul
...
Dustin Podell
Robin Rombach
Harry Saini
Axel Sauer
Luke Smith
DiffM
353
343
0
17 Jun 2025
EchoShot: Multi-Shot Portrait Video Generation
Jiahao Wang
Hualian Sheng
Sijia Cai
Weizhan Zhang
Caixia Yan
Yachuang Feng
Bing Deng
Jieping Ye
DiffM, VGen
190
7
0
16 Jun 2025
iDiT-HOI: Inpainting-based Hand Object Interaction Reenactment via Video Diffusion Transformer
Zhelun Shen
Chenming Wu
Junsheng Zhou
Chen Zhao
Kaisiyuan Wang
Hang Zhou
Yingying Li
Haocheng Feng
Wei He
Jingdong Wang
DiffM
230
0
0
15 Jun 2025
EraserDiT: Fast Video Inpainting with Diffusion Transformer Model
Jie Liu
Zheng Hui
DiffM, VGen
210
0
0
15 Jun 2025
Auditing Data Provenance in Real-world Text-to-Image Diffusion Models for Privacy and Copyright Protection
Jie Zhu
Leye Wang
205
0
0
13 Jun 2025
PosterCraft: Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework
Sixiang Chen
Jianyu Lai
Jialin Gao
Tian-Chun Ye
Haoyu Chen
...
Zhaohu Xing
Yeying Jin
Junfeng Luo
Xiaoming Wei
Lei Zhu
DiffM
278
8
0
12 Jun 2025
CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation
Zhao Zhang
Yutao Cheng
Dexiang Hong
Maoke Yang
Gonglei Shi
Lei Ma
H. Zhang
Jie Shao
Xinglong Wu
DiffM
330
5
0
12 Jun 2025
Where and How to Perturb: On the Design of Perturbation Guidance in Diffusion and Flow Models
Donghoon Ahn
Jiwon Kang
Sanghyun Lee
Minjae Kim
Jaewon Min
Wooseok Jang
Saungwu Lee
Sayak Paul
S. Hong
Seungryong Kim
DiffM, AAML
473
0
0
12 Jun 2025
Pisces: An Auto-regressive Foundation Model for Image Understanding and Generation
Zhiyang Xu
Jiuhai Chen
Zhaojiang Lin
Xichen Pan
Lifu Huang
...
Di Jin
Michihiro Yasunaga
Lili Yu
Xi Lin
Shaoliang Nie
361
4
0
12 Jun 2025
DreamActor-H1: High-Fidelity Human-Product Demonstration Video Generation via Motion-designed Diffusion Transformers
Lizhen Wang
Zhurong Xia
T. Hu
P. Wang
Pengfei Wang
Zerong Zheng
Ming Zhou
Yuan Zhang
Mingyuan Gao
DiffM, VGen
438
9
0
12 Jun 2025
Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models
Francisco Caetano
Christiaan Viviers
Peter H. N. de With
Fons van der Sommen
DiffM
346
1
0
12 Jun 2025
Consistent Story Generation: Unlocking the Potential of Zigzag Sampling
Mingxiao Li
Mang Ning
Marie-Francine Moens
DiffM
445
0
0
11 Jun 2025
Geometric Regularity in Deterministic Sampling Dynamics of Diffusion-based Generative Models
Defang Chen
Zhenyu Zhou
C. Wang
Siwei Lyu
DiffM
332
1
0
11 Jun 2025
ScoreMix: Synthetic Data Generation by Score Composition in Diffusion Models Improves Recognition
Parsa Rahimi
Sébastien Marcel
DiffM
270
1
0
11 Jun 2025
A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation
Yukang Feng
Jianwen Sun
Chuanhao Li
Zizhen Li
Jiaxin Ai
...
Yifan Chang
Sizhuo Zhou
Shenglin Zhang
Yu Dai
Kaipeng Zhang
MLLM, EGVM
306
0
0
11 Jun 2025
Page 12 of 25