ResearchTrend.AI
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis


5 March 2024
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
Harry Saini
Yam Levi
Dominik Lorenz
Axel Sauer
Frederic Boesel
Dustin Podell
Tim Dockhorn
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
DiffM
ArXiv (abs) | PDF | HTML | HuggingFace (68 upvotes)

Papers citing "Scaling Rectified Flow Transformers for High-Resolution Image Synthesis"

50 / 1,247 papers shown
Consistent Story Generation: Unlocking the Potential of Zigzag Sampling
Mingxiao Li
Mang Ning
Marie-Francine Moens
DiffM
445
0
0
11 Jun 2025
Audio Generation Through Score-Based Generative Modeling: Design Principles and Implementation
Ge Zhu
Yutong Wen
Zhiyao Duan
DiffM, MedIm
241
3
0
10 Jun 2025
CAIRe: Cultural Attribution of Images by Retrieval-Augmented Evaluation
Arnav Yayavaram
Siddharth Yayavaram
Simran Khanuja
Michael Saxon
Graham Neubig
249
0
0
10 Jun 2025
Bias Analysis in Unconditional Image Generative Models
Xiaofeng Zhang
Michelle Lin
Damien Scieur
Aaron Courville
Yash Goyal
189
0
0
10 Jun 2025
HunyuanVideo-HOMA: Generic Human-Object Interaction in Multimodal Driven Human Animation
Ziyao Huang
Zixiang Zhou
Juan Cao
Yifeng Ma
Yi Chen
...
Hongmei Wang
Qin Lin
Yuan Zhou
Qinglin Lu
Fan Tang
VGen
225
5
0
10 Jun 2025
Flow Diverse and Efficient: Learning Momentum Flow Matching via Stochastic Velocity Field Sampling
Zhiyuan Ma
Ruixun Liu
Sixian Liu
Jianjun Li
Bowen Zhou
235
2
0
10 Jun 2025
Edit Flows: Flow Matching with Edit Operations
Marton Havasi
Brian Karrer
Itai Gat
Ricky T. Q. Chen
BDL
506
18
0
10 Jun 2025
FlagEvalMM: A Flexible Framework for Comprehensive Multimodal Model Evaluation
Xue Sun
Yesheng Liu
Jing-shu Zheng
Xuejing Li
Richeng Xuan
Jin-Ge Yao
Xi Yang
VLM, MLLM
285
2
0
10 Jun 2025
Re-Thinking the Automatic Evaluation of Image-Text Alignment in Text-to-Image Models
Huixuan Zhang
Xiaojun Wan
EGVM
188
0
0
10 Jun 2025
CulturalFrames: Assessing Cultural Expectation Alignment in Text-to-Image Models and Evaluation Metrics
Shravan Nayak
Mehar Bhatia
Xiaofeng Zhang
Verena Rieser
Lisa Anne Hendricks
Sjoerd van Steenkiste
Yash Goyal
Karolina Stańczak
Aishwarya Agrawal
EGVM
395
5
0
10 Jun 2025
How Much To Guide: Revisiting Adaptive Guidance in Classifier-Free Guidance Text-to-Vision Diffusion Models
Huixuan Zhang
Junzhe Zhang
Xiaojun Wan
192
2
0
10 Jun 2025
Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers
Zhengyao Lv
Tianlin Pan
Chenyang Si
Zhaoxi Chen
W. Zuo
Yu Qiao
Kwan-Yee K. Wong
300
5
0
09 Jun 2025
Evaluating Robustness in Latent Diffusion Models via Embedding Level Augmentation
Boris Martirosyan
Alexey Karmanov
DiffM
146
0
0
09 Jun 2025
Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces
Kevin Rojas
Yuchen Zhu
Sichen Zhu
Felix X.-F. Ye
Molei Tao
DiffM
267
11
0
09 Jun 2025
Dreamland: Controllable World Creation with Simulator and Generative Models
Sicheng Mo
Ziyang Leng
Leon Liu
Weizhen Wang
Honglin He
Bolei Zhou
VGen
134
1
0
09 Jun 2025
Generative Modeling of Weights: Generalization or Memorization?
Boya Zeng
Yida Yin
Zhiqiu Xu
Zhuang Liu
DiffM
309
4
0
09 Jun 2025
SUDER: Self-Improving Unified Large Multimodal Models for Understanding and Generation with Dual Self-Rewards
Jixiang Hong
Yiran Zhang
Guanzhong Wang
Yi Liu
Ji-Rong Wen
Rui Yan
LRM
236
1
0
09 Jun 2025
Snap-and-tune: combining deep learning and test-time optimization for high-fidelity cardiovascular volumetric meshing
Daniel H. Pak
Shubh Thaker
Kyle Baylous
Xiaoran Zhang
Danny Bluestein
James S. Duncan
AI4CE
229
9
0
09 Jun 2025
PairEdit: Learning Semantic Variations for Exemplar-based Image Editing
Haoguang Lu
Jiacheng Chen
Zhenguo Yang
Aurele Tohokantche Gnanha
Fu Lee Wang
Li Qing
Xudong Mao
DiffM
353
1
0
09 Jun 2025
OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation
Jingjing Chang
Yixiao Fang
Peng Xing
Shuhan Wu
Wei Cheng
Rui Wang
Xianfang Zeng
Gang Yu
H. Chen
EGVM, VLM
450
21
0
09 Jun 2025
R3D2: Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation
William Ljungbergh
Bernardo Taveira
Wenzhao Zheng
Adam Tonderski
Chensheng Peng
...
Christoffer Petersson
Michael Felsberg
Kurt Keutzer
Masayoshi Tomizuka
Wei Zhan
227
6
0
09 Jun 2025
VIVAT: Virtuous Improving VAE Training through Artifact Mitigation
Lev Novitskiy
Viacheslav Vasilev
Maria Kovaleva
V. Arkhipkin
Denis Dimitrov
VGen
209
1
0
09 Jun 2025
PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement
Teng Hu
Zhentao Yu
Zhengguang Zhou
Jiangning Zhang
Yuan Zhou
Qinglin Lu
Ran Yi
VGen
230
4
0
09 Jun 2025
Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation
Computer Vision and Pattern Recognition (CVPR), 2025
H. Kim
Donghyun Kim
Suhyun Kim
DiffM
231
1
0
09 Jun 2025
Breaking Data Silos: Towards Open and Scalable Mobility Foundation Models via Generative Continual Learning
Yuan Yuan
Yukun Liu
Chonghua Han
Jie Feng
Yong Li
196
0
0
07 Jun 2025
FontAdapter: Instant Font Adaptation in Visual Text Generation
Myungkyu Koo
Subin Kim
Sangkyung Kwak
Jaehyun Nam
Seojin Kim
Jinwoo Shin
DiffM, VLM
290
1
0
06 Jun 2025
STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis
Jiatao Gu
Tianrong Chen
David Berthelot
Huangjie Zheng
Yuyang Wang
Ruixiang Zhang
Laurent Dinh
Miguel Angel Bautista
Josh Susskind
Shuangfei Zhai
246
13
0
06 Jun 2025
AQUATIC-Diff: Additive Quantization for Truly Tiny Compressed Diffusion Models
Adil Hasan
Thomas Peyrin
DiffM, MQ
391
0
0
06 Jun 2025
FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing
Guangzhao Li
Yanming Yang
Chenxi Song
Chi Zhang
DiffM, VGen
277
6
0
05 Jun 2025
ContentV: Efficient Training of Video Generation Models with Limited Compute
Wenfeng Lin
Renjie Chen
Boyuan Liu
Shiyue Yan
Ruoyu Feng
...
Chao Feng
Jiao Ran
Qi Wu
Zuotao Liu
Mingyu Guo
VGen
442
3
0
05 Jun 2025
FPSAttention: Training-Aware FP8 and Sparsity Co-Design for Fast Video Diffusion
Akide Liu
Zeyu Zhang
Zhexin Li
Xuehai Bai
Yizeng Han
...
Jiahao He
Yuanyu He
F. Wang
Gholamreza Haffari
Bohan Zhuang
VGen, MQ
530
8
0
05 Jun 2025
Contrastive Flow Matching
George Stoica
Vivek Ramanujan
Xiang Fan
Ali Farhadi
Ranjay Krishna
Judy Hoffman
319
9
0
05 Jun 2025
Rectified Point Flow: Generic Point Cloud Pose Estimation
Tao Sun
Liyuan Zhu
S. Huang
Shuran Song
Iro Armeni
3DPC
287
3
0
05 Jun 2025
Towards Reliable Identification of Diffusion-based Image Manipulations
Alex Costanzino
Woody Bayliss
Juil Sock
Marc Gorriz Blanch
Danijela Horak
Ivan Laptev
Juil Sock
Fabio Pizzati
DiffM
268
1
0
05 Jun 2025
FocusDiff: Advancing Fine-Grained Text-Image Alignment for Autoregressive Visual Generation through RL
Kaihang Pan
Wendong Bu
Y. Wu
Yang Wu
Kai Shen
Yunfei Li
Hang Zhao
Juncheng Billy Li
Siliang Tang
Yueting Zhuang
224
9
0
05 Jun 2025
DIMCIM: A Quantitative Evaluation Framework for Default-mode Diversity and Generalization in Text-to-Image Generative Models
Revant Teotia
Candace Ross
Karen Ullrich
S. Chopra
Adriana Romero-Soriano
Melissa Hall
Matthew Muckley
EGVM, VLM
352
2
0
05 Jun 2025
HuGeDiff: 3D Human Generation via Diffusion with Gaussian Splatting
Maksym Ivashechkin
Oscar Mendez
Richard Bowden
3DGS
213
0
0
04 Jun 2025
RAID: A Dataset for Testing the Adversarial Robustness of AI-Generated Image Detectors
Hicham Eddoubi
Jonas Ricker
Federico Cocchi
Lorenzo Baraldi
Angelo Sotgiu
...
Marcella Cornia
Lorenzo Baraldi
Asja Fischer
Rita Cucchiara
Battista Biggio
AAML
521
0
0
04 Jun 2025
Negative-Guided Subject Fidelity Optimization for Zero-Shot Subject-Driven Generation
Chaehun Shin
Jooyoung Choi
J. Mok
Jungbeom Lee
Sungroh Yoon
DiffM
356
0
0
04 Jun 2025
Resolving Task Objective Conflicts in Unified Model via Task-Aware Mixture-of-Experts
Jiaxing Zhang
Xinyi Zeng
356
0
0
04 Jun 2025
DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models
Ziyi Wu
Vidit Goel
Ivan Skorokhodov
Willi Menapace
Ashkan Mirzaei
Igor Gilitschenski
Sergey Tulyakov
Aliaksandr Siarohin
DiffM, VGen
391
11
0
04 Jun 2025
DGMO: Training-Free Audio Source Separation through Diffusion-Guided Mask Optimization
Geonyoung Lee
Geonhee Han
Paul Hongsuck Seo
DiffM
236
1
0
03 Jun 2025
Rectified Flows for Fast Multiscale Fluid Flow Modeling
Victor Armegioiu
Yannick Ramic
Siddhartha Mishra
DiffM, AI4CE
228
2
0
03 Jun 2025
Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences
Yunhong Lu
Qichao Wang
H. Cao
Xiaoyin Xu
Min Zhang
337
5
0
03 Jun 2025
FlexPainter: Flexible and Multi-View Consistent Texture Generation
Dongyu Yan
Leyi Wu
Jiantao Lin
Luozhou Wang
Tianshuo Xu
Zhifei Chen
Zhen Yang
Lie Xu
Shunsi Zhang
Yingcong Chen
DiffM
244
1
0
03 Jun 2025
EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models
Mingzhe Li
Gehao Zhang
Zhenting Wang
Guanhong Tao
Siqi Pan
Richard Cartwright
Juan Zhai
Shiqing Ma
DiffM
239
0
0
03 Jun 2025
Controllable Human-centric Keyframe Interpolation with Generative Prior
Z. Guo
Size Wu
Zhongang Cai
Wei Li
Chen Change Loy
DiffM, VGen
204
1
0
03 Jun 2025
DFBench: Benchmarking Deepfake Image Detection Capability of Large Multimodal Models
Jiarui Wang
Huiyu Duan
Juntong Wang
Ziheng Jia
Woo Yi Yang
...
Yu Zhao
Jiaying Qian
Yuke Xing
Guangtao Zhai
Xiongkuo Min
246
3
0
03 Jun 2025
Rethinking Machine Unlearning in Image Generation Models
Renyang Liu
Wenjie Feng
Tianwei Zhang
Wei Zhou
Xueqi Cheng
See-Kiong Ng
MU, VLM
327
1
0
03 Jun 2025
RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model on Referring Expressions
Bimsara Pathiraja
Maitreya Patel
Shivam Singh
Yezhou Yang
Chitta Baral
177
2
0
03 Jun 2025