ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.03206
  4. Cited By
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

5 March 2024
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
Harry Saini
Yam Levi
Dominik Lorenz
Axel Sauer
Frederic Boesel
Dustin Podell
Tim Dockhorn
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
    DiffM
ArXivPDFHTML

Papers citing "Scaling Rectified Flow Transformers for High-Resolution Image Synthesis"

50 / 796 papers shown
Title
MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data
MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data
William Berman
A. Peysakhovich
29
4
0
26 Jun 2024
Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers
Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers
Lei Chen
Yuan Meng
Chen Tang
Xinzhu Ma
Jingyan Jiang
Xin Wang
Zhi Wang
Wenwu Zhu
MQ
23
21
0
25 Jun 2024
Identifying and Solving Conditional Image Leakage in Image-to-Video
  Diffusion Model
Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model
Min Zhao
Hongzhou Zhu
Chendong Xiang
Kaiwen Zheng
Chongxuan Li
Jun Zhu
61
8
0
22 Jun 2024
Fantastic Copyrighted Beasts and How (Not) to Generate Them
Fantastic Copyrighted Beasts and How (Not) to Generate Them
Luxi He
Yangsibo Huang
Weijia Shi
Tinghao Xie
Haotian Liu
Yue Wang
Luke Zettlemoyer
Chiyuan Zhang
Danqi Chen
Peter Henderson
39
9
0
20 Jun 2024
Conditional score-based diffusion models for solving inverse problems in
  mechanics
Conditional score-based diffusion models for solving inverse problems in mechanics
Agnimitra Dasgupta
Harisankar Ramaswamy
Javier Murgoitio-Esandi
Ken Foo
Runze Li
Qifa Zhou
Brendan Kennedy
Assad A. Oberai
DiffM
MedIm
32
2
0
19 Jun 2024
GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation
  Models
GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models
Yongtao Ge
Guangkai Xu
Zhiyue Zhao
Libo Sun
Zheng Huang
Yanlong Sun
Hao Chen
Chunhua Shen
MDE
37
3
0
18 Jun 2024
Learning Diffusion at Lightspeed
Learning Diffusion at Lightspeed
Antonio Terpin
Nicolas Lanzetti
Florian Dorfler
DiffM
16
6
0
18 Jun 2024
Exploring the Role of Large Language Models in Prompt Encoding for
  Diffusion Models
Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models
Bingqi Ma
Zhuofan Zong
Guanglu Song
Hongsheng Li
Yu Liu
30
19
0
17 Jun 2024
PhyBench: A Physical Commonsense Benchmark for Evaluating Text-to-Image
  Models
PhyBench: A Physical Commonsense Benchmark for Evaluating Text-to-Image Models
Fanqing Meng
Wenqi Shao
Lixin Luo
Yahong Wang
Yiran Chen
...
Yue Yang
Tianshuo Yang
Kaipeng Zhang
Yu Qiao
Ping Luo
EGVM
31
8
0
17 Jun 2024
Duoduo CLIP: Efficient 3D Understanding with Multi-View Images
Duoduo CLIP: Efficient 3D Understanding with Multi-View Images
Han-Hung Lee
Yiming Zhang
Angel X. Chang
3DPC
36
3
0
17 Jun 2024
Diffusion Models in Low-Level Vision: A Survey
Diffusion Models in Low-Level Vision: A Survey
Chunming He
Yuqi Shen
Chengyu Fang
Fengyang Xiao
Longxiang Tang
Yulun Zhang
W. Zuo
Zhenhua Guo
Xiu Li
VLM
DiffM
MedIm
67
30
0
17 Jun 2024
Poetry2Image: An Iterative Correction Framework for Images Generated
  from Chinese Classical Poetry
Poetry2Image: An Iterative Correction Framework for Images Generated from Chinese Classical Poetry
Jing Jiang
Yiran Ling
Binzhu Li
Pengxiang Li
Junming Piao
Yu Zhang
EGVM
DiffM
29
1
0
15 Jun 2024
Consistency-diversity-realism Pareto fronts of conditional image
  generative models
Consistency-diversity-realism Pareto fronts of conditional image generative models
Pietro Astolfi
Marlene Careil
Melissa Hall
Oscar Manas
Matthew Muckley
Jakob Verbeek
Adriana Romero Soriano
M. Drozdzal
45
10
0
14 Jun 2024
From Pixels to Prose: A Large Dataset of Dense Image Captions
From Pixels to Prose: A Large Dataset of Dense Image Captions
Vasu Singla
Kaiyu Yue
Sukriti Paul
Reza Shirkavand
Mayuka Jayawardhana
Alireza Ganjdanesh
Heng Huang
A. Bhatele
Gowthami Somepalli
Tom Goldstein
3DV
VLM
23
22
0
14 Jun 2024
LRM-Zero: Training Large Reconstruction Models with Synthesized Data
LRM-Zero: Training Large Reconstruction Models with Synthesized Data
Desai Xie
Sai Bi
Zhixin Shu
Kai Zhang
Zexiang Xu
Yi Zhou
Soren Pirk
Arie E. Kaufman
Xin Sun
Hao Tan
SyDa
37
14
0
13 Jun 2024
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal
  Prompts
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts
Yucheng Han
Rui Wang
Chi Zhang
Juntao Hu
Pei Cheng
Bin-Bin Fu
Hanwang Zhang
68
6
0
13 Jun 2024
FouRA: Fourier Low Rank Adaptation
FouRA: Fourier Low Rank Adaptation
Shubhankar Borse
Shreya Kadambi
N. Pandey
Kartikeya Bhardwaj
Viswanath Ganapathy
Sweta Priyadarshi
Risheek Garrepalli
Rafael Esteves
Munawar Hayat
Fatih Porikli
28
7
0
13 Jun 2024
AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and
  Video Generation
AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation
Kai Wang
Shijian Deng
Jing Shi
Dimitrios Hatzinakos
Yapeng Tian
VGen
69
8
0
11 Jun 2024
Commonsense-T2I Challenge: Can Text-to-Image Generation Models
  Understand Commonsense?
Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?
Xingyu Fu
Muyu He
Yujie Lu
William Yang Wang
Dan Roth
EGVM
LRM
23
15
0
11 Jun 2024
Flow Map Matching
Flow Map Matching
Nicholas M. Boffi
M. S. Albergo
Eric Vanden-Eijnden
27
4
0
11 Jun 2024
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with
  Foundation Models
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models
Athanasios Tragakis
Marco Aversa
Chaitanya Kaul
Roderick Murray-Smith
Daniele Faccio
44
2
0
11 Jun 2024
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
X. Wang
Siming Fu
Qihan Huang
Wanggui He
Hao Jiang
DiffM
34
41
0
11 Jun 2024
Margin-aware Preference Optimization for Aligning Diffusion Models
  without Reference
Margin-aware Preference Optimization for Aligning Diffusion Models without Reference
Jiwoo Hong
Sayak Paul
Noah Lee
Kashif Rasul
James Thorne
Jongheon Jeong
31
13
0
10 Jun 2024
ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise
  Optimization
ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
L. Eyring
Shyamgopal Karthik
Karsten Roth
Alexey Dosovitskiy
Zeynep Akata
71
16
0
06 Jun 2024
VideoPhy: Evaluating Physical Commonsense for Video Generation
VideoPhy: Evaluating Physical Commonsense for Video Generation
Hritik Bansal
Zongyu Lin
Tianyi Xie
Zeshun Zong
Michal Yarom
Yonatan Bitton
Chenfanfu Jiang
Yizhou Sun
Kai-Wei Chang
Aditya Grover
EGVM
VGen
32
36
0
05 Jun 2024
Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT
Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT
Le Zhuo
Ruoyi Du
Han Xiao
Yangguang Li
Dongyang Liu
...
Wanli Ouyang
Ziwei Liu
Yu Qiao
Hongsheng Li
Peng Gao
52
43
0
05 Jun 2024
Seed-TTS: A Family of High-Quality Versatile Speech Generation Models
Seed-TTS: A Family of High-Quality Versatile Speech Generation Models
Philip Anastassiou
Jiawei Chen
J. Chen
Yuanzhe Chen
Zhuo Chen
...
Wenjie Zhang
Y. Zhang
Zilin Zhao
Dejian Zhong
Xiaobin Zhuang
36
74
0
04 Jun 2024
Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few
  Steps Image Generation
Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
Clement Chadebec
O. Tasar
Eyal Benaroche
Benjamin Aubin
VLM
57
8
0
04 Jun 2024
The Crystal Ball Hypothesis in diffusion models: Anticipating object
  positions from initial noise
The Crystal Ball Hypothesis in diffusion models: Anticipating object positions from initial noise
Yuanhao Ban
Ruochen Wang
Tianyi Zhou
Boqing Gong
Cho-Jui Hsieh
Minhao Cheng
DiffM
38
4
0
04 Jun 2024
Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching
Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching
Xinyin Ma
Gongfan Fang
Michael Bi Mi
Xinchao Wang
53
30
0
03 Jun 2024
Dimba: Transformer-Mamba Diffusion Models
Dimba: Transformer-Mamba Diffusion Models
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Debang Li
Youqiang Zhang
Junshi Huang
Mamba
54
16
0
03 Jun 2024
$Δ$-DiT: A Training-Free Acceleration Method Tailored for Diffusion
  Transformers
ΔΔΔ-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers
Pengtao Chen
Mingzhu Shen
Peng Ye
Jianjian Cao
Chongjun Tu
C. Bouganis
Yiren Zhao
Tao Chen
53
26
0
03 Jun 2024
Diffusion Tuning: Transferring Diffusion Models via Chain of Forgetting
Diffusion Tuning: Transferring Diffusion Models via Chain of Forgetting
Jincheng Zhong
Xingzhuo Guo
Jiaxiang Dong
Mingsheng Long
DiffM
33
2
0
02 Jun 2024
Learning Multimodal Behaviors from Scratch with Diffusion Policy
  Gradient
Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient
Zechu Li
Rickmer Krohn
Tao Chen
Anurag Ajay
Pulkit Agrawal
Georgia Chalvatzaki
DiffM
42
7
0
02 Jun 2024
Improving Text Generation on Images with Synthetic Captions
Improving Text Generation on Images with Synthetic Captions
Jun Young Koh
Sang Hyun Park
Joy Song
DiffM
41
2
0
01 Jun 2024
Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching
Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching
Yongqi Wang
Wenxiang Guo
Rongjie Huang
Jia-Bin Huang
Zehan Wang
Fuming You
Ruiqi Li
Zhou Zhao
VGen
DiffM
21
11
0
01 Jun 2024
Bootstrap3D: Improving 3D Content Creation with Synthetic Data
Bootstrap3D: Improving 3D Content Creation with Synthetic Data
Zeyi Sun
Tong Wu
Pan Zhang
Yuhang Zang
Xiao-wen Dong
Yuanjun Xiong
Dahua Lin
Jiaqi Wang
34
0
0
31 May 2024
Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models
Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models
Xinxi Zhang
Song Wen
Ligong Han
Felix Juefei Xu
Akash Srivastava
Junzhou Huang
Hao Wang
Molei Tao
Dimitris N. Metaxas
DiffM
23
5
0
31 May 2024
Improving the Training of Rectified Flows
Improving the Training of Rectified Flows
Sangyun Lee
Zinan Lin
Giulia Fanti
34
19
0
30 May 2024
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified
  Flow
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
Chaoyang Wang
Xiangtai Li
Lu Qi
Henghui Ding
Yunhai Tong
Ming-Hsuan Yang
DiffM
69
6
0
30 May 2024
CV-VAE: A Compatible Video VAE for Latent Generative Video Models
CV-VAE: A Compatible Video VAE for Latent Generative Video Models
Sijie Zhao
Yong Zhang
Xiaodong Cun
Shaoshu Yang
Muyao Niu
Xiaoyu Li
Wenbo Hu
Ying Shan
DiffM
56
23
0
30 May 2024
Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching
Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching
Yasi Zhang
Peiyu Yu
Yaxuan Zhu
Yingshan Chang
Feng Gao
Yingnian Wu
Oscar Leong
67
6
0
29 May 2024
T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model
  with Mixed Reward Feedback
T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback
Jiachen Li
Weixi Feng
Tsu-jui Fu
Xinyi Wang
Sugato Basu
Wenhu Chen
William Yang Wang
VGen
29
27
0
29 May 2024
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
Lianghui Zhu
Zilong Huang
Bencheng Liao
Jun Hao Liew
Hanshu Yan
Jiashi Feng
Xinggang Wang
68
12
0
28 May 2024
FAIntbench: A Holistic and Precise Benchmark for Bias Evaluation in
  Text-to-Image Models
FAIntbench: A Holistic and Precise Benchmark for Bias Evaluation in Text-to-Image Models
Hanjun Luo
Ziye Deng
Ruizhe Chen
Zuo-Qiang Liu
EGVM
25
9
0
28 May 2024
FlowSDF: Flow Matching for Medical Image Segmentation Using Distance Transforms
FlowSDF: Flow Matching for Medical Image Segmentation Using Distance Transforms
L. Bogensperger
Dominik Narnhofer
Alexander Falk
Konrad Schindler
Thomas Pock
MedIm
28
3
0
28 May 2024
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
Kai Wang
Yukun Zhou
Mingjia Shi
Zhihang Yuan
Yuzhang Shang
Yuzhang Shang
Hanwang Zhang
Hanwang Zhang
Yang You
60
9
0
27 May 2024
Automatic Jailbreaking of the Text-to-Image Generative AI Systems
Automatic Jailbreaking of the Text-to-Image Generative AI Systems
Minseon Kim
Hyomin Lee
Boqing Gong
Huishuai Zhang
Sung Ju Hwang
19
11
0
26 May 2024
Towards Black-Box Membership Inference Attack for Diffusion Models
Towards Black-Box Membership Inference Attack for Diffusion Models
Jingwei Li
Jingyi Dong
Tianxing He
Jingzhao Zhang
23
3
0
25 May 2024
Lateralization MLP: A Simple Brain-inspired Architecture for Diffusion
Lateralization MLP: A Simple Brain-inspired Architecture for Diffusion
Zizhao Hu
Mohammad Rostami
21
0
0
25 May 2024
Previous
123...141516
Next