Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

5 March 2024

ArXiv (abs) | PDF | HTML | HuggingFace (68 upvotes)

Papers citing "Scaling Rectified Flow Transformers for High-Resolution Image Synthesis"

50 / 1,251 papers shown

Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining

Shitian Zhao

Xinyue Li

Qi Qin

Yu Qiao

Hongsheng Li

Peng Gao

MLLM

425

111

05 Aug 2024

Bailing-TTS: Chinese Dialectal Speech Synthesis Towards Human-like Spontaneous Representation

275

01 Aug 2024

VolDoGer: LLM-assisted Datasets for Domain Generalization in Vision-Language Tasks

322

29 Jul 2024

RNACG: A Universal RNA Sequence Conditional Generation model based on Flow-Matching

Letian Gao

Zhi John Lu

315

29 Jul 2024

Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget

246

22 Jul 2024

764

138

19 Jul 2024

I2AM: Interpreting Image-to-Image Latent Diffusion Models via Bi-Attribution Maps

Junseo Park

Hyeryung Jang

610

17 Jul 2024

Exploring the Potentials and Challenges of Deep Generative Models in Product Design Conception

Phillip Mueller

Lars Mikelsons

AI4CE

396

15 Jul 2024

Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions

Pavan Kumar Anasosalu Vasu

384

09 Jul 2024

Improved Noise Schedule for Diffusion Training

Tiankai Hang

Shuyang Gu

DiffM

334

03 Jul 2024

GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models

615

02 Jul 2024

OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation

Zhenheng Yang

Zhijie Chen

Xiang Li

Jian Yang

Ying Tai

563

200

02 Jul 2024

Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model

332

22 Jun 2024

Fantastic Copyrighted Beasts and How (Not) to Generate Them

Luxi He

Yangsibo Huang

Weijia Shi

Luke Zettlemoyer

389

20 Jun 2024

Conditional score-based diffusion models for solving inverse problems in mechanics

Computer Methods in Applied Mechanics and Engineering (CMAME), 2024

Agnimitra Dasgupta

Harisankar Ramaswamy

Javier Murgoitio-Esandi

359

19 Jun 2024

Learning Diffusion at Lightspeed

Neural Information Processing Systems (NeurIPS), 2024

263

18 Jun 2024

AITTI: Learning Adaptive Inclusive Token for Text-to-Image Generation

Xinyu Hou

Xiaoming Li

Chen Change Loy

DiffM

228

18 Jun 2024

Duoduo CLIP: Efficient 3D Understanding with Multi-View Images

596

17 Jun 2024

Diffusion Models in Low-Level Vision: A Survey

Chunming He

Yuqi Shen

Chengyu Fang

Fengyang Xiao

Longxiang Tang

Yulun Zhang

W. Zuo

Zhenhua Guo

Xiu Li

VLM DiffM MedIm

520

17 Jun 2024

LRM-Zero: Training Large Reconstruction Models with Synthesized Data

Yi Zhou

344

13 Jun 2024

Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models

Athanasios Tragakis

Marco Aversa

Chaitanya Kaul

Roderick Murray-Smith

Daniele Faccio

344

11 Jun 2024

MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance

X. Wang

Siming Fu

Qihan Huang

Wanggui He

Hao Jiang

DiffM

553

104

11 Jun 2024

Margin-aware Preference Optimization for Aligning Diffusion Models without Reference

344

10 Jun 2024

The Crystal Ball Hypothesis in diffusion models: Anticipating object positions from initial noise

254

04 Jun 2024

Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models

Akash Srivastava

200

31 May 2024

Improving the Training of Rectified Flows

Sangyun Lee

Zinan Lin

Giulia Fanti

274

30 May 2024

Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching

Yaxuan Zhu

461

29 May 2024

FlowSDF: Flow Matching for Medical Image Segmentation Using Distance Transforms

Konrad Schindler

521

28 May 2024

A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training

Yang You

439

27 May 2024

Automatic Jailbreaking of the Text-to-Image Generative AI Systems

299

26 May 2024

Towards Black-Box Membership Inference Attack for Diffusion Models

465

25 May 2024

Fisher Flow Matching for Generative Modeling over Discrete Data

Neural Information Processing Systems (NeurIPS), 2024

Oscar Davis

Samuel Kessler

Mircea Petrache

İsmail İlkan Ceylan

Michael M. Bronstein

A. Bose

470

23 May 2024

LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models

Neural Information Processing Systems (NeurIPS), 2024

Otmar Hilliges

407

23 May 2024

TerDiT: Ternary Diffusion Models with Transformers

382

23 May 2024

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jianwei Zhang

...

Wei Liu

297

228

14 May 2024

Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers

...

Weicai Ye

Yu Qiao

341

125

09 May 2024

Video Diffusion Models: A Survey

358

06 May 2024

CCDM: Continuous Conditional Diffusion Models for Image Generation

Xin Ding

Yongwei Wang

Kao Zhang

Z. Jane Wang

DiffM

501

06 May 2024

ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion

508

26 Apr 2024

TextCenGen: Attention-Guided Text-Centric Background Adaptation for Text-to-Image Generation

563

18 Apr 2024

Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction

Neural Information Processing Systems (NeurIPS), 2024

411

743

03 Apr 2024

Faster Diffusion via Temporal Attention Decomposition

Juan-Manuel Perez-Rua

Jürgen Schmidhuber

DiffM

533

03 Apr 2024

Diffusion Model for Data-Driven Black-Box Optimization

Zihao Li

Hui Yuan

Kaixuan Huang

Mengdi Wang

250

20 Mar 2024

Just Say the Name: Online Continual Learning with Category Names Only via Data Generation

381

16 Mar 2024

MuLan: Multimodal-LLM Agent for Progressive and Interactive Multi-Object Diffusion

162

20 Feb 2024

Score-based Diffusion Models via Stochastic Differential Equations -- a Technical Tutorial

Statistics Survey (Stat. Surv.), 2024

Wenpin Tang

Hanyang Zhao

DiffM

397

12 Feb 2024

AI Art Neural Constellation: Revealing the Collective and Contrastive State of AI-Generated and Human Art

222

04 Feb 2024

CoCoGen: Physically-Consistent and Conditioned Score-based Generative Models for Forward and Inverse Problems

SIAM Journal on Scientific Computing (SISC), 2023

Christian L. Jacobsen

Yilin Zhuang

Karthik Duraisamy

AI4CE SyDa DiffM

271

16 Dec 2023

Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis

Computer Vision and Pattern Recognition (CVPR), 2023

262

07 Sep 2023

On the Design Fundamentals of Diffusion Models: A Survey

Pattern Recognition (Pattern Recogn.), 2023

Ziyi Chang

George Alex Koulieris

Hyung Jin Chang

Hubert P. H. Shum

DiffM

650

07 Jun 2023