arXiv: 2403.03206

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

5 March 2024

Frederic Boesel

ArXiv (abs) | PDF | HTML | HuggingFace (68 upvotes)

Papers citing "Scaling Rectified Flow Transformers for High-Resolution Image Synthesis"

50 / 1,247 papers shown

Native-Resolution Image Synthesis

315

4

0

03 Jun 2025

Feature-aware Hypergraph Generation via Next-Scale Prediction

Dorian Gailhard

Enzo Tartaglione

Jhony H. Giraldo

268

0

0

02 Jun 2025

Cycle Consistency as Reward: Learning Image-Text Alignment without Human Preferences

417

7

0

02 Jun 2025

Image Generation from Contextually-Contradictory Prompts

Daniel Cohen-Or

232

3

0

02 Jun 2025

Ultra-High-Resolution Image Synthesis: Data, Method and Evaluation

230

2

0

02 Jun 2025

DNAEdit: Direct Noise Alignment for Text-Guided Rectified Flow Editing

292

6

0

02 Jun 2025

Many-for-Many: Unify the Training of Multiple Video and Image Generation and Manipulation Tasks

267

0

0

02 Jun 2025

TIIF-Bench: How Does Your T2I Model Follow Your Instructions?

214

24

0

02 Jun 2025

OmniV2V: Versatile Video Generation and Editing via Dynamic Content Manipulation

Zhengguang Zhou

...

DiffM VGen SyDa

275

6

0

02 Jun 2025

Humanoid World Models: Open World Foundation Models for Humanoid Robotics

Muhammad Qasim Ali

Shahbuland Matiana

Mohammad Al-Sharman

226

3

0

01 Jun 2025

Rhythm Controllable and Efficient Zero-Shot Voice Conversion via Shortcut Flow Matching

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

223

0

0

01 Jun 2025

DS-VTON: An Enhanced Dual-Scale Coarse-to-Fine Framework for Virtual Try-On

236

1

0

01 Jun 2025

SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video Diffusion Transformers

...

226

6

0

01 Jun 2025

Seg2Any: Open-set Segmentation-Mask-to-Image Generation with Precise Shape and Semantic Control

346

1

0

31 May 2025

SenseFlow: Scaling Distribution Matching for Flow-based Text-to-Image Distillation

235

6

0

31 May 2025

Parallel Rescaling: Rebalancing Consistency Guidance for Personalized Diffusion Models

145

0

0

31 May 2025

ViStoryBench: Comprehensive Benchmark Suite for Story Visualization

...

480

12

0

30 May 2025

PDE-Transformer: Efficient and Versatile Transformers for Physics Simulations

Benjamin Holzschuh

249

9

0

30 May 2025

Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking

251

40

0

30 May 2025

Inference-Time Alignment of Diffusion Models via Evolutionary Algorithms

Nick Eliopoulos

Benjamin Shiue-Hal Chou

George K. Thiruvathukal

188

1

0

30 May 2025

STORK: Faster Diffusion And Flow Matching Sampling By Resolving Both Stiffness And Structure-Dependence

Andrea L. Bertozzi

185

2

0

30 May 2025

EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering

215

13

0

30 May 2025

GenSpace: Benchmarking Spatially-Aware Image Generation

Hengshuang Zhao

278

2

0

30 May 2025

ComposeAnything: Composite Object Priors for Text-to-Image Generation

Cordelia Schmid

274

1

0

30 May 2025

TumorGen: Boundary-Aware Tumor-Mask Synthesis with Rectified Flow Matching

101

0

0

30 May 2025

Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis

189

3

0

29 May 2025

FlowAlign: Trajectory-Regularized, Inversion-Free Flow-based Image Editing

337

6

0

29 May 2025

Knowledge Insulating Vision-Language-Action Models: Train Fast, Run Fast, Generalize Better

Jost Tobias Springenberg

...

Lucy Xiaoyang Shi

294

46

0

29 May 2025

LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers

Hidir Yesiltepe

261

5

0

29 May 2025

A Survey of Generative Categories and Techniques in Multimodal Generative Models

Almas Baimagambetov

Nikolaos Polatidis

407

0

0

29 May 2025

Fooling the Watchers: Breaking AIGC Detectors via Semantic Prompt Attacks

353

0

0

29 May 2025

Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization

543

8

0

29 May 2025

Fine-Tuning Next-Scale Visual Autoregressive Models with Group Relative Policy Optimization

Haitz Sáez de Ocáriz Borde

175

3

0

29 May 2025

UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes

277

8

0

29 May 2025

Composite Flow Matching for Reinforcement Learning with Shifted-Dynamics Data

351

7

0

29 May 2025

Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model

...

Shuangyong Song

336

23

0

29 May 2025

HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer

...

195

69

0

28 May 2025

Scaling Offline RL via Efficient and Expressive Shortcut Models

Nicolas Espinosa-Dice

Kianté Brantley

259

5

0

28 May 2025

Streaming Flow Policy: Simplifying diffusion/flow-matching policies by treating action trajectories as flow trajectories

Tomás Lozano-Pérez

Leslie Pack Kaelbling

Siddharth Ancha

349

0

0

28 May 2025

ReinFlow: Fine-tuning Flow Matching Policy with Online Reinforcement Learning

590

10

0

28 May 2025

SineLoRAΔ: Sine-Activated Delta Compression

Hemanth Saratchandran

354

0

0

28 May 2025

SridBench: Benchmark of Scientific Research Illustration Drawing of Image Generation Model

215

5

0

28 May 2025

Versatile Cardiovascular Signal Generation with a Unified Diffusion Transformer

280

0

0

28 May 2025

Re-ttention: Ultra Sparse Visual Generation via Attention Statistical Reshape

414

1

0

28 May 2025

ISAC: Training-Free Instance-to-Semantic Attention Control for Improving Multi-Instance Generation

1.1K

0

0

27 May 2025

LeDiFlow: Learned Distribution-guided Flow Matching to Accelerate Image Generation

Nils Friederich

Maximilian Beichter

Lennart Hilbert

Oliver Bringmann

161

0

0

27 May 2025

Differentiable Solver Search for Fast Diffusion Sampling

299

2

0

27 May 2025

Normalized Attention Guidance: Universal Negative Guidance for Diffusion Models

Hmrishav Bandyopadhyay

451

6

0

27 May 2025

Advancing high-fidelity 3D and Texture Generation with 2.5D latents

290

3

0

27 May 2025

MMIG-Bench: Towards Comprehensive and Explainable Evaluation of Multi-Modal Image Generation Models

Daniel G. Aliaga

404

2

0

26 May 2025

Page 14 of 25
