Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2403.03206
Cited By

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

5 March 2024

Frederic Boesel

ArXiv (abs)PDF HTML HuggingFace (68 upvotes)

Papers citing "Scaling Rectified Flow Transformers for High-Resolution Image Synthesis"

50 / 1,247 papers shown

QR-LoRA: Efficient and Disentangled Fine-tuning via QR Decomposition for Customized Generation

QR-LoRA: Efficient and Disentangled Fine-tuning via QR Decomposition for Customized Generation

291

1

0

07 Jul 2025

Effort-Optimized, Accuracy-Driven Labelling and Validation of Test Inputs for DL Systems: A Mixed-Integer Linear Programming Approach

Effort-Optimized, Accuracy-Driven Labelling and Validation of Test Inputs for DL Systems: A Mixed-Integer Linear Programming Approach

Mohammad Hossein Amini

184

0

0

07 Jul 2025

ICAS: Detecting Training Data from Autoregressive Image Generative Models

ICAS: Detecting Training Data from Autoregressive Image Generative Models

136

5

0

07 Jul 2025

LACONIC: A 3D Layout Adapter for Controllable Image Creation

LACONIC: A 3D Layout Adapter for Controllable Image Creation

Léopold Maillard

Adrien Ramanana Rahary

Maks Ovsjanikov

209

0

0

04 Jul 2025

MoDA: Multi-modal Diffusion Architecture for Talking Head Generation

MoDA: Multi-modal Diffusion Architecture for Talking Head Generation

283

0

0

04 Jul 2025

Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation

Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation

François Rozet

250

7

0

03 Jul 2025

RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation

RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation

331

0

0

03 Jul 2025

Visual Contextual Attack: Jailbreaking MLLMs with Image-Driven Context Injection

Visual Contextual Attack: Jailbreaking MLLMs with Image-Driven Context Injection

279

8

0

03 Jul 2025

IC-Custom: Diverse Image Customization via In-Context Learning

IC-Custom: Diverse Image Customization via In-Context Learning

...

185

2

0

02 Jul 2025

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

1.3K

86

0

01 Jul 2025

Parameter-aware high-fidelity microstructure generation using stable diffusion

Parameter-aware high-fidelity microstructure generation using stable diffusionAdvanced Engineering Informatics (AEI), 2025

Hoang Cuong Phan

142

0

0

01 Jul 2025

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Seedance 1.0: Exploring the Boundaries of Video Generation Models

...

Xiaozheng Zheng

246

104

0

01 Jul 2025

Towards foundational LiDAR world models with efficient latent flow matching

Towards foundational LiDAR world models with efficient latent flow matching

Nicholas Rhinehart

226

4

0

30 Jun 2025

ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing

ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing

441

16

0

26 Jun 2025

BitMark: Watermarking Bitwise Autoregressive Image Generative Models

BitMark: Watermarking Bitwise Autoregressive Image Generative Models

Franziska Boenisch

474

1

0

26 Jun 2025

TADA: Improved Diffusion Sampling with Training-free Augmented Dynamics

TADA: Improved Diffusion Sampling with Training-free Augmented Dynamics

David Berthelot

182

1

0

26 Jun 2025

ODE$_t$(ODE$_l$): Shortcutting the Time and the Length in Diffusion and Flow Models for Faster Sampling

_t

_l

): Shortcutting the Time and the Length in Diffusion and Flow Models for Faster Sampling

Denis A. Gudovskiy

223

0

0

26 Jun 2025

Step-by-Step Video-to-Audio Synthesis via Negative Audio Guidance

Step-by-Step Video-to-Audio Synthesis via Negative Audio Guidance

Takashi Shibuya

316

1

0

26 Jun 2025

From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios

From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios

259

1

0

25 Jun 2025

Orthogonal Finetuning Made Scalable

Orthogonal Finetuning Made Scalable

Bernhard Schölkopf

221

1

0

24 Jun 2025

SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution

SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution

DiffM VGen SupR

413

1

0

24 Jun 2025

OmniGen2: Exploration to Advanced Multimodal Generation

OmniGen2: Exploration to Advanced Multimodal Generation

...

304

169

0

23 Jun 2025

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

208

24

0

20 Jun 2025

Fast and Stable Diffusion Planning through Variational Adaptive Weighting

Fast and Stable Diffusion Planning through Variational Adaptive Weighting

192

0

0

20 Jun 2025

DreamCube: 3D Panorama Generation via Multi-plane Synchronization

DreamCube: 3D Panorama Generation via Multi-plane Synchronization

165

6

0

20 Jun 2025

How to Train your Text-to-Image Model: Evaluating Design Choices for Synthetic Training Captions

How to Train your Text-to-Image Model: Evaluating Design Choices for Synthetic Training Captions

Felix Friedrich

Kristian Kersting

178

1

0

20 Jun 2025

UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation

UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation

228

4

0

20 Jun 2025

Emergent Temporal Correspondences from Video Diffusion Transformers

Emergent Temporal Correspondences from Video Diffusion Transformers

346

10

0

20 Jun 2025

The Hidden Cost of an Image: Quantifying the Energy Consumption of AI Image Generation

The Hidden Cost of an Image: Quantifying the Energy Consumption of AI Image Generation

Giulia Bertazzini

Chiara Albisani

Daniele Baracchi

Dasara Shullani

Roberto Verdecchia

198

2

0

20 Jun 2025

FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation

FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic ManipulationComputer Vision and Pattern Recognition (CVPR), 2025

202

7

0

19 Jun 2025

DT-UFC: Universal Large Model Feature Coding via Peaky-to-Balanced Distribution Transformation

DT-UFC: Universal Large Model Feature Coding via Peaky-to-Balanced Distribution Transformation

149

1

0

19 Jun 2025

Improving Rectified Flow with Boundary Conditions

Improving Rectified Flow with Boundary Conditions

224

1

0

18 Jun 2025

Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model

Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model

Anirud Aggarwal

Abhinav Shrivastava

417

0

0

18 Jun 2025

Show-o2: Improved Native Unified Multimodal Models

Show-o2: Improved Native Unified Multimodal Models

Mike Zheng Shou

477

90

0

18 Jun 2025

Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

...

257

47

0

18 Jun 2025

FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space

FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space

Black Forest Labs

Stephen Batifol

Frederic Boesel

...

353

343

0

17 Jun 2025

EchoShot: Multi-Shot Portrait Video Generation

EchoShot: Multi-Shot Portrait Video Generation

190

7

0

16 Jun 2025

iDiT-HOI: Inpainting-based Hand Object Interaction Reenactment via Video Diffusion Transformer

iDiT-HOI: Inpainting-based Hand Object Interaction Reenactment via Video Diffusion Transformer

230

0

0

15 Jun 2025

EraserDiT: Fast Video Inpainting with Diffusion Transformer Model

EraserDiT: Fast Video Inpainting with Diffusion Transformer Model

210

0

0

15 Jun 2025

Auditing Data Provenance in Real-world Text-to-Image Diffusion Models for Privacy and Copyright Protection

Auditing Data Provenance in Real-world Text-to-Image Diffusion Models for Privacy and Copyright Protection

205

0

0

13 Jun 2025

PosterCraft: Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

PosterCraft: Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

...

278

8

0

12 Jun 2025

CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation

CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation

330

5

0

12 Jun 2025

Where and How to Perturb: On the Design of Perturbation Guidance in Diffusion and Flow Models

Where and How to Perturb: On the Design of Perturbation Guidance in Diffusion and Flow Models

473

0

0

12 Jun 2025

Pisces: An Auto-regressive Foundation Model for Image Understanding and Generation

Pisces: An Auto-regressive Foundation Model for Image Understanding and Generation

...

Michihiro Yasunaga

361

4

0

12 Jun 2025

DreamActor-H1: High-Fidelity Human-Product Demonstration Video Generation via Motion-designed Diffusion Transformers

DreamActor-H1: High-Fidelity Human-Product Demonstration Video Generation via Motion-designed Diffusion Transformers

438

9

0

12 Jun 2025

Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models

Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models

Francisco Caetano

Christiaan Viviers

Peter H. N. de With

Fons van der Sommen

346

1

0

12 Jun 2025

Consistent Story Generation: Unlocking the Potential of Zigzag Sampling

Consistent Story Generation: Unlocking the Potential of Zigzag Sampling

Marie-Francine Moens

445

0

0

11 Jun 2025

Geometric Regularity in Deterministic Sampling Dynamics of Diffusion-based Generative Models

Geometric Regularity in Deterministic Sampling Dynamics of Diffusion-based Generative Models

332

1

0

11 Jun 2025

ScoreMix: Synthetic Data Generation by Score Composition in Diffusion Models Improves Recognition

ScoreMix: Synthetic Data Generation by Score Composition in Diffusion Models Improves Recognition

S´ebastien Marcel

270

1

0

11 Jun 2025

A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation

A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation

...

306

0

0

11 Jun 2025

1 2 3...11 12 13...23 24 25

Page 12 of 25

Pageof 25