Visual Generation Without Guidance

26 January 2025

Papers citing "Visual Generation Without Guidance"

Improved Mean Flows: On the Challenges of Fastforward Generative Models

250

01 Dec 2025

FlowerDance: MeanFlow for Efficient and Refined 3D Dance Generation

273

26 Nov 2025

Terminal Velocity Matching

137

24 Nov 2025

GenAR: Next-Scale Autoregressive Generation for Spatial Gene Expression Prediction

183

05 Oct 2025

DiffusionNFT: Online Diffusion Reinforcement with Forward Process

360

19 Sep 2025

NFT: Bridging Supervised Learning and Reinforcement Learning in Math Reasoning

...

724

23 May 2025

Gradient-Free Classifier Guidance for Diffusion Model Sampling

281

23 Nov 2024

Dynamic Negative Guidance of Diffusion Models

International Conference on Learning Representations (ICLR), 2024

656

18 Oct 2024

Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens

International Conference on Learning Representations (ICLR), 2024

Yuanzhen Li

Michael Rubinstein

446

140

17 Oct 2024

Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment

International Conference on Learning Representations (ICLR), 2024

281

12 Oct 2024

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

International Conference on Learning Representations (ICLR), 2024

Weihao Wang

Kevin Qinghong Lin

Yuchao Gu

Zhijie Chen

Zhenheng Yang

Mike Zheng Shou

583

575

22 Aug 2024

VAR-CLIP: Text-to-Image Generator with Visual Auto-Regressive Modeling

Xiang An

Xingyu Ren

358

02 Aug 2024

Autoregressive Image Generation without Vector Quantization

600

586

17 Jun 2024

CFG++: Manifold-constrained Classifier Free Guidance for Diffusion Models

Hyungjin Chung

Jong Chul Ye

248

101

12 Jun 2024

An Image is Worth 32 Tokens for Reconstruction and Generation

Daniel Cremers

481

253

11 Jun 2024

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Yi Jiang

Bingyue Peng

637

654

10 Jun 2024

Guiding a Diffusion Model with a Bad Version of Itself

459

238

04 Jun 2024

Improved Distribution Matching Distillation for Fast Image Synthesis

623

397

23 May 2024

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Chameleon Team

MLLM

776

786

16 May 2024

Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models

332

199

11 Apr 2024

Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction

Neural Information Processing Systems (NeurIPS), 2024

493

912

03 Apr 2024

Noise Contrastive Alignment of Language Models with Explicit Rewards

Jun Zhu

470

08 Feb 2024

One-step Diffusion with Distribution Matching Distillation

Computer Vision and Pattern Recognition (CVPR), 2023

1.2K

694

30 Nov 2023

Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation

...

Ming-Hsuan Yang

528

589

09 Oct 2023

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Hang Zhao

606

750

06 Oct 2023

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Neural Information Processing Systems (NeurIPS), 2023

Christopher D. Manning

Chelsea Finn

ALM

1.1K

8,135

29 May 2023

Training Diffusion Models with Reinforcement Learning

International Conference on Learning Representations (ICLR), 2023

763

796

22 May 2023

MDTv2: Masked Diffusion Transformer is a Strong Image Synthesizer

IEEE International Conference on Computer Vision (ICCV), 2023

1.2K

289

25 Mar 2023

Scaling up GANs for Text-to-Image Synthesis

Computer Vision and Pattern Recognition (CVPR), 2023

Jun-Yan Zhu

573

655

09 Mar 2023

Muse: Text-To-Image Generation via Masked Generative Transformers

International Conference on Machine Learning (ICML), 2023

...

William T. Freeman

Michael Rubinstein

Yuanzhen Li

Dilip Krishnan

DiffM

658

746

02 Jan 2023

Scalable Diffusion Models with Transformers

IEEE International Conference on Computer Vision (ICCV), 2022

William S. Peebles

Saining Xie

GNN

2.7K

5,448

19 Dec 2022

Reproducible scaling laws for contrastive language-image learning

Computer Vision and Pattern Recognition (CVPR), 2022

723

1,326

14 Dec 2022

MAGVIT: Masked Generative Video Transformer

Computer Vision and Pattern Recognition (CVPR), 2022

...

Alexander G. Hauptmann

Ming-Hsuan Yang

411

368

10 Dec 2022

MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis

Computer Vision and Pattern Recognition (CVPR), 2022

459

260

16 Nov 2022

DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models

Machine Intelligence Research (MIR), 2022

1.1K

950

02 Nov 2022

LAION-5B: An open large-scale dataset for training next generation image-text models

Neural Information Processing Systems (NeurIPS), 2022

...

1.5K

4,964

16 Oct 2022

On Distillation of Guided Diffusion Models

Computer Vision and Pattern Recognition (CVPR), 2022

Ruiqi Gao

401

767

06 Oct 2022

Diffusion Posterior Sampling for General Noisy Inverse Problems

International Conference on Learning Representations (ICLR), 2022

830

1,441

29 Sep 2022

All are Worth Words: A ViT Backbone for Diffusion Models

Computer Vision and Pattern Recognition (CVPR), 2022

Hang Su

Jun Zhu

VLM

699

573

25 Sep 2022

Classifier-Free Diffusion Guidance

Jonathan Ho

Tim Salimans

FaML

710

5,964

26 Jul 2022

EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations

Neural Information Processing Systems (NeurIPS), 2022

Jun Zhu

651

253

14 Jul 2022

Elucidating the Design Space of Diffusion-Based Generative Models

Neural Information Processing Systems (NeurIPS), 2022

1.1K

3,189

01 Jun 2022

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Neural Information Processing Systems (NeurIPS), 2022

...

Raphael Gontijo-Lopes

David J Fleet

1.5K

8,076

23 May 2022

Hierarchical Text-Conditional Image Generation with CLIP Latents

1.5K

8,816

13 Apr 2022

Autoregressive Image Generation using Residual Quantization

Computer Vision and Pattern Recognition (CVPR), 2022

1.5K

739

03 Mar 2022

MaskGIT: Masked Generative Image Transformer

Computer Vision and Pattern Recognition (CVPR), 2022

William T. Freeman

762

1,110

08 Feb 2022

High-Resolution Image Synthesis with Latent Diffusion Models

Computer Vision and Pattern Recognition (CVPR), 2021

4.8K

23,580

20 Dec 2021

GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models

International Conference on Machine Learning (ICML), 2021

1.4K

4,672

20 Dec 2021

Vector-quantized Image Modeling with Improved VQGAN

International Conference on Learning Representations (ICLR), 2021

692

741

09 Oct 2021

Variational Diffusion Models

1.1K

1,448

01 Jul 2021