Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2501.15420
Cited By
v1
v2 (latest)
Visual Generation Without Guidance
26 January 2025
Huayu Chen
Kai Jiang
Kaiwen Zheng
Jianfei Chen
Hang Su
Jun Zhu
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (8 upvotes)
Github (52★)
Papers citing
"Visual Generation Without Guidance"
50 / 60 papers shown
Improved Mean Flows: On the Challenges of Fastforward Generative Models
Zhengyang Geng
Yiyang Lu
Zongze Wu
Eli Shechtman
J. Zico Kolter
Kaiming He
AI4CE
250
28
0
01 Dec 2025
FlowerDance: MeanFlow for Efficient and Refined 3D Dance Generation
Kaixing Yang
Xulong Tang
Ziqiao Peng
X. Zhang
Puwei Wang
Jun He
Hongyan Liu
273
5
0
26 Nov 2025
Terminal Velocity Matching
Linqi Zhou
Mathias Parger
Ayaan Haque
Jiaming Song
137
4
0
24 Nov 2025
GenAR: Next-Scale Autoregressive Generation for Spatial Gene Expression Prediction
Jiarui Ouyang
Yihui Wang
Yihang Gao
Yingxue Xu
Shu Yang
Hao Chen
183
0
0
05 Oct 2025
DiffusionNFT: Online Diffusion Reinforcement with Forward Process
Kaiwen Zheng
Huayu Chen
Haotian Ye
Haoxiang Wang
Qinsheng Zhang
Kai Jiang
Hang Su
Stefano Ermon
Jun Zhu
Ming-Yu Liu
360
65
0
19 Sep 2025
NFT: Bridging Supervised Learning and Reinforcement Learning in Math Reasoning
Huayu Chen
Kaiwen Zheng
Qinsheng Zhang
Ganqu Cui
Yin Cui
...
Tsung-Yi Lin
Ming-Yu Liu
Jun Zhu
Haoxiang Wang
Haoxiang Wang
OffRL
LRM
724
24
0
23 May 2025
Gradient-Free Classifier Guidance for Diffusion Model Sampling
Rahul Shenoy
Zhihong Pan
Kaushik Balakrishnan
Qisen Cheng
Yongmoon Jeon
Heejune Yang
Jaewon Kim
281
9
0
23 Nov 2024
Dynamic Negative Guidance of Diffusion Models
International Conference on Learning Representations (ICLR), 2024
Felix Koulischer
Johannes Deleu
G. Raya
T. Demeester
Luca Ambrogioni
DiffM
656
22
0
18 Oct 2024
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens
International Conference on Learning Representations (ICLR), 2024
Lijie Fan
Tianhong Li
Siyang Qin
Yuanzhen Li
Chen Sun
Michael Rubinstein
Deqing Sun
Kaiming He
Yonglong Tian
VLM
DiffM
446
140
0
17 Oct 2024
Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment
International Conference on Learning Representations (ICLR), 2024
Huayu Chen
Hang Su
Peize Sun
Jun Zhu
VLM
281
13
0
12 Oct 2024
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation
International Conference on Learning Representations (ICLR), 2024
Jinheng Xie
Weijia Mao
Zechen Bai
David Junhao Zhang
Weihao Wang
Kevin Qinghong Lin
Yuchao Gu
Zhijie Chen
Zhenheng Yang
Mike Zheng Shou
583
575
0
22 Aug 2024
VAR-CLIP: Text-to-Image Generator with Visual Auto-Regressive Modeling
Qian Zhang
Xiangzi Dai
Ninghua Yang
Xiang An
Ziyong Feng
Xingyu Ren
VLM
CLIP
358
42
0
02 Aug 2024
Autoregressive Image Generation without Vector Quantization
Tianhong Li
Yonglong Tian
He Li
Mingyang Deng
Kaiming He
DiffM
600
586
0
17 Jun 2024
CFG++: Manifold-constrained Classifier Free Guidance for Diffusion Models
Hyungjin Chung
Jeongsol Kim
Geon Yeong Park
Hyelin Nam
Jong Chul Ye
DiffM
248
101
0
12 Jun 2024
An Image is Worth 32 Tokens for Reconstruction and Generation
Qihang Yu
Mark Weber
XueQing Deng
Xiaohui Shen
Daniel Cremers
Liang-Chieh Chen
VLM
ViT
481
253
0
11 Jun 2024
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
Peize Sun
Yi Jiang
Shoufa Chen
Shilong Zhang
Bingyue Peng
Ping Luo
Zehuan Yuan
VLM
637
654
0
10 Jun 2024
Guiding a Diffusion Model with a Bad Version of Itself
Tero Karras
M. Aittala
Tuomas Kynkaanniemi
J. Lehtinen
Timo Aila
S. Laine
459
238
0
04 Jun 2024
Improved Distribution Matching Distillation for Fast Image Synthesis
Tianwei Yin
Michael Gharbi
Taesung Park
Richard Zhang
Eli Shechtman
Frédo Durand
William T. Freeman
DiffM
623
397
0
23 May 2024
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Chameleon Team
MLLM
776
786
0
16 May 2024
Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models
Tuomas Kynkaanniemi
M. Aittala
Tero Karras
S. Laine
Timo Aila
J. Lehtinen
332
199
0
11 Apr 2024
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction
Neural Information Processing Systems (NeurIPS), 2024
Keyu Tian
Yi Jiang
Zehuan Yuan
Zehuan Yuan
Liwei Wang
VGen
493
912
0
03 Apr 2024
Noise Contrastive Alignment of Language Models with Explicit Rewards
Huayu Chen
Guande He
Lifan Yuan
Ganqu Cui
Hang Su
Jun Zhu
470
87
0
08 Feb 2024
One-step Diffusion with Distribution Matching Distillation
Computer Vision and Pattern Recognition (CVPR), 2023
Tianwei Yin
Michael Gharbi
Richard Zhang
Eli Shechtman
Frédo Durand
William T. Freeman
Taesung Park
DiffM
1.2K
694
0
30 Nov 2023
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
Lijun Yu
José Lezama
N. B. Gundavarapu
Luca Versari
Kihyuk Sohn
...
Boqing Gong
Ming-Hsuan Yang
Irfan Essa
David A. Ross
Lu Jiang
528
589
0
09 Oct 2023
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Simian Luo
Yiqin Tan
Longbo Huang
Jian Li
Hang Zhao
DiffM
606
750
0
06 Oct 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Neural Information Processing Systems (NeurIPS), 2023
Rafael Rafailov
Archit Sharma
E. Mitchell
Stefano Ermon
Christopher D. Manning
Chelsea Finn
ALM
1.1K
8,135
0
29 May 2023
Training Diffusion Models with Reinforcement Learning
International Conference on Learning Representations (ICLR), 2023
Kevin Black
Michael Janner
Yilun Du
Ilya Kostrikov
Sergey Levine
EGVM
763
796
0
22 May 2023
MDTv2: Masked Diffusion Transformer is a Strong Image Synthesizer
IEEE International Conference on Computer Vision (ICCV), 2023
Shanghua Gao
Pan Zhou
Mingg-Ming Cheng
Shuicheng Yan
DiffM
1.2K
289
0
25 Mar 2023
Scaling up GANs for Text-to-Image Synthesis
Computer Vision and Pattern Recognition (CVPR), 2023
Minguk Kang
Jun-Yan Zhu
Richard Y. Zhang
Jaesik Park
Eli Shechtman
Sylvain Paris
Taesung Park
573
655
0
09 Mar 2023
Muse: Text-To-Image Generation via Masked Generative Transformers
International Conference on Machine Learning (ICML), 2023
Huiwen Chang
Han Zhang
Jarred Barber
AJ Maschinot
José Lezama
...
Kevin Patrick Murphy
William T. Freeman
Michael Rubinstein
Yuanzhen Li
Dilip Krishnan
DiffM
658
746
0
02 Jan 2023
Scalable Diffusion Models with Transformers
IEEE International Conference on Computer Vision (ICCV), 2022
William S. Peebles
Saining Xie
GNN
2.7K
5,448
0
19 Dec 2022
Reproducible scaling laws for contrastive language-image learning
Computer Vision and Pattern Recognition (CVPR), 2022
Mehdi Cherti
Romain Beaumont
Ross Wightman
Mitchell Wortsman
Gabriel Ilharco
Cade Gordon
Christoph Schuhmann
Ludwig Schmidt
J. Jitsev
VLM
CLIP
723
1,326
0
14 Dec 2022
MAGVIT: Masked Generative Video Transformer
Computer Vision and Pattern Recognition (CVPR), 2022
Lijun Yu
Yong Cheng
Kihyuk Sohn
José Lezama
Han Zhang
...
Alexander G. Hauptmann
Ming-Hsuan Yang
Yuan Hao
Irfan Essa
Lu Jiang
DiffM
VGen
411
368
0
10 Dec 2022
MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis
Computer Vision and Pattern Recognition (CVPR), 2022
Tianhong Li
Huiwen Chang
Shlok Kumar Mishra
Han Zhang
Dina Katabi
Dilip Krishnan
459
260
0
16 Nov 2022
DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models
Machine Intelligence Research (MIR), 2022
Cheng Lu
Yuhao Zhou
Fan Bao
Jianfei Chen
Chongxuan Li
Jun Zhu
DiffM
1.1K
950
0
02 Nov 2022
LAION-5B: An open large-scale dataset for training next generation image-text models
Neural Information Processing Systems (NeurIPS), 2022
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
...
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
VLM
MLLM
CLIP
1.5K
4,964
0
16 Oct 2022
On Distillation of Guided Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2022
Chenlin Meng
Robin Rombach
Ruiqi Gao
Diederik P. Kingma
Stefano Ermon
Jonathan Ho
Tim Salimans
VLM
DiffM
401
767
0
06 Oct 2022
Diffusion Posterior Sampling for General Noisy Inverse Problems
International Conference on Learning Representations (ICLR), 2022
Hyungjin Chung
Jeongsol Kim
Michael T. McCann
M. Klasky
J. C. Ye
DiffM
830
1,441
0
29 Sep 2022
All are Worth Words: A ViT Backbone for Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2022
Fan Bao
Shen Nie
Kaiwen Xue
Yue Cao
Chongxuan Li
Hang Su
Jun Zhu
VLM
699
573
0
25 Sep 2022
Classifier-Free Diffusion Guidance
Jonathan Ho
Tim Salimans
FaML
710
5,964
0
26 Jul 2022
EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations
Neural Information Processing Systems (NeurIPS), 2022
Min Zhao
Fan Bao
Chongxuan Li
Jun Zhu
DiffM
651
253
0
14 Jul 2022
Elucidating the Design Space of Diffusion-Based Generative Models
Neural Information Processing Systems (NeurIPS), 2022
Tero Karras
M. Aittala
Timo Aila
S. Laine
DiffM
1.1K
3,189
0
01 Jun 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Neural Information Processing Systems (NeurIPS), 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
...
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
1.5K
8,076
0
23 May 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
1.5K
8,816
0
13 Apr 2022
Autoregressive Image Generation using Residual Quantization
Computer Vision and Pattern Recognition (CVPR), 2022
Doyup Lee
Chiheon Kim
Saehoon Kim
Minsu Cho
Wook-Shin Han
VGen
1.5K
739
0
03 Mar 2022
MaskGIT: Masked Generative Image Transformer
Computer Vision and Pattern Recognition (CVPR), 2022
Huiwen Chang
Han Zhang
Lu Jiang
Ce Liu
William T. Freeman
ViT
762
1,110
0
08 Feb 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2021
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
DiffM
4.8K
23,580
0
20 Dec 2021
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
International Conference on Machine Learning (ICML), 2021
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
1.4K
4,672
0
20 Dec 2021
Vector-quantized Image Modeling with Improved VQGAN
International Conference on Learning Representations (ICLR), 2021
Jiahui Yu
Xin Li
Jing Yu Koh
Han Zhang
Ruoming Pang
James Qin
Alexander Ku
Yuanzhong Xu
Jason Baldridge
Yonghui Wu
ViT
VLM
DRL
692
741
0
09 Oct 2021
Variational Diffusion Models
Diederik P. Kingma
Tim Salimans
Ben Poole
Jonathan Ho
DiffM
1.1K
1,448
0
01 Jul 2021
1
2
Next
Page 1 of 2