ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2501.15420
  4. Cited By
Visual Generation Without Guidance
v1v2 (latest)

Visual Generation Without Guidance

26 January 2025
Huayu Chen
Kai Jiang
Kaiwen Zheng
Jianfei Chen
Hang Su
Jun Zhu
    VLM
ArXiv (abs)PDFHTMLHuggingFace (8 upvotes)Github (52★)

Papers citing "Visual Generation Without Guidance"

50 / 60 papers shown
Improved Mean Flows: On the Challenges of Fastforward Generative Models
Improved Mean Flows: On the Challenges of Fastforward Generative Models
Zhengyang Geng
Yiyang Lu
Zongze Wu
Eli Shechtman
J. Zico Kolter
Kaiming He
AI4CE
250
28
0
01 Dec 2025
FlowerDance: MeanFlow for Efficient and Refined 3D Dance Generation
FlowerDance: MeanFlow for Efficient and Refined 3D Dance Generation
Kaixing Yang
Xulong Tang
Ziqiao Peng
X. Zhang
Puwei Wang
Jun He
Hongyan Liu
273
5
0
26 Nov 2025
Terminal Velocity Matching
Terminal Velocity Matching
Linqi Zhou
Mathias Parger
Ayaan Haque
Jiaming Song
137
4
0
24 Nov 2025
GenAR: Next-Scale Autoregressive Generation for Spatial Gene Expression Prediction
GenAR: Next-Scale Autoregressive Generation for Spatial Gene Expression Prediction
Jiarui Ouyang
Yihui Wang
Yihang Gao
Yingxue Xu
Shu Yang
Hao Chen
183
0
0
05 Oct 2025
DiffusionNFT: Online Diffusion Reinforcement with Forward Process
DiffusionNFT: Online Diffusion Reinforcement with Forward Process
Kaiwen Zheng
Huayu Chen
Haotian Ye
Haoxiang Wang
Qinsheng Zhang
Kai Jiang
Hang Su
Stefano Ermon
Jun Zhu
Ming-Yu Liu
360
65
0
19 Sep 2025
NFT: Bridging Supervised Learning and Reinforcement Learning in Math Reasoning
NFT: Bridging Supervised Learning and Reinforcement Learning in Math Reasoning
Huayu Chen
Kaiwen Zheng
Qinsheng Zhang
Ganqu Cui
Yin Cui
...
Tsung-Yi Lin
Ming-Yu Liu
Jun Zhu
Haoxiang Wang
Haoxiang Wang
OffRLLRM
724
24
0
23 May 2025
Gradient-Free Classifier Guidance for Diffusion Model Sampling
Gradient-Free Classifier Guidance for Diffusion Model Sampling
Rahul Shenoy
Zhihong Pan
Kaushik Balakrishnan
Qisen Cheng
Yongmoon Jeon
Heejune Yang
Jaewon Kim
281
9
0
23 Nov 2024
Dynamic Negative Guidance of Diffusion Models
Dynamic Negative Guidance of Diffusion ModelsInternational Conference on Learning Representations (ICLR), 2024
Felix Koulischer
Johannes Deleu
G. Raya
T. Demeester
Luca Ambrogioni
DiffM
656
22
0
18 Oct 2024
Fluid: Scaling Autoregressive Text-to-image Generative Models with
  Continuous Tokens
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous TokensInternational Conference on Learning Representations (ICLR), 2024
Lijie Fan
Tianhong Li
Siyang Qin
Yuanzhen Li
Chen Sun
Michael Rubinstein
Deqing Sun
Kaiming He
Yonglong Tian
VLMDiffM
446
140
0
17 Oct 2024
Toward Guidance-Free AR Visual Generation via Condition Contrastive
  Alignment
Toward Guidance-Free AR Visual Generation via Condition Contrastive AlignmentInternational Conference on Learning Representations (ICLR), 2024
Huayu Chen
Hang Su
Peize Sun
Jun Zhu
VLM
281
13
0
12 Oct 2024
Show-o: One Single Transformer to Unify Multimodal Understanding and
  Generation
Show-o: One Single Transformer to Unify Multimodal Understanding and GenerationInternational Conference on Learning Representations (ICLR), 2024
Jinheng Xie
Weijia Mao
Zechen Bai
David Junhao Zhang
Weihao Wang
Kevin Qinghong Lin
Yuchao Gu
Zhijie Chen
Zhenheng Yang
Mike Zheng Shou
583
575
0
22 Aug 2024
VAR-CLIP: Text-to-Image Generator with Visual Auto-Regressive Modeling
VAR-CLIP: Text-to-Image Generator with Visual Auto-Regressive Modeling
Qian Zhang
Xiangzi Dai
Ninghua Yang
Xiang An
Ziyong Feng
Xingyu Ren
VLMCLIP
358
42
0
02 Aug 2024
Autoregressive Image Generation without Vector Quantization
Autoregressive Image Generation without Vector Quantization
Tianhong Li
Yonglong Tian
He Li
Mingyang Deng
Kaiming He
DiffM
600
586
0
17 Jun 2024
CFG++: Manifold-constrained Classifier Free Guidance for Diffusion
  Models
CFG++: Manifold-constrained Classifier Free Guidance for Diffusion Models
Hyungjin Chung
Jeongsol Kim
Geon Yeong Park
Hyelin Nam
Jong Chul Ye
DiffM
248
101
0
12 Jun 2024
An Image is Worth 32 Tokens for Reconstruction and Generation
An Image is Worth 32 Tokens for Reconstruction and Generation
Qihang Yu
Mark Weber
XueQing Deng
Xiaohui Shen
Daniel Cremers
Liang-Chieh Chen
VLMViT
481
253
0
11 Jun 2024
Autoregressive Model Beats Diffusion: Llama for Scalable Image
  Generation
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
Peize Sun
Yi Jiang
Shoufa Chen
Shilong Zhang
Bingyue Peng
Ping Luo
Zehuan Yuan
VLM
637
654
0
10 Jun 2024
Guiding a Diffusion Model with a Bad Version of Itself
Guiding a Diffusion Model with a Bad Version of Itself
Tero Karras
M. Aittala
Tuomas Kynkaanniemi
J. Lehtinen
Timo Aila
S. Laine
459
238
0
04 Jun 2024
Improved Distribution Matching Distillation for Fast Image Synthesis
Improved Distribution Matching Distillation for Fast Image Synthesis
Tianwei Yin
Michael Gharbi
Taesung Park
Richard Zhang
Eli Shechtman
Frédo Durand
William T. Freeman
DiffM
623
397
0
23 May 2024
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Chameleon Team
MLLM
776
786
0
16 May 2024
Applying Guidance in a Limited Interval Improves Sample and Distribution
  Quality in Diffusion Models
Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models
Tuomas Kynkaanniemi
M. Aittala
Tero Karras
S. Laine
Timo Aila
J. Lehtinen
332
199
0
11 Apr 2024
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale
  Prediction
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale PredictionNeural Information Processing Systems (NeurIPS), 2024
Keyu Tian
Yi Jiang
Zehuan Yuan
Zehuan Yuan
Liwei Wang
VGen
493
912
0
03 Apr 2024
Noise Contrastive Alignment of Language Models with Explicit Rewards
Noise Contrastive Alignment of Language Models with Explicit Rewards
Huayu Chen
Guande He
Lifan Yuan
Ganqu Cui
Hang Su
Jun Zhu
470
87
0
08 Feb 2024
One-step Diffusion with Distribution Matching Distillation
One-step Diffusion with Distribution Matching DistillationComputer Vision and Pattern Recognition (CVPR), 2023
Tianwei Yin
Michael Gharbi
Richard Zhang
Eli Shechtman
Frédo Durand
William T. Freeman
Taesung Park
DiffM
1.2K
694
0
30 Nov 2023
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
Lijun Yu
José Lezama
N. B. Gundavarapu
Luca Versari
Kihyuk Sohn
...
Boqing Gong
Ming-Hsuan Yang
Irfan Essa
David A. Ross
Lu Jiang
528
589
0
09 Oct 2023
Latent Consistency Models: Synthesizing High-Resolution Images with
  Few-Step Inference
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Simian Luo
Yiqin Tan
Longbo Huang
Jian Li
Hang Zhao
DiffM
606
750
0
06 Oct 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward
  Model
Direct Preference Optimization: Your Language Model is Secretly a Reward ModelNeural Information Processing Systems (NeurIPS), 2023
Rafael Rafailov
Archit Sharma
E. Mitchell
Stefano Ermon
Christopher D. Manning
Chelsea Finn
ALM
1.1K
8,135
0
29 May 2023
Training Diffusion Models with Reinforcement Learning
Training Diffusion Models with Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2023
Kevin Black
Michael Janner
Yilun Du
Ilya Kostrikov
Sergey Levine
EGVM
763
796
0
22 May 2023
MDTv2: Masked Diffusion Transformer is a Strong Image Synthesizer
MDTv2: Masked Diffusion Transformer is a Strong Image SynthesizerIEEE International Conference on Computer Vision (ICCV), 2023
Shanghua Gao
Pan Zhou
Mingg-Ming Cheng
Shuicheng Yan
DiffM
1.2K
289
0
25 Mar 2023
Scaling up GANs for Text-to-Image Synthesis
Scaling up GANs for Text-to-Image SynthesisComputer Vision and Pattern Recognition (CVPR), 2023
Minguk Kang
Jun-Yan Zhu
Richard Y. Zhang
Jaesik Park
Eli Shechtman
Sylvain Paris
Taesung Park
573
655
0
09 Mar 2023
Muse: Text-To-Image Generation via Masked Generative Transformers
Muse: Text-To-Image Generation via Masked Generative TransformersInternational Conference on Machine Learning (ICML), 2023
Huiwen Chang
Han Zhang
Jarred Barber
AJ Maschinot
José Lezama
...
Kevin Patrick Murphy
William T. Freeman
Michael Rubinstein
Yuanzhen Li
Dilip Krishnan
DiffM
658
746
0
02 Jan 2023
Scalable Diffusion Models with Transformers
Scalable Diffusion Models with TransformersIEEE International Conference on Computer Vision (ICCV), 2022
William S. Peebles
Saining Xie
GNN
2.7K
5,448
0
19 Dec 2022
Reproducible scaling laws for contrastive language-image learning
Reproducible scaling laws for contrastive language-image learningComputer Vision and Pattern Recognition (CVPR), 2022
Mehdi Cherti
Romain Beaumont
Ross Wightman
Mitchell Wortsman
Gabriel Ilharco
Cade Gordon
Christoph Schuhmann
Ludwig Schmidt
J. Jitsev
VLMCLIP
723
1,326
0
14 Dec 2022
MAGVIT: Masked Generative Video Transformer
MAGVIT: Masked Generative Video TransformerComputer Vision and Pattern Recognition (CVPR), 2022
Lijun Yu
Yong Cheng
Kihyuk Sohn
José Lezama
Han Zhang
...
Alexander G. Hauptmann
Ming-Hsuan Yang
Yuan Hao
Irfan Essa
Lu Jiang
DiffMVGen
411
368
0
10 Dec 2022
MAGE: MAsked Generative Encoder to Unify Representation Learning and
  Image Synthesis
MAGE: MAsked Generative Encoder to Unify Representation Learning and Image SynthesisComputer Vision and Pattern Recognition (CVPR), 2022
Tianhong Li
Huiwen Chang
Shlok Kumar Mishra
Han Zhang
Dina Katabi
Dilip Krishnan
459
260
0
16 Nov 2022
DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models
DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic ModelsMachine Intelligence Research (MIR), 2022
Cheng Lu
Yuhao Zhou
Fan Bao
Jianfei Chen
Chongxuan Li
Jun Zhu
DiffM
1.1K
950
0
02 Nov 2022
LAION-5B: An open large-scale dataset for training next generation
  image-text models
LAION-5B: An open large-scale dataset for training next generation image-text modelsNeural Information Processing Systems (NeurIPS), 2022
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
...
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
VLMMLLMCLIP
1.5K
4,964
0
16 Oct 2022
On Distillation of Guided Diffusion Models
On Distillation of Guided Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2022
Chenlin Meng
Robin Rombach
Ruiqi Gao
Diederik P. Kingma
Stefano Ermon
Jonathan Ho
Tim Salimans
VLMDiffM
401
767
0
06 Oct 2022
Diffusion Posterior Sampling for General Noisy Inverse Problems
Diffusion Posterior Sampling for General Noisy Inverse ProblemsInternational Conference on Learning Representations (ICLR), 2022
Hyungjin Chung
Jeongsol Kim
Michael T. McCann
M. Klasky
J. C. Ye
DiffM
830
1,441
0
29 Sep 2022
All are Worth Words: A ViT Backbone for Diffusion Models
All are Worth Words: A ViT Backbone for Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2022
Fan Bao
Shen Nie
Kaiwen Xue
Yue Cao
Chongxuan Li
Hang Su
Jun Zhu
VLM
699
573
0
25 Sep 2022
Classifier-Free Diffusion Guidance
Classifier-Free Diffusion Guidance
Jonathan Ho
Tim Salimans
FaML
710
5,964
0
26 Jul 2022
EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic
  Differential Equations
EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential EquationsNeural Information Processing Systems (NeurIPS), 2022
Min Zhao
Fan Bao
Chongxuan Li
Jun Zhu
DiffM
651
253
0
14 Jul 2022
Elucidating the Design Space of Diffusion-Based Generative Models
Elucidating the Design Space of Diffusion-Based Generative ModelsNeural Information Processing Systems (NeurIPS), 2022
Tero Karras
M. Aittala
Timo Aila
S. Laine
DiffM
1.1K
3,189
0
01 Jun 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding
Photorealistic Text-to-Image Diffusion Models with Deep Language UnderstandingNeural Information Processing Systems (NeurIPS), 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
...
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
1.5K
8,076
0
23 May 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLMDiffM
1.5K
8,816
0
13 Apr 2022
Autoregressive Image Generation using Residual Quantization
Autoregressive Image Generation using Residual QuantizationComputer Vision and Pattern Recognition (CVPR), 2022
Doyup Lee
Chiheon Kim
Saehoon Kim
Minsu Cho
Wook-Shin Han
VGen
1.5K
739
0
03 Mar 2022
MaskGIT: Masked Generative Image Transformer
MaskGIT: Masked Generative Image TransformerComputer Vision and Pattern Recognition (CVPR), 2022
Huiwen Chang
Han Zhang
Lu Jiang
Ce Liu
William T. Freeman
ViT
762
1,110
0
08 Feb 2022
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2021
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
DiffM
4.8K
23,580
0
20 Dec 2021
GLIDE: Towards Photorealistic Image Generation and Editing with
  Text-Guided Diffusion Models
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion ModelsInternational Conference on Machine Learning (ICML), 2021
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
1.4K
4,672
0
20 Dec 2021
Vector-quantized Image Modeling with Improved VQGAN
Vector-quantized Image Modeling with Improved VQGANInternational Conference on Learning Representations (ICLR), 2021
Jiahui Yu
Xin Li
Jing Yu Koh
Han Zhang
Ruoming Pang
James Qin
Alexander Ku
Yuanzhong Xu
Jason Baldridge
Yonghui Wu
ViTVLMDRL
692
741
0
09 Oct 2021
Variational Diffusion Models
Variational Diffusion Models
Diederik P. Kingma
Tim Salimans
Ben Poole
Jonathan Ho
DiffM
1.1K
1,448
0
01 Jul 2021
12
Next
Page 1 of 2