ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.15505
  4. Cited By
Finite Scalar Quantization: VQ-VAE Made Simple

Finite Scalar Quantization: VQ-VAE Made Simple

27 September 2023
Fabian Mentzer
David C. Minnen
E. Agustsson
Michael Tschannen
ArXivPDFHTML

Papers citing "Finite Scalar Quantization: VQ-VAE Made Simple"

18 / 118 papers shown
Title
SimpleSpeech: Towards Simple and Efficient Text-to-Speech with Scalar
  Latent Transformer Diffusion Models
SimpleSpeech: Towards Simple and Efficient Text-to-Speech with Scalar Latent Transformer Diffusion Models
Dongchao Yang
Dingdong Wang
Haohan Guo
Xueyuan Chen
Xixin Wu
Helen M. Meng
57
25
0
04 Jun 2024
$\text{Di}^2\text{Pose}$: Discrete Diffusion Model for Occluded 3D Human
  Pose Estimation
Di2Pose\text{Di}^2\text{Pose}Di2Pose: Discrete Diffusion Model for Occluded 3D Human Pose Estimation
Weiquan Wang
Jun Xiao
Chunping Wang
Wei Liu
Zhao Wang
Long Chen
DiffM
34
1
0
27 May 2024
VQDNA: Unleashing the Power of Vector Quantization for Multi-Species
  Genomic Sequence Modeling
VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling
Siyuan Li
Zedong Wang
Zicheng Liu
Di Wu
Cheng Tan
Jiangbin Zheng
Yufei Huang
Stan Z. Li
29
7
0
13 May 2024
Efficient Text-driven Motion Generation via Latent Consistency Training
Efficient Text-driven Motion Generation via Latent Consistency Training
Mengxian Hu
Minghao Zhu
Xun Zhou
Qingqing Yan
Shu Li
Chengju Liu
Qi Chen
33
1
0
05 May 2024
ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized
  Transformers
ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers
Yuzhe Gu
Enmao Diao
21
4
0
30 Apr 2024
Tripod: Three Complementary Inductive Biases for Disentangled
  Representation Learning
Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning
Kyle Hsu
Jubayer Ibn Hamid
Kaylee Burns
Chelsea Finn
Jiajun Wu
CML
19
4
0
16 Apr 2024
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale
  Prediction
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction
Keyu Tian
Yi-Xin Jiang
Zehuan Yuan
Bingyue Peng
Liwei Wang
VGen
25
248
0
03 Apr 2024
GLAD: Improving Latent Graph Generative Modeling with Simple
  Quantization
GLAD: Improving Latent Graph Generative Modeling with Simple Quantization
Van Khoa Nguyen
Yoann Boget
Frantzeska Lavda
Alexandros Kalousis
29
2
0
25 Mar 2024
Unlocking the Potential of Multimodal Unified Discrete Representation
  through Training-Free Codebook Optimization and Hierarchical Alignment
Unlocking the Potential of Multimodal Unified Discrete Representation through Training-Free Codebook Optimization and Hierarchical Alignment
Hai Huang
Yan Xia
Shengpeng Ji
Shulei Wang
Hanting Wang
Jieming Zhu
Zhenhua Dong
Zhou Zhao
22
6
0
08 Mar 2024
FoldToken: Learning Protein Language via Vector Quantization and Beyond
FoldToken: Learning Protein Language via Vector Quantization and Beyond
Zhangyang Gao
Cheng Tan
Jue Wang
Yufei Huang
Lirong Wu
Stan Z. Li
25
9
0
04 Feb 2024
Machine Perceptual Quality: Evaluating the Impact of Severe Lossy
  Compression on Audio and Image Models
Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models
Dan G. Jacobellis
Daniel Cummings
N. Yadwadkar
11
2
0
15 Jan 2024
GIVT: Generative Infinite-Vocabulary Transformers
GIVT: Generative Infinite-Vocabulary Transformers
Michael Tschannen
Cian Eastwood
Fabian Mentzer
10
34
0
04 Dec 2023
On the Identifiability of Quantized Factors
On the Identifiability of Quantized Factors
Vitória Barin Pacela
Kartik Ahuja
Simon Lacoste-Julien
Pascal Vincent
OOD
CML
8
1
0
28 Jun 2023
Not All Image Regions Matter: Masked Vector Quantization for
  Autoregressive Image Generation
Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation
Mengqi Huang
Zhendong Mao
Quang Wang
Yongdong Zhang
VGen
DiffM
68
21
0
23 May 2023
Muse: Text-To-Image Generation via Masked Generative Transformers
Muse: Text-To-Image Generation via Masked Generative Transformers
Huiwen Chang
Han Zhang
Jarred Barber
AJ Maschinot
José Lezama
...
Kevin Patrick Murphy
William T. Freeman
Michael Rubinstein
Yuanzhen Li
Dilip Krishnan
DiffM
197
517
0
02 Jan 2023
UViM: A Unified Modeling Approach for Vision with Learned Guiding Codes
UViM: A Unified Modeling Approach for Vision with Learned Guiding Codes
Alexander Kolesnikov
André Susano Pinto
Lucas Beyer
Xiaohua Zhai
Jeremiah Harmsen
N. Houlsby
103
67
0
20 May 2022
Autoregressive Image Generation using Residual Quantization
Autoregressive Image Generation using Residual Quantization
Doyup Lee
Chiheon Kim
Saehoon Kim
Minsu Cho
Wook-Shin Han
VGen
168
325
0
03 Mar 2022
Colorization Transformer
Colorization Transformer
Manoj Kumar
Dirk Weissenborn
Nal Kalchbrenner
ViT
218
140
0
08 Feb 2021
Previous
123