Vector-quantized Image Modeling with Improved VQGAN

9 October 2021
Jiahui Yu
Xin Li
Jing Yu Koh
Han Zhang
Ruoming Pang
James Qin
Alexander Ku
Yuanzhong Xu
Jason Baldridge
Yonghui Wu
    ViT
    VLM
    DRL
arXiv: 2110.04627

Papers citing "Vector-quantized Image Modeling with Improved VQGAN"

50 / 372 papers shown
MeLM, a generative pretrained language modeling framework that solves forward and inverse mechanics problems
Markus J. Buehler
AI4CE
14
41
0
30 Jun 2023
Generate Anything Anywhere in Any Scene
Yuheng Li
Haotian Liu
Yangming Wen
Yong Jae Lee
DiffM
30
12
0
29 Jun 2023
CLIPAG: Towards Generator-Free Text-to-Image Generation
Roy Ganz
Michael Elad
VLM
18
7
0
29 Jun 2023
Training Multimedia Event Extraction With Generated Images and Captions
Zilin Du
Yunxin Li
Xu Guo
Yidan Sun
Boyang Albert Li
DiffM
21
7
0
15 Jun 2023
VidEdit: Zero-Shot and Spatially Aware Text-Driven Video Editing
Paul Couairon
Clément Rambour
Jean-Emmanuel Haugeard
Nicolas Thome
DiffM
VGen
4
29
0
14 Jun 2023
Better Generalization with Semantic IDs: A Case Study in Ranking for Recommendations
Anima Singh
Trung Vu
Nikhil Mehta
Raghunandan H. Keshavan
M. Sathiamoorthy
...
Lukasz Heldt
Li Wei
Devansh Tandon
Ed H. Chi
Xinyang Yi
11
19
0
13 Jun 2023
High-Fidelity Audio Compression with Improved RVQGAN
Rithesh Kumar
Prem Seetharaman
Alejandro Luebs
I. Kumar
Kundan Kumar
18
282
0
11 Jun 2023
Learning Image-Adaptive Codebooks for Class-Agnostic Image Restoration
Kechun Liu
Yitong Jiang
Inchang Choi
Jinwei Gu
11
13
0
10 Jun 2023
How Can Recommender Systems Benefit from Large Language Models: A Survey
Jianghao Lin
Xinyi Dai
Yunjia Xi
Weiwen Liu
Bo Chen
...
Chenxu Zhu
Huifeng Guo
Yong Yu
Ruiming Tang
Weinan Zhang
LRM
30
195
0
09 Jun 2023
ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process
Changyao Tian
Chenxin Tao
Jifeng Dai
Hao Li
Ziheng Li
Lewei Lu
Xiaogang Wang
Hongsheng Li
Gao Huang
Xizhou Zhu
DiffM
23
9
0
08 Jun 2023
PassGPT: Password Modeling and (Guided) Generation with Large Language Models
Javier Rando
F. Pérez-Cruz
B. Hitaj
GAN
14
8
0
02 Jun 2023
StyleDrop: Text-to-Image Generation in Any Style
Kihyuk Sohn
Nataniel Ruiz
Kimin Lee
Daniel Castro Chin
Irina Blok
...
Yuanzhen Li
Yuan Hao
Irfan Essa
Michael Rubinstein
Dilip Krishnan
4
141
0
01 Jun 2023
Learning Sampling Dictionaries for Efficient and Generalizable Robot Motion Planning with Transformers
Jacob J. Johnson
A. H. Qureshi
Michael C. Yip
17
12
0
01 Jun 2023
Data Interpolants -- That's What Discriminators in Higher-order Gradient-regularized GANs Are
Siddarth Asokan
C. Seelamantula
19
4
0
01 Jun 2023
Too Large; Data Reduction for Vision-Language Pre-Training
Alex Jinpeng Wang
Kevin Qinghong Lin
David Junhao Zhang
Stan Weixian Lei
Mike Zheng Shou
VLM
21
24
0
31 May 2023
SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-driven Video Editing
Nazmul Karim
Umar Khalid
M. Joneidi
Chen Chen
Nazanin Rahnavard
DiffM
VGen
19
5
0
30 May 2023
BRICS: Bi-level feature Representation of Image CollectionS
Dingdong Yang
Yizhi Wang
Ali Mahdavi-Amiri
Hao Zhang
DiffM
8
0
0
29 May 2023
PaLI-X: On Scaling up a Multilingual Vision and Language Model
Xi Chen
Josip Djolonga
Piotr Padlewski
Basil Mustafa
Soravit Changpinyo
...
Mojtaba Seyedhosseini
A. Angelova
Xiaohua Zhai
N. Houlsby
Radu Soricut
VLM
44
187
0
29 May 2023
Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors
Paul S. Scotti
Atmadeep Banerjee
J. Goode
Stepan Shabalin
A. Nguyen
...
Nathalie Verlinde
Elad Yundler
David Weisberg
K. A. Norman
Tanishq Mathew Abraham
DiffM
32
106
0
29 May 2023
Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising
Fu-Yun Wang
Wenshuo Chen
Guanglu Song
Han-Jia Ye
Yu Liu
Hongsheng Li
VGen
DiffM
33
88
0
29 May 2023
GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes
Ibrahim Ethem Hamamci
Sezgin Er
Anjany Sekuboyina
Enis Simsar
A. Tezcan
...
Hadrien Reynaud
Sarthak Pati
Christian Bluethgen
M. K. Özdemir
Bjoern H. Menze
DiffM
MedIm
32
16
0
25 May 2023
Parameter Estimation in DAGs from Incomplete Data via Optimal Transport
Vy Vo
Trung Le
L. Vuong
He Zhao
Edwin V. Bonilla
Dinh Q. Phung
OT
21
4
0
25 May 2023
Reparo: Loss-Resilient Generative Codec for Video Conferencing
Tianhong Li
Vibhaalakshmi Sivaraman
Pantea Karimi
Lijie Fan
M. Alizadeh
Dina Katabi
14
7
0
23 May 2023
Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes From Text-To-Image Models
Y. Qu
Xinyue Shen
Xinlei He
Michael Backes
Savvas Zannettou
Yang Zhang
14
105
0
23 May 2023
Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative Training
Utku Ozbulak
Hyun Jung Lee
Beril Boga
Esla Timothy Anzaku
Ho-min Park
Arnout Van Messem
W. D. Neve
J. Vankerschaver
DiffM
19
36
0
23 May 2023
Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation
Mengqi Huang
Zhendong Mao
Quang Wang
Yongdong Zhang
VGen
DiffM
68
21
0
23 May 2023
Watermarking Diffusion Model
Yugeng Liu
Zheng Li
Michael Backes
Yun Shen
Yang Zhang
WIGM
13
34
0
21 May 2023
Inventing art styles with no artistic training data
Nilin Abrahamsen
Jiahao Yao
GAN
14
3
0
19 May 2023
Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization
Mengqi Huang
Zhendong Mao
Zhuowei Chen
Yongdong Zhang
MQ
24
35
0
19 May 2023
Restoring Images Captured in Arbitrary Hybrid Adverse Weather Conditions in One Go
Yecong Wan
Mingzhen Shao
Yuanshuo Cheng
YueQin Liu
Zhipeng Bao
19
5
0
17 May 2023
Straightening Out the Straight-Through Estimator: Overcoming Optimization Challenges in Vector Quantized Networks
Minyoung Huh
Brian Cheung
Pulkit Agrawal
Phillip Isola
MQ
17
48
0
15 May 2023
Spider GAN: Leveraging Friendly Neighbors to Accelerate GAN Training
Siddarth Asokan
C. Seelamantula
GAN
53
1
0
12 May 2023
V2Meow: Meowing to the Visual Beat via Video-to-Music Generation
Kun Su
Judith Yue Li
Qingqing Huang
Dima Kuzmin
Joonseok Lee
...
Fei Sha
A. Jansen
Yu Wang
Mauro Verzetti
Timo I. Denk
VGen
26
12
0
11 May 2023
Recommender Systems with Generative Retrieval
Shashank Rajput
Nikhil Mehta
Anima Singh
Raghunandan H. Keshavan
T. Vu
...
Vinh Q. Tran
Jonah Samost
Maciej Kula
Ed H. Chi
M. Sathiamoorthy
RALM
3DV
18
74
0
08 May 2023
FashionTex: Controllable Virtual Try-on with Text and Texture
Anran Lin
Nanxuan Zhao
Shuliang Ning
Yuda Qiu
Baoyuan Wang
Xiaoguang Han
DiffM
25
12
0
08 May 2023
A vector quantized masked autoencoder for audiovisual speech emotion recognition
Samir Sadok
Simon Leglaive
Renaud Séguier
SSL
79
6
0
05 May 2023
Catch Missing Details: Image Reconstruction with Frequency Augmented Variational Autoencoder
Xinmiao Lin
Yikang Li
Jenhao Hsiao
C. Ho
Yu Kong
80
16
0
04 May 2023
Geometric Latent Diffusion Models for 3D Molecule Generation
Minkai Xu
Alexander Powers
R. Dror
Stefano Ermon
J. Leskovec
DiffM
AI4CE
48
133
0
02 May 2023
StyleGenes: Discrete and Efficient Latent Distributions for GANs
Evangelos Ntavelis
Mohamad Shahbazi
I. Kastanis
Radu Timofte
Martin Danelljan
Luc Van Gool
30
1
0
30 Apr 2023
A vector quantized masked autoencoder for speech emotion recognition
Samir Sadok
Simon Leglaive
Renaud Séguier
22
20
0
21 Apr 2023
Modeling and design of heterogeneous hierarchical bioinspired spider web structures using generative deep learning and additive manufacturing
Wei Lu
Nicolas A. Lee
Markus J. Buehler
AI4CE
11
1
0
11 Apr 2023
Binary Latent Diffusion
Ze Wang
Jiang Wang
Zicheng Liu
Qiang Qiu
11
13
0
10 Apr 2023
GINA-3D: Learning to Generate Implicit Neural Assets in the Wild
Bokui Shen
Xinchen Yan
C. Qi
Mahyar Najibi
Boyang Deng
Leonidas J. Guibas
Yin Zhou
Drago Anguelov
3DV
22
20
0
04 Apr 2023
Text-Conditioned Sampling Framework for Text-to-Image Generation with Masked Generative Models
Jaewoong Lee
Sang-Sub Jang
Jaehyeong Jo
Jaehong Yoon
Yunji Kim
Jin-Hwa Kim
Jung-Woo Ha
Sung Ju Hwang
DiffM
16
4
0
04 Apr 2023
ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance
Zoey Guo
Yiwen Tang
Renrui Zhang
Dong Wang
Zhigang Wang
Bin Zhao
Xuelong Li
28
53
0
29 Mar 2023
SC-VAE: Sparse Coding-based Variational Autoencoder with Learned ISTA
Pan Xiao
Peijie Qiu
Sungmin Ha
Abdalla Bani
Shuang Zhou
Aristeidis Sotiras
DRL
23
4
0
29 Mar 2023
The Stable Signature: Rooting Watermarks in Latent Diffusion Models
Pierre Fernandez
Guillaume Couairon
Hervé Jégou
Matthijs Douze
Teddy Furon
WIGM
10
174
0
27 Mar 2023
Learning Versatile 3D Shape Generation with Improved AR Models
Simian Luo
Xuelin Qian
Yanwei Fu
Yinda Zhang
Ying Tai
Zhenyu Zhang
Chengjie Wang
Xiangyang Xue
31
3
0
26 Mar 2023
Enhancing Multiple Reliability Measures via Nuisance-extended Information Bottleneck
Jongheon Jeong
Sihyun Yu
Hankook Lee
Jinwoo Shin
AAML
31
0
0
24 Mar 2023
High Fidelity Image Synthesis With Deep VAEs In Latent Space
Troy Luhman
Eric Luhman
DRL
3DV
20
7
0
23 Mar 2023