1711.00937
Neural Discrete Representation Learning
2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
Papers citing "Neural Discrete Representation Learning"
50 / 3,807 papers shown
Unsupervised Acoustic Unit Discovery by Leveraging a Language-Independent Subword Discriminative Feature Representation
Interspeech (Interspeech), 2021
Siyuan Feng
Piotr Żelasko
Laureano Moro-Velazquez
O. Scharenborg
181
4
0
02 Apr 2021
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations
Interspeech (Interspeech), 2021
Adam Polyak
Yossi Adi
Jade Copet
Eugene Kharitonov
Kushal Lakhotia
Wei-Ning Hsu
Abdel-rahman Mohamed
Emmanuel Dupoux
461
371
0
01 Apr 2021
A Closer Look at Fourier Spectrum Discrepancies for CNN-generated Images Detection
Computer Vision and Pattern Recognition (CVPR), 2021
Keshigeyan Chandrasegaran
Ngoc-Trung Tran
Ngai-Man Cheung
211
97
0
31 Mar 2021
Unsupervised Disentanglement of Linear-Encoded Facial Semantics
Computer Vision and Pattern Recognition (CVPR), 2021
Yutong Zheng
Yu-Kai Huang
R. Tao
Zhiqiang Shen
Marios Savvides
CVBM
DRL
143
14
0
30 Mar 2021
PixelTransformer: Sample Conditioned Signal Generation
International Conference on Machine Learning (ICML), 2021
Shubham Tulsiani
Abhinav Gupta
167
18
0
29 Mar 2021
Scalable and Efficient Neural Speech Coding: A Hybrid Design
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Kai Zhen
Jongmo Sung
Mi Suk Lee
Seung-Wha Beack
Minje Kim
237
19
0
27 Mar 2021
Decomposing Normal and Abnormal Features of Medical Images into Discrete Latent Codes for Content-Based Image Retrieval
Kazuma Kobayashi
Ryuichiro Hataya
Y. Kurose
M. Miyake
Masamichi Takahashi
Akiko Nakagawa
Tatsuya Harada
Ryuji Hamamoto
MedIm
272
22
0
23 Mar 2021
Tiny Transformers for Environmental Sound Classification at the Edge
David Elliott
Carlos E. Otero
Steven Wyatt
Evan Martino
156
20
0
22 Mar 2021
Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE
Computer Vision and Pattern Recognition (CVPR), 2021
Jialun Peng
Dong Liu
Songcen Xu
Houqiang Li
DiffM
184
232
0
18 Mar 2021
Variable-rate discrete representation learning
Sander Dieleman
C. Nash
Jesse Engel
Karen Simonyan
BDL
DRL
209
32
0
10 Mar 2021
Wav2vec-C: A Self-supervised Model for Speech Representation Learning
Interspeech (Interspeech), 2021
Samik Sadhu
Di He
Che-Wei Huang
Sri Harish Reddy Mallidi
Minhua Wu
Ariya Rastrow
A. Stolcke
J. Droppo
Roland Maas
SSL
217
52
0
09 Mar 2021
Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Sam Bond-Taylor
Adam Leach
Yang Long
Chris G. Willcocks
VLM
TPM
727
630
0
08 Mar 2021
Learning to Generate 3D Shapes with Generative Cellular Automata
International Conference on Learning Representations (ICLR), 2021
Dongsu Zhang
Changwoon Choi
Jeonghwan Kim
Y. Kim
155
28
0
06 Mar 2021
Generating Images with Sparse Representations
International Conference on Machine Learning (ICML), 2021
C. Nash
Jacob Menick
Sander Dieleman
Peter W. Battaglia
236
269
0
05 Mar 2021
crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Kazuhiro Kobayashi
Wen-Chin Huang
Yi-Chiao Wu
Patrick Lumban Tobing
Tomoki Hayashi
Tomoki Toda
BDL
DRL
150
19
0
04 Mar 2021
Enabling Visual Action Planning for Object Manipulation through Latent Space Roadmap
IEEE Transactions on Robotics (TRO), 2021
M. Lippi
Petra Poklukar
Michael C. Welle
Anastasia Varava
Hang Yin
Alessandro Marino
Danica Kragic
266
17
0
03 Mar 2021
Predicting Video with VQVAE
Jacob Walker
Ali Razavi
Aaron van den Oord
DRL
247
75
0
02 Mar 2021
A survey on Variational Autoencoders from a GreenAI perspective
SN Computer Science (SN Comput. Sci.), 2021
Andrea Asperti
David Evangelista
E. Loli Piccolomini
DRL
193
68
0
01 Mar 2021
M6: A Chinese Multimodal Pretrainer
Junyang Lin
Rui Men
An Yang
Chan Zhou
Ming Ding
...
Yong Li
Jialin Li
Jingren Zhou
J. Tang
Hongxia Yang
VLM
MoE
347
147
0
01 Mar 2021
Zero-Shot Text-to-Image Generation
International Conference on Machine Learning (ICML), 2021
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
829
6,024
0
24 Feb 2021
Unsupervised Brain Anomaly Detection and Segmentation with Transformers
International Conference on Medical Imaging with Deep Learning (MIDL), 2021
W. H. Pinaya
Petru-Daniel Tudosiu
Robert J. Gray
G. Rees
P. Nachev
Sebastien Ourselin
M. Jorge Cardoso
ViT
MedIm
149
68
0
23 Feb 2021
Anytime Sampling for Autoregressive Models via Ordered Autoencoding
International Conference on Learning Representations (ICLR), 2021
Yilun Xu
Yang Song
Sahaj Garg
Linyuan Gong
Rui Shu
Aditya Grover
Stefano Ermon
DiffM
189
13
0
23 Feb 2021
Uncertainty Estimation Using Riemannian Model Dynamics for Offline Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2021
Guy Tennenholtz
Shie Mannor
OffRL
227
14
0
22 Feb 2021
Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding
International Conference on Machine Learning (ICML), 2021
Yangjun Ruan
Karen Ullrich
Daniel de Souza Severo
James Townsend
Ashish Khisti
Arnaud Doucet
Alireza Makhzani
Chris J. Maddison
238
25
0
22 Feb 2021
Measuring the Stability of Learned Features
Kris Sankaran
OOD
84
0
0
20 Feb 2021
Preventing Oversmoothing in VAE via Generalized Variance Parameterization
Neurocomputing (Neurocomputing), 2021
Yuhta Takida
Wei-Hsiang Liao
Chieh-Hsin Lai
Toshimitsu Uesaka
Shusuke Takahashi
Yuki Mitsufuji
DRL
219
18
0
17 Feb 2021
Enhancing into the codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Jonah Casebeer
Vinjai Vale
Umut Isik
J. Valin
Ritwik Giri
A. Krishnaswamy
200
25
0
12 Feb 2021
VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention
Peng Liu
Yuewen Cao
Songxiang Liu
Na Hu
Guangzhi Li
Chao Weng
Jane Polak Scowcroft
183
23
0
12 Feb 2021
Self-Supervised VQ-VAE for One-Shot Music Style Transfer
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Ondřej Cífka
A. Ozerov
Umut Simsekli
G. Richard
224
36
0
10 Feb 2021
Inducing Meaningful Units from Character Sequences with Dynamic Capacity Slot Attention
Melika Behjati
James Henderson
OCL
171
1
0
01 Feb 2021
Generative Spoken Language Modeling from Raw Audio
Transactions of the Association for Computational Linguistics (TACL), 2021
Kushal Lakhotia
Evgeny Kharitonov
Wei-Ning Hsu
Yossi Adi
Adam Polyak
...
Tu Nguyen
Jade Copet
Alexei Baevski
A. Mohamed
Emmanuel Dupoux
AuLLM
643
439
0
01 Feb 2021
CNN with large memory layers
R. Karimov
Yury Malkov
Karim Iskakov
Victor Lempitsky
223
0
0
27 Jan 2021
Disentangled Sequence Clustering for Human Intention Inference
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021
Mark Zolotas
Y. Demiris
DRL
338
6
0
23 Jan 2021
Hierarchical disentangled representation learning for singing voice conversion
IEEE International Joint Conference on Neural Networks (IJCNN), 2021
Naoya Takahashi
M. Singh
Yuki Mitsufuji
DRL
181
19
0
18 Jan 2021
Cauchy-Schwarz Regularized Autoencoder
Journal of Machine Learning Research (JMLR), 2021
Linh-Tam Tran
Maja Pantic
M. Deisenroth
DRL
BDL
245
19
0
06 Jan 2021
HAVANA: Hierarchical and Variation-Normalized Autoencoder for Person Re-identification
Jiawei Ren
Xiao Ma
Chen Xu
Haiyu Zhao
Shuai Yi
BDL
267
5
0
06 Jan 2021
Psychoacoustic Calibration of Loss Functions for Efficient End-to-End Neural Audio Coding
IEEE Signal Processing Letters (IEEE SPL), 2020
Kai Zhen
Mi Suk Lee
Jongmo Sung
Seung-Wha Beack
Minje Kim
206
26
0
31 Dec 2020
Discovering Dialog Structure Graph for Open-Domain Dialog Generation
Jun Xu
Zeyang Lei
Haifeng Wang
Zheng-Yu Niu
Hua Wu
Wanxiang Che
Ting Liu
193
6
0
31 Dec 2020
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Wei-Ning Hsu
David Harwath
Christopher Song
James R. Glass
CLIP
191
74
0
31 Dec 2020
Interpretable NLG for Task-oriented Dialogue Systems with Heterogeneous Rendering Machines
AAAI Conference on Artificial Intelligence (AAAI), 2020
Yangming Li
Kaisheng Yao
188
4
0
29 Dec 2020
A Survey on Visual Transformer
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Kai Han
Yunhe Wang
Hanting Chen
Xinghao Chen
Jianyuan Guo
...
Chunjing Xu
Yixing Xu
Zhaohui Yang
Yiman Zhang
Dacheng Tao
ViT
1.1K
3,127
0
23 Dec 2020
Motif-Driven Contrastive Learning of Graph Representations
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2020
Shichang Zhang
Ziniu Hu
Arjun Subramonian
Luke Huan
SSL
244
11
0
23 Dec 2020
OBoW: Online Bag-of-Visual-Words Generation for Self-Supervised Learning
Computer Vision and Pattern Recognition (CVPR), 2020
Spyros Gidaris
Andrei Bursuc
Gilles Puy
N. Komodakis
Matthieu Cord
P. Pérez
SSL
297
81
0
21 Dec 2020
Exploring Fluent Query Reformulations with Text-to-Text Transformers and Reinforcement Learning
Jerry Zikun Chen
S. Yu
Haoran Wang
913
5
0
18 Dec 2020
Taming Transformers for High-Resolution Image Synthesis
Computer Vision and Pattern Recognition (CVPR), 2020
Patrick Esser
Robin Rombach
Bjorn Ommer
ViT
740
3,822
0
17 Dec 2020
The effectiveness of unsupervised subword modeling with autoregressive and cross-lingual phone-aware networks
IEEE Open Journal of Signal Processing (JOSP), 2020
Siyuan Feng
O. Scharenborg
SSL
222
3
0
17 Dec 2020
Computational principles of intelligence: learning and reasoning with neural networks
Abel Torres Montoya
PINN
AI4CE
117
1
0
17 Dec 2020
Planning from Pixels in Atari with Learned Symbolic Representations
AAAI Conference on Artificial Intelligence (AAAI), 2020
Andrea Dittadi
Frederik K. Drachmann
Thomas Bolander
358
11
0
16 Dec 2020
Unsupervised Learning of Global Factors in Deep Generative Models
Pattern Recognition (Pattern Recognit.), 2020
I. Peis
Pablo M. Olmos
Antonio Artés-Rodríguez
BDL
DRL
220
13
0
15 Dec 2020
Towards unsupervised phone and word segmentation using self-supervised vector-quantized neural networks
Interspeech (Interspeech), 2020
Herman Kamper
Benjamin van Niekerk
SSL
MQ
297
38
0
14 Dec 2020