1711.00937
Neural Discrete Representation Learning
2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
Papers citing "Neural Discrete Representation Learning"
50 / 3,807 papers shown
Anomaly detection through latent space restoration using vector-quantized variational autoencoders
IEEE International Symposium on Biomedical Imaging (ISBI), 2020
Jonathan Passerat-Palmbach
G. Tarroni
DRL
295
69
0
12 Dec 2020
DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization
Shaoshi Ling
Yuzong Liu
159
113
0
11 Dec 2020
Long Term Motion Prediction Using Keyposes
Sena Kiciroglu
Wei Wang
Mathieu Salzmann
Pascal Fua
3DH
425
9
0
08 Dec 2020
I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at Pitch
Joseph P. Turian
Max Henry
245
36
0
08 Dec 2020
Bayesian Image Reconstruction using Deep Generative Models
Razvan Marinescu
Daniel Moyer
Polina Golland
OOD
DiffM
382
50
0
08 Dec 2020
Extractive Opinion Summarization in Quantized Transformer Spaces
Stefanos Angelidis
Reinald Kim Amplayo
Yoshihiko Suhara
Xiaolan Wang
Mirella Lapata
MQ
298
118
0
08 Dec 2020
Multi-Instrumentalist Net: Unsupervised Generation of Music from Body Movements
Kun Su
Xiulong Liu
Eli Shlizerman
242
32
0
07 Dec 2020
Learning Vector Quantized Shape Code for Amodal Blastomere Instance Segmentation
IEEE International Symposium on Biomedical Imaging (ISBI), 2020
Won-Dong Jang
D. Wei
Xingxuan Zhang
B. Leahy
Helen Y Yang
James Tompkin
D. Ben-Yosef
D. Needleman
Hanspeter Pfister
193
21
0
02 Dec 2020
Mutual Information Constraints for Monte-Carlo Objectives
Journal of machine learning research (JMLR), 2020
Gábor Melis
András Gyorgy
Phil Blunsom
223
1
0
01 Dec 2020
Unpaired Image-to-Image Translation via Latent Energy Transport
Computer Vision and Pattern Recognition (CVPR), 2020
Yang Zhao
Changyou Chen
364
29
0
01 Dec 2020
Latent Programmer: Discrete Latent Codes for Program Synthesis
International Conference on Machine Learning (ICML), 2020
Joey Hong
David Dohan
Rishabh Singh
Charles Sutton
Manzil Zaheer
363
24
0
01 Dec 2020
Direct Evolutionary Optimization of Variational Autoencoders With Binary Latents
E. Guiraud
Jakob Drefs
Jörg Lücke
DRL
249
3
0
27 Nov 2020
Neural Representations for Modeling Variation in Speech
Martijn Bartelds
Wietse de Vries
Faraz Sanal
Caitlin Richter
M. Liberman
Martijn B. Wieling
SSL
DRL
197
29
0
25 Nov 2020
BinPlay: A Binary Latent Autoencoder for Generative Replay Continual Learning
IEEE International Joint Conference on Neural Network (IJCNN), 2020
Kamil Deja
Pawel Wawrzyński
Daniel Marczak
Wojciech Masarczyk
Tomasz Trzciński
SyDa
162
10
0
25 Nov 2020
Mixture-based Feature Space Learning for Few-shot Image Classification
IEEE International Conference on Computer Vision (ICCV), 2020
Arman Afrasiyabi
Jean-François Lalonde
Christian Gagné
VLM
324
89
0
24 Nov 2020
Parrot: Data-Driven Behavioral Priors for Reinforcement Learning
International Conference on Learning Representations (ICLR), 2020
Avi Singh
Huihan Liu
G. Zhou
Albert Yu
Nicholas Rhinehart
Sergey Levine
OffRL
OnRL
258
159
0
19 Nov 2020
Latent Adversarial Debiasing: Mitigating Collider Bias in Deep Neural Networks
L. N. Darlow
Stanisław Jastrzębski
Amos Storkey
247
26
0
19 Nov 2020
End-To-End Dilated Variational Autoencoder with Bottleneck Discriminative Loss for Sound Morphing -- A Preliminary Study
Matteo Lionello
Hendrik Purwins
155
0
0
19 Nov 2020
Neuro-Symbolic Representations for Video Captioning: A Case for Leveraging Inductive Biases for Vision and Language
Hassan Akbari
Hamid Palangi
Jianwei Yang
Sudha Rao
Asli Celikyilmaz
Roland Fernandez
P. Smolensky
Jianfeng Gao
Shih-Fu Chang
210
3
0
18 Nov 2020
Optimizing voice conversion network with cycle consistency loss of speaker identity
Spoken Language Technology Workshop (SLT), 2020
Hongqiang Du
Xiaohai Tian
Lei Xie
Haizhou Li
162
20
0
17 Nov 2020
A Comprehensive Survey on Deep Music Generation: Multi-level Representations, Algorithms, Evaluations, and Future Directions
Shulei Ji
Jing Luo
Xinyu Yang
MGen
269
143
0
13 Nov 2020
Hierarchical Prosody Modeling for Non-Autoregressive Speech Synthesis
Spoken Language Technology Workshop (SLT), 2020
C. Chien
Hung-yi Lee
292
41
0
12 Nov 2020
Testing for Typicality with Respect to an Ensemble of Learned Distributions
F. Laine
Claire Tomlin
132
0
0
11 Nov 2020
Semi-supervised Learning for Singing Synthesis Timbre
J. Bonada
Merlijn Blaauw
132
4
0
05 Nov 2020
A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery
Bolaji Yusuf
Lucas Ondel
L. Burget
J. Černocký
Murat Saraclar
174
12
0
04 Nov 2020
Paralinguistic Privacy Protection at the Edge
Ranya Aloufi
Hamed Haddadi
David E. Boyle
253
15
0
04 Nov 2020
Mixing Consistent Deep Clustering
D. Lutscher
Ali el Hassouni
M. Stol
Mark Hoogendoorn
SSL
70
2
0
03 Nov 2020
CVC: Contrastive Learning for Non-parallel Voice Conversion
Interspeech (Interspeech), 2020
Tingle Li
Yichen Liu
Chenxu Hu
Hang Zhao
DRL
205
13
0
02 Nov 2020
Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies
Interspeech (Interspeech), 2020
Alexander H. Liu
Yu-An Chung
James R. Glass
SSL
227
93
0
01 Nov 2020
AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance Normalization
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Yen-Hao Chen
Da-Yi Wu
Tsung-Han Wu
Hung-yi Lee
287
122
0
31 Oct 2020
GENs: Generative Encoding Networks
Machine-mediated learning (ML), 2020
Surojit Saha
Shireen Y. Elhabian
Ross T. Whitaker
GAN
183
9
0
28 Oct 2020
Scaling Laws for Autoregressive Generative Modeling
T. Henighan
Jared Kaplan
Mor Katz
Mark Chen
Christopher Hesse
...
Nick Ryder
Daniel M. Ziegler
John Schulman
Dario Amodei
Sam McCandlish
476
560
0
28 Oct 2020
Spiking Neural Networks -- Part III: Neuromorphic Communications
IEEE Communications Letters (IEEE Commun. Lett.), 2020
N. Skatchkovsky
Hyeryung Jang
Osvaldo Simeone
183
14
0
27 Oct 2020
Unsupervised Learning of Disentangled Speech Content and Style Representation
Interspeech (Interspeech), 2020
Andros Tjandra
Ruoming Pang
Yu Zhang
Shigeki Karita
BDL
DRL
263
20
0
24 Oct 2020
A Comparison of Discrete Latent Variable Models for Speech Representation Learning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Henry Zhou
Alexei Baevski
Michael Auli
DRL
123
11
0
24 Oct 2020
Representation Learning for High-Dimensional Data Collection under Local Differential Privacy
Alex Mansbridge
G. Barbour
Davide Piras
Michael Murray
Christopher Frye
Ilya Feige
David Barber
254
1
0
23 Oct 2020
A Framework for Generative and Contrastive Learning of Audio Representations
Prateek Verma
J. Smith
SSL
220
21
0
22 Oct 2020
Learning Speaker Embedding from Text-to-Speech
Jaejin Cho
Piotr Żelasko
Jesus Villalba
Shinji Watanabe
Najim Dehak
140
13
0
21 Oct 2020
Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Jennifer Williams
Yi Zhao
Erica Cooper
Junichi Yamagishi
SSL
223
26
0
21 Oct 2020
Towards End-to-End In-Image Neural Machine Translation
Elman Mansimov
Mitchell Stern
Mengzhao Chen
Orhan Firat
Jakob Uszkoreit
Puneet Jain
185
30
0
20 Oct 2020
End-to-End Text-to-Speech using Latent Duration based on VQ-VAE
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Yusuke Yasuda
Xin Wang
Junichi Yamagishi
173
17
0
19 Oct 2020
CLAR: Contrastive Learning of Auditory Representations
International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Haider Al-Tahan
Y. Mohsenzadeh
SSL
383
67
0
19 Oct 2020
Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders
Neural Information Processing Systems (NeurIPS), 2020
Masha Itkina
Boris Ivanovic
Ransalu Senanayake
Mykel J. Kochenderfer
Marco Pavone
243
18
0
19 Oct 2020
CQ-VAE: Coordinate Quantized VAE for Uncertainty Estimation with Application to Disk Shape Analysis from Lumbar Spine MRI Images
International Conference on Machine Learning and Applications (ICMLA), 2020
Linchen Qian
Jiasong Chen
Timur Urakov
Weiyong Gu
Liang Liang
105
3
0
17 Oct 2020
Latent Vector Recovery of Audio GANs
Andrew Keyes
N. Bayat
Vahid Reza Khazaie
Y. Mohsenzadeh
93
3
0
16 Oct 2020
The NeteaseGames System for Voice Conversion Challenge 2020 with Vector-quantization Variational Autoencoder and WaveNet
Haitong Zhang
DRL
110
4
0
15 Oct 2020
Smaller World Models for Reinforcement Learning
Jan Robine
Tobias Uelwer
Stefan Harmeling
DRL
102
4
0
12 Oct 2020
Deep Sequence Learning for Video Anticipation: From Discrete and Deterministic to Continuous and Stochastic
S. Aliakbarian
AI4TS
128
0
0
09 Oct 2020
Event Representation with Sequential, Semi-Supervised Discrete Variables
North American Chapter of the Association for Computational Linguistics (NAACL), 2020
Mehdi Rezaee
Francis Ferraro
BDL
DRL
226
14
0
09 Oct 2020
FastVC: Fast Voice Conversion with non-parallel data
Oriol Barbany
Milos Cernak
130
7
0
08 Oct 2020