ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.00937
  4. Cited By
Neural Discrete Representation Learning
v1v2 (latest)

Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDLSSLOCL
ArXiv (abs)PDFHTML

Papers citing "Neural Discrete Representation Learning"

50 / 3,807 papers shown
Anomaly detection through latent space restoration using
  vector-quantized variational autoencoders
Anomaly detection through latent space restoration using vector-quantized variational autoencodersIEEE International Symposium on Biomedical Imaging (ISBI), 2020
Jonathan Passerat-Palmbach
G. Tarroni
DRL
295
69
0
12 Dec 2020
DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector
  Quantization
DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization
Shaoshi Ling
Yuzong Liu
159
113
0
11 Dec 2020
Long Term Motion Prediction Using Keyposes
Long Term Motion Prediction Using Keyposes
Sena Kiciroglu
Wei Wang
Mathieu Salzmann
Pascal Fua
3DH
425
9
0
08 Dec 2020
I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at
  Pitch
I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at Pitch
Joseph P. Turian
Max Henry
245
36
0
08 Dec 2020
Bayesian Image Reconstruction using Deep Generative Models
Bayesian Image Reconstruction using Deep Generative Models
Razvan Marinescu
Daniel Moyer
Polina Golland
OODDiffM
382
50
0
08 Dec 2020
Extractive Opinion Summarization in Quantized Transformer Spaces
Extractive Opinion Summarization in Quantized Transformer Spaces
Stefanos Angelidis
Reinald Kim Amplayo
Yoshihiko Suhara
Xiaolan Wang
Mirella Lapata
MQ
298
118
0
08 Dec 2020
Multi-Instrumentalist Net: Unsupervised Generation of Music from Body
  Movements
Multi-Instrumentalist Net: Unsupervised Generation of Music from Body Movements
Kun Su
Xiulong Liu
Eli Shlizerman
242
32
0
07 Dec 2020
Learning Vector Quantized Shape Code for Amodal Blastomere Instance
  Segmentation
Learning Vector Quantized Shape Code for Amodal Blastomere Instance SegmentationIEEE International Symposium on Biomedical Imaging (ISBI), 2020
Won-Dong Jang
D. Wei
Xingxuan Zhang
B. Leahy
Helen Y Yang
James Tompkin
D. Ben-Yosef
D. Needleman
Hanspeter Pfister
193
21
0
02 Dec 2020
Mutual Information Constraints for Monte-Carlo Objectives
Mutual Information Constraints for Monte-Carlo ObjectivesJournal of machine learning research (JMLR), 2020
Gábor Melis
András Gyorgy
Phil Blunsom
223
1
0
01 Dec 2020
Unpaired Image-to-Image Translation via Latent Energy Transport
Unpaired Image-to-Image Translation via Latent Energy TransportComputer Vision and Pattern Recognition (CVPR), 2020
Yang Zhao
Changyou Chen
364
29
0
01 Dec 2020
Latent Programmer: Discrete Latent Codes for Program Synthesis
Latent Programmer: Discrete Latent Codes for Program SynthesisInternational Conference on Machine Learning (ICML), 2020
Joey Hong
David Dohan
Rishabh Singh
Charles Sutton
Manzil Zaheer
363
24
0
01 Dec 2020
Direct Evolutionary Optimization of Variational Autoencoders With Binary
  Latents
Direct Evolutionary Optimization of Variational Autoencoders With Binary Latents
E. Guiraud
Jakob Drefs
Jörg Lücke
DRL
249
3
0
27 Nov 2020
Neural Representations for Modeling Variation in Speech
Neural Representations for Modeling Variation in Speech
Martijn Bartelds
Wietse de Vries
Faraz Sanal
Caitlin Richter
M. Liberman
Martijn B. Wieling
SSLDRL
197
29
0
25 Nov 2020
BinPlay: A Binary Latent Autoencoder for Generative Replay Continual
  Learning
BinPlay: A Binary Latent Autoencoder for Generative Replay Continual LearningIEEE International Joint Conference on Neural Network (IJCNN), 2020
Kamil Deja
Pawel Wawrzyñski
Daniel Marczak
Wojciech Masarczyk
Tomasz Trzciñski
SyDa
162
10
0
25 Nov 2020
Mixture-based Feature Space Learning for Few-shot Image Classification
Mixture-based Feature Space Learning for Few-shot Image ClassificationIEEE International Conference on Computer Vision (ICCV), 2020
Arman Afrasiyabi
Jean-François Lalonde
Christian Gagné
VLM
324
89
0
24 Nov 2020
Parrot: Data-Driven Behavioral Priors for Reinforcement Learning
Parrot: Data-Driven Behavioral Priors for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2020
Avi Singh
Huihan Liu
G. Zhou
Albert Yu
Nicholas Rhinehart
Sergey Levine
OffRLOnRL
258
159
0
19 Nov 2020
Latent Adversarial Debiasing: Mitigating Collider Bias in Deep Neural
  Networks
Latent Adversarial Debiasing: Mitigating Collider Bias in Deep Neural Networks
L. N. Darlow
Stanisław Jastrzębski
Amos Storkey
247
26
0
19 Nov 2020
End-To-End Dilated Variational Autoencoder with Bottleneck
  Discriminative Loss for Sound Morphing -- A Preliminary Study
End-To-End Dilated Variational Autoencoder with Bottleneck Discriminative Loss for Sound Morphing -- A Preliminary Study
Matteo Lionello
Hendrik Purwins
155
0
0
19 Nov 2020
Neuro-Symbolic Representations for Video Captioning: A Case for
  Leveraging Inductive Biases for Vision and Language
Neuro-Symbolic Representations for Video Captioning: A Case for Leveraging Inductive Biases for Vision and Language
Hassan Akbari
Hamid Palangi
Jianwei Yang
Sudha Rao
Asli Celikyilmaz
Roland Fernandez
P. Smolensky
Jianfeng Gao
Shih-Fu Chang
210
3
0
18 Nov 2020
Optimizing voice conversion network with cycle consistency loss of
  speaker identity
Optimizing voice conversion network with cycle consistency loss of speaker identitySpoken Language Technology Workshop (SLT), 2020
Hongqiang Du
Xiaohai Tian
Lei Xie
Haizhou Li
162
20
0
17 Nov 2020
A Comprehensive Survey on Deep Music Generation: Multi-level
  Representations, Algorithms, Evaluations, and Future Directions
A Comprehensive Survey on Deep Music Generation: Multi-level Representations, Algorithms, Evaluations, and Future Directions
Shulei Ji
Jing Luo
Xinyu Yang
MGen
269
143
0
13 Nov 2020
Hierarchical Prosody Modeling for Non-Autoregressive Speech Synthesis
Hierarchical Prosody Modeling for Non-Autoregressive Speech SynthesisSpoken Language Technology Workshop (SLT), 2020
C. Chien
Hung-yi Lee
292
41
0
12 Nov 2020
Testing for Typicality with Respect to an Ensemble of Learned
  Distributions
Testing for Typicality with Respect to an Ensemble of Learned Distributions
F. Laine
Claire Tomlin
132
0
0
11 Nov 2020
Semi-supervised Learning for Singing Synthesis Timbre
Semi-supervised Learning for Singing Synthesis Timbre
J. Bonada
Merlijn Blaauw
132
4
0
05 Nov 2020
A Hierarchical Subspace Model for Language-Attuned Acoustic Unit
  Discovery
A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery
Bolaji Yusuf
Lucas Ondel
L. Burget
J. Černocký
Murat Saraclar
174
12
0
04 Nov 2020
Paralinguistic Privacy Protection at the Edge
Paralinguistic Privacy Protection at the Edge
Ranya Aloufi
Hamed Haddadi
David E. Boyle
253
15
0
04 Nov 2020
Mixing Consistent Deep Clustering
Mixing Consistent Deep Clustering
D. Lutscher
Ali el Hassouni
M. Stol
Mark Hoogendoorn
SSL
70
2
0
03 Nov 2020
CVC: Contrastive Learning for Non-parallel Voice Conversion
CVC: Contrastive Learning for Non-parallel Voice ConversionInterspeech (Interspeech), 2020
Tingle Li
Yichen Liu
Chenxu Hu
Hang Zhao
DRL
205
13
0
02 Nov 2020
Non-Autoregressive Predictive Coding for Learning Speech Representations
  from Local Dependencies
Non-Autoregressive Predictive Coding for Learning Speech Representations from Local DependenciesInterspeech (Interspeech), 2020
Alexander H. Liu
Yu-An Chung
James R. Glass
SSL
227
93
0
01 Nov 2020
AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and
  Adaptive Instance Normalization
AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance NormalizationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Yen-Hao Chen
Da-Yi Wu
Tsung-Han Wu
Hung-yi Lee
287
122
0
31 Oct 2020
GENs: Generative Encoding Networks
GENs: Generative Encoding NetworksMachine-mediated learning (ML), 2020
Surojit Saha
Shireen Y. Elhabian
Ross T. Whitaker
GAN
183
9
0
28 Oct 2020
Scaling Laws for Autoregressive Generative Modeling
Scaling Laws for Autoregressive Generative Modeling
T. Henighan
Jared Kaplan
Mor Katz
Mark Chen
Christopher Hesse
...
Nick Ryder
Daniel M. Ziegler
John Schulman
Dario Amodei
Sam McCandlish
476
560
0
28 Oct 2020
Spiking Neural Networks -- Part III: Neuromorphic Communications
Spiking Neural Networks -- Part III: Neuromorphic CommunicationsIEEE Communications Letters (IEEE Commun. Lett.), 2020
N. Skatchkovsky
Hyeryung Jang
Osvaldo Simeone
183
14
0
27 Oct 2020
Unsupervised Learning of Disentangled Speech Content and Style
  Representation
Unsupervised Learning of Disentangled Speech Content and Style RepresentationInterspeech (Interspeech), 2020
Andros Tjandra
Ruoming Pang
Yu Zhang
Shigeki Karita
BDLDRL
263
20
0
24 Oct 2020
A Comparison of Discrete Latent Variable Models for Speech
  Representation Learning
A Comparison of Discrete Latent Variable Models for Speech Representation LearningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Henry Zhou
Alexei Baevski
Michael Auli
DRL
123
11
0
24 Oct 2020
Representation Learning for High-Dimensional Data Collection under Local
  Differential Privacy
Representation Learning for High-Dimensional Data Collection under Local Differential Privacy
Alex Mansbridge
G. Barbour
Davide Piras
Michael Murray
Christopher Frye
Ilya Feige
David Barber
254
1
0
23 Oct 2020
A Framework for Generative and Contrastive Learning of Audio
  Representations
A Framework for Generative and Contrastive Learning of Audio Representations
Prateek Verma
J. Smith
SSL
220
21
0
22 Oct 2020
Learning Speaker Embedding from Text-to-Speech
Learning Speaker Embedding from Text-to-Speech
Jaejin Cho
Piotr Żelasko
Jesus Villalba
Shinji Watanabe
Najim Dehak
140
13
0
21 Oct 2020
Learning Disentangled Phone and Speaker Representations in a
  Semi-Supervised VQ-VAE Paradigm
Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE ParadigmIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Jennifer Williams
Yi Zhao
Erica Cooper
Junichi Yamagishi
SSL
223
26
0
21 Oct 2020
Towards End-to-End In-Image Neural Machine Translation
Towards End-to-End In-Image Neural Machine Translation
Elman Mansimov
Mitchell Stern
Mengzhao Chen
Orhan Firat
Jakob Uszkoreit
Puneet Jain
185
30
0
20 Oct 2020
End-to-End Text-to-Speech using Latent Duration based on VQ-VAE
End-to-End Text-to-Speech using Latent Duration based on VQ-VAEIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Yusuke Yasuda
Xin Wang
Junichi Yamagishi
173
17
0
19 Oct 2020
CLAR: Contrastive Learning of Auditory Representations
CLAR: Contrastive Learning of Auditory RepresentationsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Haider Al-Tahan
Y. Mohsenzadeh
SSL
383
67
0
19 Oct 2020
Evidential Sparsification of Multimodal Latent Spaces in Conditional
  Variational Autoencoders
Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational AutoencodersNeural Information Processing Systems (NeurIPS), 2020
Masha Itkina
Boris Ivanovic
Ransalu Senanayake
Mykel J. Kochenderfer
Marco Pavone
243
18
0
19 Oct 2020
CQ-VAE: Coordinate Quantized VAE for Uncertainty Estimation with
  Application to Disk Shape Analysis from Lumbar Spine MRI Images
CQ-VAE: Coordinate Quantized VAE for Uncertainty Estimation with Application to Disk Shape Analysis from Lumbar Spine MRI ImagesInternational Conference on Machine Learning and Applications (ICMLA), 2020
Linchen Qian
Jiasong Chen
Timur Urakov
Weiyong Gu
Liang Liang
105
3
0
17 Oct 2020
Latent Vector Recovery of Audio GANs
Latent Vector Recovery of Audio GANs
Andrew Keyes
N. Bayat
Vahid Reza Khazaie
Y. Mohsenzadeh
93
3
0
16 Oct 2020
The NeteaseGames System for Voice Conversion Challenge 2020 with
  Vector-quantization Variational Autoencoder and WaveNet
The NeteaseGames System for Voice Conversion Challenge 2020 with Vector-quantization Variational Autoencoder and WaveNet
Haitong Zhang
DRL
110
4
0
15 Oct 2020
Smaller World Models for Reinforcement Learning
Smaller World Models for Reinforcement Learning
Jan Robine
Tobias Uelwer
Stefan Harmeling
DRL
102
4
0
12 Oct 2020
Deep Sequence Learning for Video Anticipation: From Discrete and
  Deterministic to Continuous and Stochastic
Deep Sequence Learning for Video Anticipation: From Discrete and Deterministic to Continuous and Stochastic
S. Aliakbarian
AI4TS
128
0
0
09 Oct 2020
Event Representation with Sequential, Semi-Supervised Discrete Variables
Event Representation with Sequential, Semi-Supervised Discrete VariablesNorth American Chapter of the Association for Computational Linguistics (NAACL), 2020
Mehdi Rezaee
Francis Ferraro
BDLDRL
226
14
0
09 Oct 2020
FastVC: Fast Voice Conversion with non-parallel data
FastVC: Fast Voice Conversion with non-parallel data
Oriol Barbany
Milos Cernak
130
7
0
08 Oct 2020
Previous
123...697071...757677
Next
Page 70 of 77
Pageof 77