ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.00937
  4. Cited By
Neural Discrete Representation Learning
v1v2 (latest)

Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDLSSLOCL
ArXiv (abs)PDFHTML

Papers citing "Neural Discrete Representation Learning"

50 / 3,807 papers shown
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed
  Langevin Dynamics
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
Shogo Seki
DiffM
293
25
0
06 Oct 2020
A Contrastive Learning Approach for Training Variational Autoencoder
  Priors
A Contrastive Learning Approach for Training Variational Autoencoder Priors
J. Aneja
Alex Schwing
Jan Kautz
Arash Vahdat
DRL
532
99
0
06 Oct 2020
The Academia Sinica Systems of Voice Conversion for VCC2020
The Academia Sinica Systems of Voice Conversion for VCC2020
Yu-Huai Peng
Cheng-Hung Hu
A. Kang
Hung-Shin Lee
Pin-Yuan Chen
Yu Tsao
Hsin-Min Wang
148
2
0
06 Oct 2020
Implicit Rank-Minimizing Autoencoder
Implicit Rank-Minimizing AutoencoderNeural Information Processing Systems (NeurIPS), 2020
Li Jing
Jure Zbontar
Yann LeCun
SSLDRL
241
54
0
01 Oct 2020
The Utility of Decorrelating Colour Spaces in Vector Quantised
  Variational Autoencoders
The Utility of Decorrelating Colour Spaces in Vector Quantised Variational Autoencoders
A. Akbarinia
Raquel Gil-Rodríguez
Alban Flachot
Matteo Toscani
DRL
79
0
0
30 Sep 2020
Controllable Text Generation with Focused Variation
Controllable Text Generation with Focused VariationFindings (Findings), 2020
Lei Shu
Alexandros Papangelis
Yi-Chia Wang
Gokhan Tur
Hu Xu
Zhaleh Feizollahi
Bing-Quan Liu
Piero Molino
200
12
0
25 Sep 2020
A Unifying Review of Deep and Shallow Anomaly Detection
A Unifying Review of Deep and Shallow Anomaly DetectionProceedings of the IEEE (Proc. IEEE), 2020
Lukas Ruff
Jacob R. Kauffmann
Robert A. Vandermeulen
G. Montavon
Wojciech Samek
Matthias Kirchler
Thomas G. Dietterich
Klaus-Robert Muller
UQCV
626
937
0
24 Sep 2020
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal
  Transformers
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal TransformersConference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Jaemin Cho
Jiasen Lu
Dustin Schwenk
Hannaneh Hajishirzi
Aniruddha Kembhavi
VLMMLLM
231
106
0
23 Sep 2020
Generative Model without Prior Distribution Matching
Generative Model without Prior Distribution Matching
Cong Geng
Jia Wang
Lixing Chen
Zhiyong Gao
GAN
750
1
0
23 Sep 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis
DiffWave: A Versatile Diffusion Model for Audio SynthesisInternational Conference on Learning Representations (ICLR), 2020
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffMBDL
704
1,779
0
21 Sep 2020
Target Conditioning for One-to-Many Generation
Target Conditioning for One-to-Many GenerationFindings (Findings), 2020
Marie-Anne Lachaux
Armand Joulin
Guillaume Lample
156
14
0
21 Sep 2020
DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition
DVG-Face: Dual Variational Generation for Heterogeneous Face RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Chaoyou Fu
Xiang Wu
Yibo Hu
Huaibo Huang
Ran He
CVBM
156
94
0
20 Sep 2020
Discond-VAE: Disentangling Continuous Factors from the Discrete
Discond-VAE: Disentangling Continuous Factors from the Discrete
Jaewoong Choi
Geonho Hwang
Myung-joo Kang
CoGeCML
193
6
0
17 Sep 2020
Recurrent autoencoder with sequence-aware encoding
Recurrent autoencoder with sequence-aware encodingInternational Conference on Conceptual Structures (ICCS), 2020
Robert Susik
AI4TS
164
6
0
15 Sep 2020
Brain2Word: Decoding Brain Activity for Language Generation
Brain2Word: Decoding Brain Activity for Language Generation
Nicolas Affolter
Béni Egressy
Damian Pascual
Roger Wattenhofer
214
26
0
10 Sep 2020
not-so-BigGAN: Generating High-Fidelity Images on Small Compute with
  Wavelet-based Super-Resolution
not-so-BigGAN: Generating High-Fidelity Images on Small Compute with Wavelet-based Super-Resolution
Seung-Jun Han
Akash Srivastava
C. Hurwitz
P. Sattigeri
David D. Cox
122
10
0
09 Sep 2020
Multilinear Latent Conditioning for Generating Unseen Attribute
  Combinations
Multilinear Latent Conditioning for Generating Unseen Attribute CombinationsInternational Conference on Machine Learning (ICML), 2020
Markos Georgopoulos
Grigorios G. Chrysos
Maja Pantic
Yannis Panagakis
GANDRL
183
18
0
09 Sep 2020
Deep data compression for approximate ultrasonic image formation
Deep data compression for approximate ultrasonic image formationIUS (IUS), 2020
G. Pilikos
L. Horchens
K. Batenburg
Tristan van Leeuwen
F. Lucka
138
9
0
04 Sep 2020
End-to-End Learning of Neuromorphic Wireless Systems for Low-Power Edge
  Artificial Intelligence
End-to-End Learning of Neuromorphic Wireless Systems for Low-Power Edge Artificial IntelligenceAsilomar Conference on Signals, Systems and Computers (Asilomar), 2020
N. Skatchkovsky
Hyeryung Jang
Osvaldo Simeone
202
26
0
03 Sep 2020
Stochastic Graph Recurrent Neural Network
Stochastic Graph Recurrent Neural NetworkNeurocomputing (Neurocomputing), 2020
Tijin Yan
Hongwei Zhang
Zirui Li
Yuanqing Xia
GNNBDL
134
5
0
01 Sep 2020
GIF: Generative Interpretable Faces
GIF: Generative Interpretable FacesInternational Conference on 3D Vision (3DV), 2020
Partha Ghosh
Pravir Singh Gupta
Roy Uziel
Anurag Ranjan
Michael J. Black
Timo Bolkart
CVBMAI4CE
394
80
0
31 Aug 2020
Hierarchical Timbre-Painting and Articulation Generation
Hierarchical Timbre-Painting and Articulation GenerationInternational Society for Music Information Retrieval Conference (ISMIR), 2020
Michael Michelashvili
Lior Wolf
221
12
0
30 Aug 2020
Dynamical Variational Autoencoders: A Comprehensive Review
Dynamical Variational Autoencoders: A Comprehensive Review
Laurent Girin
Simon Leglaive
Xiaoyu Bie
Julien Diard
Thomas Hueber
Xavier Alameda-Pineda
BDL
499
268
0
28 Aug 2020
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and
  cross-lingual voice conversion
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion
Yi Zhao
Wen-Chin Huang
Xiaohai Tian
Junichi Yamagishi
Rohan Kumar Das
Tomi Kinnunen
Zhenhua Ling
Tomoki Toda
198
233
0
28 Aug 2020
Nonparallel Voice Conversion with Augmented Classifier Star Generative
  Adversarial Networks
Nonparallel Voice Conversion with Augmented Classifier Star Generative Adversarial NetworksIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
347
22
0
27 Aug 2020
asya: Mindful verbal communication using deep learning
asya: Mindful verbal communication using deep learning
Ē. Urtāns
Ariel Tabaks
VLM
198
1
0
20 Aug 2020
Unsupervised Acoustic Unit Representation Learning for Voice Conversion
  using WaveNet Auto-encoders
Unsupervised Acoustic Unit Representation Learning for Voice Conversion using WaveNet Auto-encoders
Mingjie Chen
Thomas Hain
SSLDRL
127
16
0
16 Aug 2020
Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve
  Multimodal Speech Emotion Recognition
Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition
Shamane Siriwardhana
Andrew Reis
Rivindu Weerasekera
Suranga Nanayakkara
204
116
0
15 Aug 2020
Audio- and Gaze-driven Facial Animation of Codec Avatars
Audio- and Gaze-driven Facial Animation of Codec AvatarsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020
Alexander Richard
Colin S. Lea
Shugao Ma
Juergen Gall
Fernando de la Torre
Yaser Sheikh
CVBM
166
89
0
11 Aug 2020
Unsupervised Learning For Sequence-to-sequence Text-to-speech For
  Low-resource Languages
Unsupervised Learning For Sequence-to-sequence Text-to-speech For Low-resource LanguagesInterspeech (Interspeech), 2020
Haitong Zhang
Yue Lin
123
33
0
11 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical
  Modeling to Deep Learning
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep LearningIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
450
389
0
09 Aug 2020
Learning Sampling in Financial Statement Audits using Vector Quantised
  Autoencoder Neural Networks
Learning Sampling in Financial Statement Audits using Vector Quantised Autoencoder Neural Networks
Marco Schreyer
Timur Sattarov
Anita Gierbl
Bernd Reimer
Damian Borth
DRL
110
3
0
06 Aug 2020
Optimal Variance Control of the Score Function Gradient Estimator for
  Importance Weighted Bounds
Optimal Variance Control of the Score Function Gradient Estimator for Importance Weighted Bounds
Valentin Liévin
Andrea Dittadi
Anders Christensen
Ole Winther
DRL
230
6
0
05 Aug 2020
Timbre latent space: exploration and creative aspects
Timbre latent space: exploration and creative aspects
Antoine Caillon
Adrien Bitton
Brice Gatinet
P. Esling
203
1
0
04 Aug 2020
Learning from Few Samples: A Survey
Learning from Few Samples: A Survey
Nihar Bendre
Hugo Terashima-Marín
Peyman Najafirad
VLMBDL
246
60
0
30 Jul 2020
Foveation for Segmentation of Ultra-High Resolution Images
Foveation for Segmentation of Ultra-High Resolution Images
Chen Jin
Ryutaro Tanno
Moucheng Xu
T. Mertzanidou
Daniel C. Alexander
AI4TS
194
4
0
29 Jul 2020
dMelodies: A Music Dataset for Disentanglement Learning
dMelodies: A Music Dataset for Disentanglement LearningInternational Society for Music Information Retrieval Conference (ISMIR), 2020
Ashis Pati
Siddharth Gururani
Alexander Lerch
CoGeDRL
153
11
0
29 Jul 2020
Privacy-preserving Voice Analysis via Disentangled Representations
Privacy-preserving Voice Analysis via Disentangled Representations
Ranya Aloufi
Hamed Haddadi
David E. Boyle
DRL
306
62
0
29 Jul 2020
Generative networks as inverse problems with fractional wavelet
  scattering networks
Generative networks as inverse problems with fractional wavelet scattering networks
Jiasong Wu
Jing Zhang
Fuzhi Wu
Youyong Kong
Guanyu Yang
L. Senhadji
H. Shu
GAN
171
1
0
28 Jul 2020
Unsupervised Subword Modeling Using Autoregressive Pretraining and
  Cross-Lingual Phone-Aware Modeling
Unsupervised Subword Modeling Using Autoregressive Pretraining and Cross-Lingual Phone-Aware ModelingInterspeech (Interspeech), 2020
Siyuan Feng
O. Scharenborg
SSL
195
4
0
25 Jul 2020
Modal Uncertainty Estimation via Discrete Latent Representation
Modal Uncertainty Estimation via Discrete Latent Representation
Di Qiu
L. Lui
UQCV
105
6
0
25 Jul 2020
Learning the Latent Space of Robot Dynamics for Cutting Interaction
  Inference
Learning the Latent Space of Robot Dynamics for Cutting Interaction Inference
S. Rezaei-Shoshtari
David Meger
I. Sharf
BDLDRL
129
3
0
22 Jul 2020
Word Representation for Rhythms
Tongyu Lu
Lucheng Yan
Gus Xia
174
0
0
21 Jul 2020
Generalizing Variational Autoencoders with Hierarchical Empirical Bayes
Generalizing Variational Autoencoders with Hierarchical Empirical Bayes
Wei Cheng
Gregory Darnell
Sohini Ramachandran
Lorin Crawford
BDL
107
3
0
20 Jul 2020
Incorporating Reinforced Adversarial Learning in Autoregressive Image
  Generation
Incorporating Reinforced Adversarial Learning in Autoregressive Image Generation
Kenan E. Ak
N. Xu
Zhe Lin
Yilin Wang
212
13
0
20 Jul 2020
Relaxed-Responsibility Hierarchical Discrete VAEs
Relaxed-Responsibility Hierarchical Discrete VAEs
M. Willetts
Xenia Miscouridou
Stephen J. Roberts
Chris Holmes
BDLDRL
247
5
0
14 Jul 2020
A Deep Learning Approach for Low-Latency Packet Loss Concealment of
  Audio Signals in Networked Music Performance Applications
A Deep Learning Approach for Low-Latency Packet Loss Concealment of Audio Signals in Networked Music Performance ApplicationsConference of the Open Innovations Association (FRUCT), 2020
Prateek Verma
Alessandro Ilic Mezza
C. Chafe
C. Rottondi
99
29
0
14 Jul 2020
Failure Modes of Variational Autoencoders and Their Effects on
  Downstream Tasks
Failure Modes of Variational Autoencoders and Their Effects on Downstream Tasks
Yaniv Yacoby
Weiwei Pan
Finale Doshi-Velez
CMLDRL
476
31
0
14 Jul 2020
Vector-Quantized Timbre Representation
Vector-Quantized Timbre Representation
Adrien Bitton
P. Esling
Tatsuya Harada
158
12
0
13 Jul 2020
Deep Retrieval: Learning A Retrievable Structure for Large-Scale
  Recommendations
Deep Retrieval: Learning A Retrievable Structure for Large-Scale Recommendations
Weihao Gao
Xiangjun Fan
Chong-Jun Wang
Jiankai Sun
Kai Jia
Wen Xiao
Ruofan Ding
Xingyan Bin
Hui Yang
Xiaobing Liu
CML
261
26
0
12 Jul 2020
Previous
123...707172...757677
Next
Page 71 of 77
Pageof 77