Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1711.00937
Cited By
v1
v2 (latest)
Neural Discrete Representation Learning
2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Neural Discrete Representation Learning"
50 / 3,807 papers shown
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
Shogo Seki
DiffM
293
25
0
06 Oct 2020
A Contrastive Learning Approach for Training Variational Autoencoder Priors
J. Aneja
Alex Schwing
Jan Kautz
Arash Vahdat
DRL
532
99
0
06 Oct 2020
The Academia Sinica Systems of Voice Conversion for VCC2020
Yu-Huai Peng
Cheng-Hung Hu
A. Kang
Hung-Shin Lee
Pin-Yuan Chen
Yu Tsao
Hsin-Min Wang
148
2
0
06 Oct 2020
Implicit Rank-Minimizing Autoencoder
Neural Information Processing Systems (NeurIPS), 2020
Li Jing
Jure Zbontar
Yann LeCun
SSL
DRL
241
54
0
01 Oct 2020
The Utility of Decorrelating Colour Spaces in Vector Quantised Variational Autoencoders
A. Akbarinia
Raquel Gil-Rodríguez
Alban Flachot
Matteo Toscani
DRL
79
0
0
30 Sep 2020
Controllable Text Generation with Focused Variation
Findings (Findings), 2020
Lei Shu
Alexandros Papangelis
Yi-Chia Wang
Gokhan Tur
Hu Xu
Zhaleh Feizollahi
Bing-Quan Liu
Piero Molino
200
12
0
25 Sep 2020
A Unifying Review of Deep and Shallow Anomaly Detection
Proceedings of the IEEE (Proc. IEEE), 2020
Lukas Ruff
Jacob R. Kauffmann
Robert A. Vandermeulen
G. Montavon
Wojciech Samek
Matthias Kirchler
Thomas G. Dietterich
Klaus-Robert Muller
UQCV
626
937
0
24 Sep 2020
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Jaemin Cho
Jiasen Lu
Dustin Schwenk
Hannaneh Hajishirzi
Aniruddha Kembhavi
VLM
MLLM
231
106
0
23 Sep 2020
Generative Model without Prior Distribution Matching
Cong Geng
Jia Wang
Lixing Chen
Zhiyong Gao
GAN
750
1
0
23 Sep 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis
International Conference on Learning Representations (ICLR), 2020
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM
BDL
704
1,779
0
21 Sep 2020
Target Conditioning for One-to-Many Generation
Findings (Findings), 2020
Marie-Anne Lachaux
Armand Joulin
Guillaume Lample
156
14
0
21 Sep 2020
DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Chaoyou Fu
Xiang Wu
Yibo Hu
Huaibo Huang
Ran He
CVBM
156
94
0
20 Sep 2020
Discond-VAE: Disentangling Continuous Factors from the Discrete
Jaewoong Choi
Geonho Hwang
Myung-joo Kang
CoGe
CML
193
6
0
17 Sep 2020
Recurrent autoencoder with sequence-aware encoding
International Conference on Conceptual Structures (ICCS), 2020
Robert Susik
AI4TS
164
6
0
15 Sep 2020
Brain2Word: Decoding Brain Activity for Language Generation
Nicolas Affolter
Béni Egressy
Damian Pascual
Roger Wattenhofer
214
26
0
10 Sep 2020
not-so-BigGAN: Generating High-Fidelity Images on Small Compute with Wavelet-based Super-Resolution
Seung-Jun Han
Akash Srivastava
C. Hurwitz
P. Sattigeri
David D. Cox
122
10
0
09 Sep 2020
Multilinear Latent Conditioning for Generating Unseen Attribute Combinations
International Conference on Machine Learning (ICML), 2020
Markos Georgopoulos
Grigorios G. Chrysos
Maja Pantic
Yannis Panagakis
GAN
DRL
183
18
0
09 Sep 2020
Deep data compression for approximate ultrasonic image formation
IUS (IUS), 2020
G. Pilikos
L. Horchens
K. Batenburg
Tristan van Leeuwen
F. Lucka
138
9
0
04 Sep 2020
End-to-End Learning of Neuromorphic Wireless Systems for Low-Power Edge Artificial Intelligence
Asilomar Conference on Signals, Systems and Computers (Asilomar), 2020
N. Skatchkovsky
Hyeryung Jang
Osvaldo Simeone
202
26
0
03 Sep 2020
Stochastic Graph Recurrent Neural Network
Neurocomputing (Neurocomputing), 2020
Tijin Yan
Hongwei Zhang
Zirui Li
Yuanqing Xia
GNN
BDL
134
5
0
01 Sep 2020
GIF: Generative Interpretable Faces
International Conference on 3D Vision (3DV), 2020
Partha Ghosh
Pravir Singh Gupta
Roy Uziel
Anurag Ranjan
Michael J. Black
Timo Bolkart
CVBM
AI4CE
394
80
0
31 Aug 2020
Hierarchical Timbre-Painting and Articulation Generation
International Society for Music Information Retrieval Conference (ISMIR), 2020
Michael Michelashvili
Lior Wolf
221
12
0
30 Aug 2020
Dynamical Variational Autoencoders: A Comprehensive Review
Laurent Girin
Simon Leglaive
Xiaoyu Bie
Julien Diard
Thomas Hueber
Xavier Alameda-Pineda
BDL
499
268
0
28 Aug 2020
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion
Yi Zhao
Wen-Chin Huang
Xiaohai Tian
Junichi Yamagishi
Rohan Kumar Das
Tomi Kinnunen
Zhenhua Ling
Tomoki Toda
198
233
0
28 Aug 2020
Nonparallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
347
22
0
27 Aug 2020
asya: Mindful verbal communication using deep learning
Ē. Urtāns
Ariel Tabaks
VLM
198
1
0
20 Aug 2020
Unsupervised Acoustic Unit Representation Learning for Voice Conversion using WaveNet Auto-encoders
Mingjie Chen
Thomas Hain
SSL
DRL
127
16
0
16 Aug 2020
Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition
Shamane Siriwardhana
Andrew Reis
Rivindu Weerasekera
Suranga Nanayakkara
204
116
0
15 Aug 2020
Audio- and Gaze-driven Facial Animation of Codec Avatars
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020
Alexander Richard
Colin S. Lea
Shugao Ma
Juergen Gall
Fernando de la Torre
Yaser Sheikh
CVBM
166
89
0
11 Aug 2020
Unsupervised Learning For Sequence-to-sequence Text-to-speech For Low-resource Languages
Interspeech (Interspeech), 2020
Haitong Zhang
Yue Lin
123
33
0
11 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
450
389
0
09 Aug 2020
Learning Sampling in Financial Statement Audits using Vector Quantised Autoencoder Neural Networks
Marco Schreyer
Timur Sattarov
Anita Gierbl
Bernd Reimer
Damian Borth
DRL
110
3
0
06 Aug 2020
Optimal Variance Control of the Score Function Gradient Estimator for Importance Weighted Bounds
Valentin Liévin
Andrea Dittadi
Anders Christensen
Ole Winther
DRL
230
6
0
05 Aug 2020
Timbre latent space: exploration and creative aspects
Antoine Caillon
Adrien Bitton
Brice Gatinet
P. Esling
203
1
0
04 Aug 2020
Learning from Few Samples: A Survey
Nihar Bendre
Hugo Terashima-Marín
Peyman Najafirad
VLM
BDL
246
60
0
30 Jul 2020
Foveation for Segmentation of Ultra-High Resolution Images
Chen Jin
Ryutaro Tanno
Moucheng Xu
T. Mertzanidou
Daniel C. Alexander
AI4TS
194
4
0
29 Jul 2020
dMelodies: A Music Dataset for Disentanglement Learning
International Society for Music Information Retrieval Conference (ISMIR), 2020
Ashis Pati
Siddharth Gururani
Alexander Lerch
CoGe
DRL
153
11
0
29 Jul 2020
Privacy-preserving Voice Analysis via Disentangled Representations
Ranya Aloufi
Hamed Haddadi
David E. Boyle
DRL
306
62
0
29 Jul 2020
Generative networks as inverse problems with fractional wavelet scattering networks
Jiasong Wu
Jing Zhang
Fuzhi Wu
Youyong Kong
Guanyu Yang
L. Senhadji
H. Shu
GAN
171
1
0
28 Jul 2020
Unsupervised Subword Modeling Using Autoregressive Pretraining and Cross-Lingual Phone-Aware Modeling
Interspeech (Interspeech), 2020
Siyuan Feng
O. Scharenborg
SSL
195
4
0
25 Jul 2020
Modal Uncertainty Estimation via Discrete Latent Representation
Di Qiu
L. Lui
UQCV
105
6
0
25 Jul 2020
Learning the Latent Space of Robot Dynamics for Cutting Interaction Inference
S. Rezaei-Shoshtari
David Meger
I. Sharf
BDL
DRL
129
3
0
22 Jul 2020
Word Representation for Rhythms
Tongyu Lu
Lucheng Yan
Gus Xia
174
0
0
21 Jul 2020
Generalizing Variational Autoencoders with Hierarchical Empirical Bayes
Wei Cheng
Gregory Darnell
Sohini Ramachandran
Lorin Crawford
BDL
107
3
0
20 Jul 2020
Incorporating Reinforced Adversarial Learning in Autoregressive Image Generation
Kenan E. Ak
N. Xu
Zhe Lin
Yilin Wang
212
13
0
20 Jul 2020
Relaxed-Responsibility Hierarchical Discrete VAEs
M. Willetts
Xenia Miscouridou
Stephen J. Roberts
Chris Holmes
BDL
DRL
247
5
0
14 Jul 2020
A Deep Learning Approach for Low-Latency Packet Loss Concealment of Audio Signals in Networked Music Performance Applications
Conference of the Open Innovations Association (FRUCT), 2020
Prateek Verma
Alessandro Ilic Mezza
C. Chafe
C. Rottondi
99
29
0
14 Jul 2020
Failure Modes of Variational Autoencoders and Their Effects on Downstream Tasks
Yaniv Yacoby
Weiwei Pan
Finale Doshi-Velez
CML
DRL
476
31
0
14 Jul 2020
Vector-Quantized Timbre Representation
Adrien Bitton
P. Esling
Tatsuya Harada
158
12
0
13 Jul 2020
Deep Retrieval: Learning A Retrievable Structure for Large-Scale Recommendations
Weihao Gao
Xiangjun Fan
Chong-Jun Wang
Jiankai Sun
Kai Jia
Wen Xiao
Ruofan Ding
Xingyan Bin
Hui Yang
Xiaobing Liu
CML
261
26
0
12 Jul 2020
Previous
1
2
3
...
70
71
72
...
75
76
77
Next
Page 71 of 77
Page
of 77
Go