Neural Discrete Representation Learning

2 November 2017

Papers citing "Neural Discrete Representation Learning"

50 / 3,807 papers shown

Anomaly detection through latent space restoration using vector-quantized variational autoencoders

IEEE International Symposium on Biomedical Imaging (ISBI), 2020

Jonathan Passerat-Palmbach

G. Tarroni

DRL

295

12 Dec 2020

DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization

Shaoshi Ling

Yuzong Liu

159

113

11 Dec 2020

Long Term Motion Prediction Using Keyposes

425

08 Dec 2020

I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at Pitch

Joseph P. Turian

Max Henry

245

08 Dec 2020

Bayesian Image Reconstruction using Deep Generative Models

382

08 Dec 2020

Extractive Opinion Summarization in Quantized Transformer Spaces

298

118

08 Dec 2020

Multi-Instrumentalist Net: Unsupervised Generation of Music from Body Movements

Kun Su

Xiulong Liu

Eli Shlizerman

242

07 Dec 2020

Learning Vector Quantized Shape Code for Amodal Blastomere Instance Segmentation

IEEE International Symposium on Biomedical Imaging (ISBI), 2020

193

02 Dec 2020

Mutual Information Constraints for Monte-Carlo Objectives

Journal of Machine Learning Research (JMLR), 2020

Gábor Melis

András Gyorgy

Phil Blunsom

223

01 Dec 2020

Unpaired Image-to-Image Translation via Latent Energy Transport

Computer Vision and Pattern Recognition (CVPR), 2020

Yang Zhao

Changyou Chen

364

01 Dec 2020

Latent Programmer: Discrete Latent Codes for Program Synthesis

International Conference on Machine Learning (ICML), 2020

363

01 Dec 2020

Direct Evolutionary Optimization of Variational Autoencoders With Binary Latents

249

27 Nov 2020

Neural Representations for Modeling Variation in Speech

197

25 Nov 2020

BinPlay: A Binary Latent Autoencoder for Generative Replay Continual Learning

IEEE International Joint Conference on Neural Network (IJCNN), 2020

162

25 Nov 2020

Mixture-based Feature Space Learning for Few-shot Image Classification

IEEE International Conference on Computer Vision (ICCV), 2020

Arman Afrasiyabi

Jean-François Lalonde

Christian Gagné

VLM

324

24 Nov 2020

Parrot: Data-Driven Behavioral Priors for Reinforcement Learning

International Conference on Learning Representations (ICLR), 2020

258

159

19 Nov 2020

Latent Adversarial Debiasing: Mitigating Collider Bias in Deep Neural Networks

L. N. Darlow

Stanisław Jastrzębski

Amos Storkey

247

19 Nov 2020

End-To-End Dilated Variational Autoencoder with Bottleneck Discriminative Loss for Sound Morphing -- A Preliminary Study

Matteo Lionello

Hendrik Purwins

155

19 Nov 2020

Neuro-Symbolic Representations for Video Captioning: A Case for Leveraging Inductive Biases for Vision and Language

Jianwei Yang

210

18 Nov 2020

Optimizing voice conversion network with cycle consistency loss of speaker identity

Spoken Language Technology Workshop (SLT), 2020

Hongqiang Du

Xiaohai Tian

Lei Xie

Haizhou Li

162

17 Nov 2020

A Comprehensive Survey on Deep Music Generation: Multi-level Representations, Algorithms, Evaluations, and Future Directions

269

143

13 Nov 2020

Hierarchical Prosody Modeling for Non-Autoregressive Speech Synthesis

Spoken Language Technology Workshop (SLT), 2020

C. Chien

Hung-yi Lee

292

12 Nov 2020

Testing for Typicality with Respect to an Ensemble of Learned Distributions

F. Laine

Claire Tomlin

132

11 Nov 2020

Semi-supervised Learning for Singing Synthesis Timbre

J. Bonada

Merlijn Blaauw

132

05 Nov 2020

A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery

174

04 Nov 2020

Paralinguistic Privacy Protection at the Edge

Ranya Aloufi

Hamed Haddadi

David E. Boyle

253

04 Nov 2020

Mixing Consistent Deep Clustering

03 Nov 2020

CVC: Contrastive Learning for Non-parallel Voice Conversion

Interspeech (Interspeech), 2020

Hang Zhao

205

02 Nov 2020

Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies

Interspeech (Interspeech), 2020

227

01 Nov 2020

AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance Normalization

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

287

122

31 Oct 2020

GENs: Generative Encoding Networks

Machine-mediated learning (ML), 2020

183

28 Oct 2020

Scaling Laws for Autoregressive Generative Modeling

...

476

560

28 Oct 2020

Spiking Neural Networks -- Part III: Neuromorphic Communications

IEEE Communications Letters (IEEE Commun. Lett.), 2020

N. Skatchkovsky

Hyeryung Jang

Osvaldo Simeone

183

27 Oct 2020

Unsupervised Learning of Disentangled Speech Content and Style Representation

Interspeech (Interspeech), 2020

263

24 Oct 2020

A Comparison of Discrete Latent Variable Models for Speech Representation Learning

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

123

24 Oct 2020

Representation Learning for High-Dimensional Data Collection under Local Differential Privacy

254

23 Oct 2020

A Framework for Generative and Contrastive Learning of Audio Representations

Prateek Verma

J. Smith

SSL

220

22 Oct 2020

Learning Speaker Embedding from Text-to-Speech

Jaejin Cho

Piotr Żelasko

Jesus Villalba

Shinji Watanabe

Najim Dehak

140

21 Oct 2020

Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

223

21 Oct 2020

Towards End-to-End In-Image Neural Machine Translation

185

20 Oct 2020

End-to-End Text-to-Speech using Latent Duration based on VQ-VAE

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Yusuke Yasuda

Xin Wang

Junichi Yamagishi

173

19 Oct 2020

CLAR: Contrastive Learning of Auditory Representations

International Conference on Artificial Intelligence and Statistics (AISTATS), 2020

Haider Al-Tahan

Y. Mohsenzadeh

SSL

383

19 Oct 2020

Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

Neural Information Processing Systems (NeurIPS), 2020

Masha Itkina

Boris Ivanovic

Ransalu Senanayake

Mykel J. Kochenderfer

Marco Pavone

243

19 Oct 2020

CQ-VAE: Coordinate Quantized VAE for Uncertainty Estimation with Application to Disk Shape Analysis from Lumbar Spine MRI Images

International Conference on Machine Learning and Applications (ICMLA), 2020

105

17 Oct 2020

Latent Vector Recovery of Audio GANs

16 Oct 2020

The NeteaseGames System for Voice Conversion Challenge 2020 with Vector-quantization Variational Autoencoder and WaveNet

Haitong Zhang

DRL

110

15 Oct 2020

Smaller World Models for Reinforcement Learning

102

12 Oct 2020

Deep Sequence Learning for Video Anticipation: From Discrete and Deterministic to Continuous and Stochastic

S. Aliakbarian

AI4TS

128

09 Oct 2020

Event Representation with Sequential, Semi-Supervised Discrete Variables

North American Chapter of the Association for Computational Linguistics (NAACL), 2020

Mehdi Rezaee

Francis Ferraro

BDL DRL

226

09 Oct 2020

FastVC: Fast Voice Conversion with non-parallel data

Oriol Barbany

Milos Cernak

130

08 Oct 2020