Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL, SSL, OCL
arXiv: 1711.00937

Papers citing "Neural Discrete Representation Learning"

50 / 3,807 papers shown
Unsupervised Acoustic Unit Discovery by Leveraging a Language-Independent Subword Discriminative Feature Representation
Interspeech (Interspeech), 2021
Siyuan Feng
Piotr Żelasko
Laureano Moro-Velazquez
O. Scharenborg
181
4
0
02 Apr 2021
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations
Interspeech (Interspeech), 2021
Adam Polyak
Yossi Adi
Jade Copet
Eugene Kharitonov
Kushal Lakhotia
Wei-Ning Hsu
Abdel-rahman Mohamed
Emmanuel Dupoux
461
371
0
01 Apr 2021
A Closer Look at Fourier Spectrum Discrepancies for CNN-generated Images Detection
Computer Vision and Pattern Recognition (CVPR), 2021
Keshigeyan Chandrasegaran
Ngoc-Trung Tran
Ngai-Man Cheung
211
97
0
31 Mar 2021
Unsupervised Disentanglement of Linear-Encoded Facial Semantics
Computer Vision and Pattern Recognition (CVPR), 2021
Yutong Zheng
Yu-Kai Huang
R. Tao
Zhiqiang Shen
Marios Savvides
CVBM, DRL
143
14
0
30 Mar 2021
PixelTransformer: Sample Conditioned Signal Generation
International Conference on Machine Learning (ICML), 2021
Shubham Tulsiani
Abhinav Gupta
167
18
0
29 Mar 2021
Scalable and Efficient Neural Speech Coding: A Hybrid Design
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Kai Zhen
Jongmo Sung
Mi Suk Lee
Seung-Wha Beack
Minje Kim
237
19
0
27 Mar 2021
Decomposing Normal and Abnormal Features of Medical Images into Discrete Latent Codes for Content-Based Image Retrieval
Kazuma Kobayashi
Ryuichiro Hataya
Y. Kurose
M. Miyake
Masamichi Takahashi
Akiko Nakagawa
Tatsuya Harada
Ryuji Hamamoto
MedIm
272
22
0
23 Mar 2021
Tiny Transformers for Environmental Sound Classification at the Edge
David Elliott
Carlos E. Otero
Steven Wyatt
Evan Martino
156
20
0
22 Mar 2021
Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE
Computer Vision and Pattern Recognition (CVPR), 2021
Jialun Peng
Dong Liu
Songcen Xu
Houqiang Li
DiffM
184
232
0
18 Mar 2021
Variable-rate discrete representation learning
Sander Dieleman
C. Nash
Jesse Engel
Karen Simonyan
BDL, DRL
209
32
0
10 Mar 2021
Wav2vec-C: A Self-supervised Model for Speech Representation Learning
Interspeech (Interspeech), 2021
Samik Sadhu
Di He
Che-Wei Huang
Sri Harish Reddy Mallidi
Minhua Wu
Ariya Rastrow
A. Stolcke
J. Droppo
Roland Maas
SSL
217
52
0
09 Mar 2021
Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Sam Bond-Taylor
Adam Leach
Yang Long
Chris G. Willcocks
VLM, TPM
727
630
0
08 Mar 2021
Learning to Generate 3D Shapes with Generative Cellular Automata
International Conference on Learning Representations (ICLR), 2021
Dongsu Zhang
Changwoon Choi
Jeonghwan Kim
Y. Kim
155
28
0
06 Mar 2021
Generating Images with Sparse Representations
International Conference on Machine Learning (ICML), 2021
C. Nash
Jacob Menick
Sander Dieleman
Peter W. Battaglia
236
269
0
05 Mar 2021
crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Kazuhiro Kobayashi
Wen-Chin Huang
Yi-Chiao Wu
Patrick Lumban Tobing
Tomoki Hayashi
Tomoki Toda
BDL, DRL
150
19
0
04 Mar 2021
Enabling Visual Action Planning for Object Manipulation through Latent Space Roadmap
IEEE Transactions on Robotics (TRO), 2021
M. Lippi
Petra Poklukar
Michael C. Welle
Anastasia Varava
Hang Yin
Alessandro Marino
Danica Kragic
266
17
0
03 Mar 2021
Predicting Video with VQVAE
Jacob Walker
Ali Razavi
Aaron van den Oord
DRL
247
75
0
02 Mar 2021
A survey on Variational Autoencoders from a GreenAI perspective
SN Computer Science (SN Comput. Sci.), 2021
Andrea Asperti
David Evangelista
E. Loli Piccolomini
DRL
193
68
0
01 Mar 2021
M6: A Chinese Multimodal Pretrainer
Junyang Lin
Rui Men
An Yang
Chan Zhou
Ming Ding
...
Yong Li
Jialin Li
Jingren Zhou
J. Tang
Hongxia Yang
VLM, MoE
347
147
0
01 Mar 2021
Zero-Shot Text-to-Image Generation
International Conference on Machine Learning (ICML), 2021
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
829
6,024
0
24 Feb 2021
Unsupervised Brain Anomaly Detection and Segmentation with Transformers
International Conference on Medical Imaging with Deep Learning (MIDL), 2021
W. H. Pinaya
Petru-Daniel Tudosiu
Robert J. Gray
G. Rees
P. Nachev
Sebastien Ourselin
M. Jorge Cardoso
ViT, MedIm
149
68
0
23 Feb 2021
Anytime Sampling for Autoregressive Models via Ordered Autoencoding
International Conference on Learning Representations (ICLR), 2021
Yilun Xu
Yang Song
Sahaj Garg
Linyuan Gong
Rui Shu
Aditya Grover
Stefano Ermon
DiffM
189
13
0
23 Feb 2021
Uncertainty Estimation Using Riemannian Model Dynamics for Offline Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2021
Guy Tennenholtz
Shie Mannor
OffRL
227
14
0
22 Feb 2021
Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding
International Conference on Machine Learning (ICML), 2021
Yangjun Ruan
Karen Ullrich
Daniel de Souza Severo
James Townsend
Ashish Khisti
Arnaud Doucet
Alireza Makhzani
Chris J. Maddison
238
25
0
22 Feb 2021
Measuring the Stability of Learned Features
Kris Sankaran
OOD
84
0
0
20 Feb 2021
Preventing Oversmoothing in VAE via Generalized Variance Parameterization
Neurocomputing (Neurocomputing), 2021
Yuhta Takida
Wei-Hsiang Liao
Chieh-Hsin Lai
Toshimitsu Uesaka
Shusuke Takahashi
Yuki Mitsufuji
DRL
219
18
0
17 Feb 2021
Enhancing into the codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Jonah Casebeer
Vinjai Vale
Umut Isik
J. Valin
Ritwik Giri
A. Krishnaswamy
200
25
0
12 Feb 2021
VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention
Peng Liu
Yuewen Cao
Songxiang Liu
Na Hu
Guangzhi Li
Chao Weng
Jane Polak Scowcroft
183
23
0
12 Feb 2021
Self-Supervised VQ-VAE for One-Shot Music Style Transfer
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Ondřej Cífka
A. Ozerov
Umut Simsekli
G. Richard
224
36
0
10 Feb 2021
Inducing Meaningful Units from Character Sequences with Dynamic Capacity Slot Attention
Melika Behjati
James Henderson
OCL
171
1
0
01 Feb 2021
Generative Spoken Language Modeling from Raw Audio
Transactions of the Association for Computational Linguistics (TACL), 2021
Kushal Lakhotia
Evgeny Kharitonov
Wei-Ning Hsu
Yossi Adi
Adam Polyak
...
Tu Nguyen
Jade Copet
Alexei Baevski
A. Mohamed
Emmanuel Dupoux
AuLLM
643
439
0
01 Feb 2021
CNN with large memory layers
R. Karimov
Yury Malkov
Karim Iskakov
Victor Lempitsky
223
0
0
27 Jan 2021
Disentangled Sequence Clustering for Human Intention Inference
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021
Mark Zolotas
Y. Demiris
DRL
338
6
0
23 Jan 2021
Hierarchical disentangled representation learning for singing voice conversion
IEEE International Joint Conference on Neural Networks (IJCNN), 2021
Naoya Takahashi
M. Singh
Yuki Mitsufuji
DRL
181
19
0
18 Jan 2021
Cauchy-Schwarz Regularized Autoencoder
Journal of Machine Learning Research (JMLR), 2021
Linh-Tam Tran
Maja Pantic
M. Deisenroth
DRL, BDL
245
19
0
06 Jan 2021
HAVANA: Hierarchical and Variation-Normalized Autoencoder for Person Re-identification
Jiawei Ren
Xiao Ma
Chen Xu
Haiyu Zhao
Shuai Yi
BDL
267
5
0
06 Jan 2021
Psychoacoustic Calibration of Loss Functions for Efficient End-to-End Neural Audio Coding
IEEE Signal Processing Letters (IEEE SPL), 2020
Kai Zhen
Mi Suk Lee
Jongmo Sung
Seung-Wha Beack
Minje Kim
206
26
0
31 Dec 2020
Discovering Dialog Structure Graph for Open-Domain Dialog Generation
Jun Xu
Zeyang Lei
Haifeng Wang
Zheng-Yu Niu
Hua Wu
Wanxiang Che
Ting Liu
193
6
0
31 Dec 2020
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Wei-Ning Hsu
David Harwath
Christopher Song
James R. Glass
CLIP
191
74
0
31 Dec 2020
Interpretable NLG for Task-oriented Dialogue Systems with Heterogeneous Rendering Machines
AAAI Conference on Artificial Intelligence (AAAI), 2020
Yangming Li
Kaisheng Yao
188
4
0
29 Dec 2020
A Survey on Visual Transformer
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Kai Han
Yunhe Wang
Hanting Chen
Xinghao Chen
Jianyuan Guo
...
Chunjing Xu
Yixing Xu
Zhaohui Yang
Yiman Zhang
Dacheng Tao
ViT
1.1K
3,127
0
23 Dec 2020
Motif-Driven Contrastive Learning of Graph Representations
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2020
Shichang Zhang
Ziniu Hu
Arjun Subramonian
Luke Huan
SSL
244
11
0
23 Dec 2020
OBoW: Online Bag-of-Visual-Words Generation for Self-Supervised Learning
Computer Vision and Pattern Recognition (CVPR), 2020
Spyros Gidaris
Andrei Bursuc
Gilles Puy
N. Komodakis
Matthieu Cord
P. Pérez
SSL
297
81
0
21 Dec 2020
Exploring Fluent Query Reformulations with Text-to-Text Transformers and Reinforcement Learning
Jerry Zikun Chen
S. Yu
Haoran Wang
913
5
0
18 Dec 2020
Taming Transformers for High-Resolution Image Synthesis
Computer Vision and Pattern Recognition (CVPR), 2020
Patrick Esser
Robin Rombach
Bjorn Ommer
ViT
740
3,822
0
17 Dec 2020
The effectiveness of unsupervised subword modeling with autoregressive and cross-lingual phone-aware networks
IEEE Open Journal of Signal Processing (JOSP), 2020
Siyuan Feng
O. Scharenborg
SSL
222
3
0
17 Dec 2020
Computational principles of intelligence: learning and reasoning with neural networks
Abel Torres Montoya
PINN, AI4CE
117
1
0
17 Dec 2020
Planning from Pixels in Atari with Learned Symbolic Representations
AAAI Conference on Artificial Intelligence (AAAI), 2020
Andrea Dittadi
Frederik K. Drachmann
Thomas Bolander
358
11
0
16 Dec 2020
Unsupervised Learning of Global Factors in Deep Generative Models
Pattern Recognition (Pattern Recognit.), 2020
I. Peis
Pablo M. Olmos
Antonio Artés-Rodríguez
BDL, DRL
220
13
0
15 Dec 2020
Towards unsupervised phone and word segmentation using self-supervised vector-quantized neural networks
Interspeech (Interspeech), 2020
Herman Kamper
Benjamin van Niekerk
SSL, MQ
297
38
0
14 Dec 2020