ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.00937
  4. Cited By
Neural Discrete Representation Learning
v1v2 (latest)

Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDLSSLOCL
ArXiv (abs)PDFHTML

Papers citing "Neural Discrete Representation Learning"

50 / 3,807 papers shown
ImageBART: Bidirectional Context with Multinomial Diffusion for
  Autoregressive Image Synthesis
ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
Patrick Esser
Robin Rombach
A. Blattmann
Bjorn Ommer
DiffM
273
178
0
19 Aug 2021
Transformers predicting the future. Applying attention in next-frame and
  time series forecasting
Transformers predicting the future. Applying attention in next-frame and time series forecasting
Radostin Cholakov
T. Kolev
AI4TS
159
20
0
18 Aug 2021
Cross-modal Spectrum Transformation Network For Acoustic Scene
  classification
Cross-modal Spectrum Transformation Network For Acoustic Scene classificationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Yang Liu
A. Neophytou
Sunando Sengupta
Eric Sommerlade
179
9
0
13 Aug 2021
PixelSynth: Generating a 3D-Consistent Experience from a Single Image
PixelSynth: Generating a 3D-Consistent Experience from a Single ImageIEEE International Conference on Computer Vision (ICCV), 2021
C. Rockwell
David Fouhey
Justin Johnson
VGen
257
95
0
12 Aug 2021
Analysis of ODE2VAE with Examples
Analysis of ODE2VAE with Examples
Batuhan Koyuncu
DRL
103
0
0
10 Aug 2021
StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized
  by Automatic Speech Recognition
StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech RecognitionInterspeech (Interspeech), 2021
Shoki Sakamoto
Akira Taniguchi
T. Taniguchi
Hirokazu Kameoka
BDL
121
7
0
10 Aug 2021
Information Bottleneck Approach to Spatial Attention Learning
Information Bottleneck Approach to Spatial Attention LearningInternational Joint Conference on Artificial Intelligence (IJCAI), 2021
Qiuxia Lai
Yu Li
Ailing Zeng
Minhao Liu
Hanqiu Sun
Qiang Xu
204
13
0
07 Aug 2021
Applying the Information Bottleneck Principle to Prosodic Representation
  Learning
Applying the Information Bottleneck Principle to Prosodic Representation LearningInterspeech (Interspeech), 2021
Guangyan Zhang
Ying Qin
Daxin Tan
Tan Lee
216
5
0
05 Aug 2021
RockGPT: Reconstructing three-dimensional digital rocks from single
  two-dimensional slice from the perspective of video generation
RockGPT: Reconstructing three-dimensional digital rocks from single two-dimensional slice from the perspective of video generationComputational Geosciences (Comput. Geosci.), 2021
Qi Zheng
Dongxiao Zhang
138
27
0
05 Aug 2021
Deep Quantized Representation for Enhanced Reconstruction
Deep Quantized Representation for Enhanced Reconstruction
Akash Gupta
Abhishek Aich
Kevin Rodriguez
G. Reddy
Amit K. Roy-Chowdhury
154
5
0
29 Jul 2021
Fast and Scalable Image Search For Histology
Fast and Scalable Image Search For Histology
Chengkuan Chen
Ming Y. Lu
Drew F. K. Williamson
Tiffany Y. Chen
A. J. Schaumberg
Faisal Mahmood
73
3
0
28 Jul 2021
Unsupervised Learning of Neurosymbolic Encoders
Unsupervised Learning of Neurosymbolic Encoders
Eric Zhan
Jennifer J. Sun
Ann Kennedy
Yisong Yue
Swarat Chaudhuri
253
15
0
28 Jul 2021
Improving Robot Localisation by Ignoring Visual Distraction
Improving Robot Localisation by Ignoring Visual DistractionIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021
Oscar Alejandro Mendez Maldonado
M. Vowels
Richard Bowden
90
3
0
25 Jul 2021
Introducing: DeepHead, Wide-band Electromagnetic Imaging Paradigm
Introducing: DeepHead, Wide-band Electromagnetic Imaging Paradigm
Ahmed Al-Saffar
L. Guo
A. Abbosh
MedIm
62
0
0
23 Jul 2021
Abstract Reasoning via Logic-guided Generation
Abstract Reasoning via Logic-guided Generation
Sihyun Yu
Sangwoo Mo
SungSoo Ahn
Jinwoo Shin
218
7
0
22 Jul 2021
Data synthesis and adversarial networks: A review and meta-analysis in
  cancer imaging
Data synthesis and adversarial networks: A review and meta-analysis in cancer imaging
Richard Osuala
Kaisar Kushibar
Lidia Garrucho
Akis Linardos
Zuzanna Szafranowska
Stefan Klein
Ben Glocker
Oliver Díaz
Karim Lekadir
MedIm
347
60
0
20 Jul 2021
Generative Video Transformer: Can Objects be the Words?
Generative Video Transformer: Can Objects be the Words?International Conference on Machine Learning (ICML), 2021
Yi-Fu Wu
Jaesik Yoon
Sungjin Ahn
ViT
273
34
0
20 Jul 2021
GenRadar: Self-supervised Probabilistic Camera Synthesis based on Radar
  Frequencies
GenRadar: Self-supervised Probabilistic Camera Synthesis based on Radar FrequenciesIEEE Access (IEEE Access), 2021
Carsten Ditzel
Klaus C. J. Dietmayer
162
3
0
19 Jul 2021
Translatotron 2: High-quality direct speech-to-speech translation with
  voice preservation
Translatotron 2: High-quality direct speech-to-speech translation with voice preservationInternational Conference on Machine Learning (ICML), 2021
Ye Jia
Michelle Tadmor Ramanovich
Tal Remez
Roi Pomerantz
441
95
0
19 Jul 2021
Unsupervised Skill-Discovery and Skill-Learning in Minecraft
Unsupervised Skill-Discovery and Skill-Learning in Minecraft
J. J. Nieto
Roger Creus
Xavier Giró-i-Nieto
SSLDRL
168
5
0
18 Jul 2021
Learning De-identified Representations of Prosody from Raw Audio
Learning De-identified Representations of Prosody from Raw AudioInternational Conference on Machine Learning (ICML), 2021
J. Weston
R. Lenain
U. Meepegama
E. Fristed
SSL
199
18
0
17 Jul 2021
CCVS: Context-aware Controllable Video Synthesis
CCVS: Context-aware Controllable Video SynthesisNeural Information Processing Systems (NeurIPS), 2021
G. L. Moing
Jean Ponce
Cordelia Schmid
266
90
0
16 Jul 2021
Codified audio language modeling learns useful representations for music
  information retrieval
Codified audio language modeling learns useful representations for music information retrieval
Rodrigo Castellon
Chris Donahue
Abigail Z. Jacobs
264
107
0
12 Jul 2021
PocketVAE: A Two-step Model for Groove Generation and Control
PocketVAE: A Two-step Model for Groove Generation and Control
Kyungyun Lee
Wonil Kim
Juhan Nam
108
1
0
11 Jul 2021
SoundStream: An End-to-End Neural Audio Codec
SoundStream: An End-to-End Neural Audio Codec
Neil Zeghidour
Alejandro Luebs
Ahmed Omran
Jan Skoglund
Marco Tagliasacchi
AI4TS
517
1,129
0
07 Jul 2021
Discrete-Valued Neural Communication
Discrete-Valued Neural Communication
Dianbo Liu DianboLiu
Alex Lamb
Kenji Kawaguchi
Anirudh Goyal
Chen Sun
Michael C. Mozer
Yoshua Bengio
281
53
0
06 Jul 2021
Classical Planning in Deep Latent Space
Classical Planning in Deep Latent SpaceJournal of Artificial Intelligence Research (JAIR), 2021
Masataro Asai
Hiroshi Kajino
A. Fukunaga
Christian Muise
VLM
436
24
0
30 Jun 2021
A Generative Model for Raw Audio Using Transformer Architectures
A Generative Model for Raw Audio Using Transformer ArchitecturesInternational Conference on Digital Audio Effects (DAFx), 2021
Prateek Verma
C. Chafe
248
37
0
30 Jun 2021
CLIPDraw: Exploring Text-to-Drawing Synthesis through Language-Image
  Encoders
CLIPDraw: Exploring Text-to-Drawing Synthesis through Language-Image EncodersNeural Information Processing Systems (NeurIPS), 2021
Kevin Frans
Lisa Soros
Olaf Witkowski
CLIP
236
266
0
28 Jun 2021
Transflower: probabilistic autoregressive dance generation with
  multimodal attention
Transflower: probabilistic autoregressive dance generation with multimodal attentionACM Transactions on Graphics (TOG), 2021
Guillermo Valle Pérez
G. Henter
Jonas Beskow
A. Holzapfel
Pierre-Yves Oudeyer
Simon Alexanderson
375
49
0
25 Jun 2021
On Incorporating Inductive Biases into VAEs
On Incorporating Inductive Biases into VAEsInternational Conference on Learning Representations (ICLR), 2021
Ning Miao
Emile Mathieu
N. Siddharth
Yee Whye Teh
Tom Rainforth
CMLDRL
258
11
0
25 Jun 2021
Preliminary study on using vector quantization latent spaces for TTS/VC
  systems with consistent performance
Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance
Hieu-Thi Luong
Junichi Yamagishi
172
0
0
25 Jun 2021
NP-DRAW: A Non-Parametric Structured Latent Variable Model for Image
  Generation
NP-DRAW: A Non-Parametric Structured Latent Variable Model for Image Generation
Xiaohui Zeng
R. Urtasun
R. Zemel
Sanja Fidler
Renjie Liao
DiffM
128
2
0
25 Jun 2021
Generative Modeling for Multi-task Visual Learning
Generative Modeling for Multi-task Visual Learning
Zhipeng Bao
M. Hebert
Yu-Xiong Wang
134
18
0
25 Jun 2021
Handling Data Heterogeneity with Generative Replay in Collaborative
  Learning for Medical Imaging
Handling Data Heterogeneity with Generative Replay in Collaborative Learning for Medical Imaging
Liangqiong Qu
N. Balachandar
Miao Zhang
D. Rubin
MedIm
254
24
0
24 Jun 2021
VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive
  Learning
VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive Learning
Hao Tan
Jie Lei
Thomas Wolf
Joey Tianyi Zhou
213
73
0
21 Jun 2021
Trainable Class Prototypes for Few-Shot Learning
Trainable Class Prototypes for Few-Shot Learning
Jianyi Li
Guizhong Liu
VLM
187
2
0
21 Jun 2021
Discrete Auto-regressive Variational Attention Models for Text Modeling
Discrete Auto-regressive Variational Attention Models for Text Modeling
Xianghong Fang
Haoli Bai
Jian Li
Zenglin Xu
Michael Lyu
Irwin King
147
3
0
16 Jun 2021
Test Sample Accuracy Scales with Training Sample Density in Neural
  Networks
Test Sample Accuracy Scales with Training Sample Density in Neural Networks
Xu Ji
Razvan Pascanu
Devon Hjelm
Balaji Lakshminarayanan
Andrea Vedaldi
312
8
0
15 Jun 2021
Self-Supervised Learning with Kernel Dependence Maximization
Self-Supervised Learning with Kernel Dependence Maximization
Yazhe Li
Roman Pogodin
Danica J. Sutherland
Arthur Gretton
SSL
209
99
0
15 Jun 2021
BEiT: BERT Pre-Training of Image Transformers
BEiT: BERT Pre-Training of Image Transformers
Hangbo Bao
Li Dong
Songhao Piao
Furu Wei
ViT
856
3,424
0
15 Jun 2021
Improved Transformer for High-Resolution GANs
Improved Transformer for High-Resolution GANsNeural Information Processing Systems (NeurIPS), 2021
Long Zhao
Zizhao Zhang
Ting Chen
Dimitris N. Metaxas
Han Zhang
ViT
352
109
0
14 Jun 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked
  Prediction of Hidden Units
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden UnitsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Wei-Ning Hsu
Benjamin Bolte
Yifan Hao
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
599
4,061
0
14 Jun 2021
Learning the Imaging Landmarks: Unsupervised Key point Detection in Lung
  Ultrasound Videos
Learning the Imaging Landmarks: Unsupervised Key point Detection in Lung Ultrasound Videos
Arpan Tripathi
Mahesh Raveendranatha Panicker
A. Hareendranathan
Yale Tung Chen
Jacob L. Jaremko
K. Narayan
C. Kesavadas
87
0
0
13 Jun 2021
Inverting Adversarially Robust Networks for Image Synthesis
Inverting Adversarially Robust Networks for Image SynthesisAsian Conference on Computer Vision (ACCV), 2021
Renan A. Rojas-Gomez
Raymond A. Yeh
Minh Do
A. Nguyen
206
6
0
13 Jun 2021
D2C: Diffusion-Denoising Models for Few-shot Conditional Generation
D2C: Diffusion-Denoising Models for Few-shot Conditional GenerationNeural Information Processing Systems (NeurIPS), 2021
Abhishek Sinha
Jiaming Song
Chenlin Meng
Stefano Ermon
VLMDiffM
289
141
0
12 Jun 2021
Robust Representation Learning via Perceptual Similarity Metrics
Robust Representation Learning via Perceptual Similarity MetricsInternational Conference on Machine Learning (ICML), 2021
Saeid Asgari Taghanaki
Kristy Choi
Amir Khasahmadi
Anirudh Goyal
SSL
131
29
0
11 Jun 2021
Decoupled Greedy Learning of CNNs for Synchronous and Asynchronous
  Distributed Learning
Decoupled Greedy Learning of CNNs for Synchronous and Asynchronous Distributed Learning
Eugene Belilovsky
Louis Leconte
Lucas Caccia
Michael Eickenberg
Edouard Oyallon
169
9
0
11 Jun 2021
Conditional Variational Autoencoder with Adversarial Learning for
  End-to-End Text-to-Speech
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-SpeechInternational Conference on Machine Learning (ICML), 2021
Jaehyeon Kim
Jungil Kong
Juhee Son
DRL
303
1,163
0
11 Jun 2021
Score-based Generative Modeling in Latent Space
Score-based Generative Modeling in Latent SpaceNeural Information Processing Systems (NeurIPS), 2021
Arash Vahdat
Karsten Kreis
Jan Kautz
DiffM
443
808
0
10 Jun 2021
Previous
123...666768...757677
Next
Page 67 of 77
Pageof 77