ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.00937
  4. Cited By
Neural Discrete Representation Learning
v1v2 (latest)

Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDLSSLOCL
ArXiv (abs)PDFHTML

Papers citing "Neural Discrete Representation Learning"

50 / 3,807 papers shown
Cross-Modal Discrete Representation Learning
Cross-Modal Discrete Representation LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Alexander H. Liu
SouYoung Jin
Cheng-I Jeff Lai
Andrew Rouditchenko
A. Oliva
James R. Glass
SSL
140
52
0
10 Jun 2021
Domain Specific Transporter Framework to Detect Fractures in Ultrasound
Domain Specific Transporter Framework to Detect Fractures in UltrasoundAnnual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2021
Arpan Tripathi
A. Hareendranathan
Mahesh Raveendranatha Panicker
Jack Zhang
Naveenjyote Boora
Jacob L. Jaremko
44
0
0
09 Jun 2021
Vector Quantized Models for Planning
Vector Quantized Models for PlanningInternational Conference on Machine Learning (ICML), 2021
Sherjil Ozair
Yazhe Li
Ali Razavi
Ioannis Antonoglou
Aaron van den Oord
Oriol Vinyals
OffRL
210
58
0
08 Jun 2021
Unsupervised Word Segmentation from Discrete Speech Units in
  Low-Resource Settings
Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings
Marcely Zanon Boito
Bolaji Yusuf
Lucas Ondel
Aline Villavicencio
Laurent Besacier
152
4
0
08 Jun 2021
NWT: Towards natural audio-to-video generation with representation
  learning
NWT: Towards natural audio-to-video generation with representation learning
Rayhane Mama
Marc S. Tyndel
Hashiam Kadhim
Cole Clifford
Ragavan Thurairatnam
VGen
249
13
0
08 Jun 2021
Interpretable agent communication from scratch (with a generic visual
  processor emerging on the side)
Interpretable agent communication from scratch (with a generic visual processor emerging on the side)Neural Information Processing Systems (NeurIPS), 2021
Roberto Dessì
Eugene Kharitonov
Marco Baroni
209
31
0
08 Jun 2021
Weakly-supervised word-level pronunciation error detection in non-native
  English speech
Weakly-supervised word-level pronunciation error detection in non-native English speechInterspeech (Interspeech), 2021
Daniel Korzekwa
Jaime Lorenzo-Trueba
Thomas Drugman
Shira Calamaro
B. Kostek
115
14
0
07 Jun 2021
Neural Distributed Source Coding
Neural Distributed Source CodingIEEE Journal on Selected Areas in Information Theory (JSAIT), 2021
Jay Whang
Alliot Nagle
Anish Acharya
Hyeji Kim
A. Dimakis
376
25
0
05 Jun 2021
On Perceptual Lossy Compression: The Cost of Perceptual Reconstruction
  and An Optimal Training Framework
On Perceptual Lossy Compression: The Cost of Perceptual Reconstruction and An Optimal Training FrameworkInternational Conference on Machine Learning (ICML), 2021
Zeyu Yan
Fei Wen
R. Ying
Chao Ma
Peilin Liu
219
44
0
05 Jun 2021
The Image Local Autoregressive Transformer
The Image Local Autoregressive TransformerNeural Information Processing Systems (NeurIPS), 2021
Chenjie Cao
Yue Hong
Xiang Li
Chengrong Wang
C. Xu
Xiangyang Xue
Yanwei Fu
180
15
0
04 Jun 2021
Segmental Contrastive Predictive Coding for Unsupervised Word
  Segmentation
Segmental Contrastive Predictive Coding for Unsupervised Word SegmentationInterspeech (Interspeech), 2021
Saurabhchand Bhati
Jesús Villalba
Piotr Żelasko
Laureano Moro-Velazquez
Najim Dehak
SSL
206
43
0
03 Jun 2021
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker
  Identity in Dysarthric Voice Conversion
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice ConversionInterspeech (Interspeech), 2021
Wen-Chin Huang
Kazuhiro Kobayashi
Yu-Huai Peng
Ching-Feng Liu
Yu Tsao
Hsin-Min Wang
Tomoki Toda
156
14
0
02 Jun 2021
Container: Context Aggregation Network
Container: Context Aggregation NetworkNeural Information Processing Systems (NeurIPS), 2021
Peng Gao
Jiasen Lu
Jiaming Song
Roozbeh Mottaghi
Aniruddha Kembhavi
ViT
288
81
0
02 Jun 2021
What Can I Do Here? Learning New Skills by Imagining Visual Affordances
What Can I Do Here? Learning New Skills by Imagining Visual AffordancesIEEE International Conference on Robotics and Automation (ICRA), 2021
Alexander Khazatsky
Ashvin Nair
Dan Jing
Sergey Levine
LM&Ro
270
36
0
01 Jun 2021
DISSECT: Disentangled Simultaneous Explanations via Concept Traversals
DISSECT: Disentangled Simultaneous Explanations via Concept TraversalsInternational Conference on Learning Representations (ICLR), 2021
Asma Ghandeharioun
Been Kim
Chun-Liang Li
Brendan Jou
B. Eoff
Rosalind W. Picard
AAML
332
58
0
31 May 2021
Factorising Meaning and Form for Intent-Preserving Paraphrasing
Factorising Meaning and Form for Intent-Preserving ParaphrasingAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Tom Hosking
Mirella Lapata
OOD
201
47
0
31 May 2021
Cascaded Diffusion Models for High Fidelity Image Generation
Cascaded Diffusion Models for High Fidelity Image GenerationJournal of machine learning research (JMLR), 2021
Jonathan Ho
Chitwan Saharia
William Chan
David J. Fleet
Mohammad Norouzi
Tim Salimans
818
1,500
0
30 May 2021
Diffusion-Based Representation Learning
Diffusion-Based Representation LearningInternational Conference on Machine Learning (ICML), 2021
K. Abstreiter
Sarthak Mittal
Stefan Bauer
Bernhard Schölkopf
Arash Mehrjou
DiffM
217
68
0
29 May 2021
M6-UFC: Unifying Multi-Modal Controls for Conditional Image Synthesis
  via Non-Autoregressive Generative Transformers
M6-UFC: Unifying Multi-Modal Controls for Conditional Image Synthesis via Non-Autoregressive Generative Transformers
Zhu Zhang
Jianxin Ma
Chang Zhou
Rui Men
Zhikang Li
Ming Ding
Jie Tang
Jingren Zhou
Hongxia Yang
352
48
0
29 May 2021
CogView: Mastering Text-to-Image Generation via Transformers
CogView: Mastering Text-to-Image Generation via TransformersNeural Information Processing Systems (NeurIPS), 2021
Ming Ding
Zhuoyi Yang
Wenyi Hong
Wendi Zheng
Chang Zhou
...
Junyang Lin
Xu Zou
Zhou Shao
Hongxia Yang
Jie Tang
ViTVLM
475
924
0
26 May 2021
Deep Neural Networks and End-to-End Learning for Audio Compression
Deep Neural Networks and End-to-End Learning for Audio CompressionJournal of KIISE (JKIISE), 2021
Daniela N. Rim
I. Jang
Heeyoul Choi
186
14
0
25 May 2021
EXoN: EXplainable encoder Network
EXoN: EXplainable encoder Network
SeungHwan An
Hosik Choi
Jong-June Jeon
BDLDRL
224
5
0
23 May 2021
Compositional Fine-Grained Low-Shot Learning
Compositional Fine-Grained Low-Shot Learning
Dat T. Huynh
Ehsan Elhamifar
148
6
0
21 May 2021
Combining Transformer Generators with Convolutional Discriminators
Combining Transformer Generators with Convolutional DiscriminatorsDeutsche Jahrestagung für Künstliche Intelligenz (KI), 2021
Ricard Durall
Stanislav Frolov
Jörn Hees
Federico Raue
Franz-Josef Pfreundt
Andreas Dengel
J. Keuper
ViT
309
18
0
21 May 2021
Parallel and Flexible Sampling from Autoregressive Models via Langevin
  Dynamics
Parallel and Flexible Sampling from Autoregressive Models via Langevin DynamicsInternational Conference on Machine Learning (ICML), 2021
V. Jayaram
John Thickstun
DiffM
228
26
0
17 May 2021
Priors in Bayesian Deep Learning: A Review
Priors in Bayesian Deep Learning: A ReviewInternational Statistical Review (ISR), 2021
Vincent Fortuin
UQCVBDL
453
159
0
14 May 2021
High-Resolution Complex Scene Synthesis with Transformers
High-Resolution Complex Scene Synthesis with Transformers
Manuel Jahn
Robin Rombach
Bjorn Ommer
ViT
168
40
0
13 May 2021
Autoencoding Under Normalization Constraints
Autoencoding Under Normalization ConstraintsInternational Conference on Machine Learning (ICML), 2021
Sangwoong Yoon
Yung-Kyun Noh
Frank C. Park
OODDUQCV
313
43
0
12 May 2021
Discrete representations in neural models of spoken language
Discrete representations in neural models of spoken languageBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2021
Bertrand Higy
Lieke Gelderloos
Afra Alishahi
Grzegorz Chrupała
257
6
0
12 May 2021
Diffusion Models Beat GANs on Image Synthesis
Diffusion Models Beat GANs on Image SynthesisNeural Information Processing Systems (NeurIPS), 2021
Prafulla Dhariwal
Alex Nichol
3.1K
10,466
0
11 May 2021
MuseMorphose: Full-Song and Fine-Grained Piano Music Style Transfer with
  One Transformer VAE
MuseMorphose: Full-Song and Fine-Grained Piano Music Style Transfer with One Transformer VAEIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Shih-Lun Wu
Yi-Hsuan Yang
ViT
375
75
0
10 May 2021
Voice Conversion Based Speaker Normalization for Acoustic Unit Discovery
Voice Conversion Based Speaker Normalization for Acoustic Unit Discovery
Thomas Glarner
Janek Ebbers
Reinhold Häb-Umbach
DRL
83
1
0
04 May 2021
VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using
  Vector-Quantized Contrastive Predictive Coding
VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive CodingIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2021
J. Nistal
Cyran Aouameur
Stefan Lattner
G. Richard
274
7
0
04 May 2021
OCTOPUS: Overcoming Performance andPrivatization Bottlenecks in
  Distributed Learning
OCTOPUS: Overcoming Performance andPrivatization Bottlenecks in Distributed LearningIEEE Transactions on Parallel and Distributed Systems (TPDS), 2021
Shuo Wang
Surya Nepal
Kristen Moore
M. Grobler
Carsten Rudolph
A. Abuadbba
FedML
207
8
0
03 May 2021
GODIVA: Generating Open-DomaIn Videos from nAtural Descriptions
GODIVA: Generating Open-DomaIn Videos from nAtural Descriptions
Chenfei Wu
Lun Huang
Qianxi Zhang
Binyang Li
Lei Ji
Fan Yang
Guillermo Sapiro
Nan Duan
DiffMVGen
262
293
0
30 Apr 2021
Eccentric Regularization: Minimizing Hyperspherical Energy without
  explicit projection
Eccentric Regularization: Minimizing Hyperspherical Energy without explicit projectionIEEE International Joint Conference on Neural Network (IJCNN), 2021
Xuefeng Li
Alan Blair
178
0
0
23 Apr 2021
Protecting gender and identity with disentangled speech representations
Protecting gender and identity with disentangled speech representationsInterspeech (Interspeech), 2021
Dimitrios Stoidis
Andrea Cavallaro
214
12
0
22 Apr 2021
IB-DRR: Incremental Learning with Information-Back Discrete
  Representation Replay
IB-DRR: Incremental Learning with Information-Back Discrete Representation Replay
Jian Jiang
Edoardo Cetin
Oya Celiktutan
159
12
0
21 Apr 2021
VideoGPT: Video Generation using VQ-VAE and Transformers
VideoGPT: Video Generation using VQ-VAE and Transformers
Wilson Yan
Yunzhi Zhang
Pieter Abbeel
A. Srinivas
ViTVGen
646
647
0
20 Apr 2021
Conditional Variational Capsule Network for Open Set Recognition
Conditional Variational Capsule Network for Open Set RecognitionIEEE International Conference on Computer Vision (ICCV), 2021
Yunrui Guo
Guglielmo Camporese
Wenjing Yang
A. Sperduti
Lamberto Ballan
BDL
225
55
0
19 Apr 2021
Geometry-Free View Synthesis: Transformers and no 3D Priors
Geometry-Free View Synthesis: Transformers and no 3D PriorsIEEE International Conference on Computer Vision (ICCV), 2021
Robin Rombach
Patrick Esser
Bjorn Ommer
ViT
417
110
0
15 Apr 2021
Spectrogram Inpainting for Interactive Generation of Instrument Sounds
Spectrogram Inpainting for Interactive Generation of Instrument Sounds
Théis Bazin
Gaëtan Hadjeres
P. Esling
M. Malt
131
12
0
15 Apr 2021
FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice
  Conversion
FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice Conversion
Hirokazu Kameoka
Kou Tanaka
Takuhiro Kaneko
230
23
0
14 Apr 2021
NoiseVC: Towards High Quality Zero-Shot Voice Conversion
NoiseVC: Towards High Quality Zero-Shot Voice Conversion
Shijun Wang
Damian Borth
DRL
151
6
0
13 Apr 2021
Visiting the Invisible: Layer-by-Layer Completed Scene Decomposition
Visiting the Invisible: Layer-by-Layer Completed Scene DecompositionInternational Journal of Computer Vision (IJCV), 2021
Chuanxia Zheng
Duy-Son Dao
Guoxian Song
Tat-Jen Cham
Jianfei Cai
108
25
0
12 Apr 2021
Boltzmann Tuning of Generative Models
Boltzmann Tuning of Generative Models
Victor Berger
Michele Sebag
150
0
0
12 Apr 2021
Learned transform compression with optimized entropy encoding
Learned transform compression with optimized entropy encoding
Magda Gregorova
Marc Desaules
Alexandros Kalousis
186
2
0
07 Apr 2021
Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language
  Representation Learning
Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation LearningComputer Vision and Pattern Recognition (CVPR), 2021
Zhicheng Huang
Zhaoyang Zeng
Yupan Huang
Bei Liu
Dongmei Fu
Jianlong Fu
VLMViT
457
302
0
07 Apr 2021
Creativity and Machine Learning: A Survey
Creativity and Machine Learning: A SurveyACM Computing Surveys (CSUR), 2021
Giorgio Franceschelli
Mirco Musolesi
VLMAI4CE
556
56
0
06 Apr 2021
Defending Against Image Corruptions Through Adversarial Augmentations
Defending Against Image Corruptions Through Adversarial AugmentationsInternational Conference on Learning Representations (ICLR), 2021
D. A. Calian
Florian Stimberg
Olivia Wiles
Sylvestre-Alvise Rebuffi
András Gyorgy
Timothy A. Mann
Sven Gowal
AAML
339
46
0
02 Apr 2021
Previous
123...676869...757677
Next
Page 68 of 77
Pageof 77