v1v2 (latest)

Neural Discrete Representation Learning

2 November 2017

Papers citing "Neural Discrete Representation Learning"

50 / 3,807 papers shown

Unsupervised Acoustic Unit Discovery by Leveraging a Language-Independent Subword Discriminative Feature RepresentationInterspeech (Interspeech), 2021

Siyuan Feng

Piotr Żelasko

Laureano Moro-Velazquez

O. Scharenborg

181

02 Apr 2021

Speech Resynthesis from Discrete Disentangled Self-Supervised RepresentationsInterspeech (Interspeech), 2021

Yossi Adi

461

371

01 Apr 2021

A Closer Look at Fourier Spectrum Discrepancies for CNN-generated Images DetectionComputer Vision and Pattern Recognition (CVPR), 2021

Keshigeyan Chandrasegaran

Ngoc-Trung Tran

Ngai-Man Cheung

211

31 Mar 2021

Unsupervised Disentanglement of Linear-Encoded Facial SemanticsComputer Vision and Pattern Recognition (CVPR), 2021

143

30 Mar 2021

PixelTransformer: Sample Conditioned Signal GenerationInternational Conference on Machine Learning (ICML), 2021

Shubham Tulsiani

Abhinav Gupta

167

29 Mar 2021

Scalable and Efficient Neural Speech Coding: A Hybrid DesignIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021

237

27 Mar 2021

Decomposing Normal and Abnormal Features of Medical Images into Discrete Latent Codes for Content-Based Image Retrieval

272

23 Mar 2021

Tiny Transformers for Environmental Sound Classification at the Edge

156

22 Mar 2021

Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAEComputer Vision and Pattern Recognition (CVPR), 2021

184

232

18 Mar 2021

Variable-rate discrete representation learning

209

10 Mar 2021

Wav2vec-C: A Self-supervised Model for Speech Representation LearningInterspeech (Interspeech), 2021

Samik Sadhu

Di He

Che-Wei Huang

Sri Harish Reddy Mallidi

217

09 Mar 2021

Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive ModelsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021

727

630

08 Mar 2021

Learning to Generate 3D Shapes with Generative Cellular AutomataInternational Conference on Learning Representations (ICLR), 2021

155

06 Mar 2021

Generating Images with Sparse RepresentationsInternational Conference on Machine Learning (ICML), 2021

236

269

05 Mar 2021

crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational AutoencoderIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Kazuhiro Kobayashi

Wen-Chin Huang

Yi-Chiao Wu

Patrick Lumban Tobing

Tomoki Hayashi

Tomoki Toda

BDL DRL

150

04 Mar 2021

Enabling Visual Action Planning for Object Manipulation through Latent Space RoadmapIEEE Transactions on robotics (TRO), 2021

Danica Kragic

266

03 Mar 2021

Predicting Video with VQVAE

Jacob Walker

Ali Razavi

Aaron van den Oord

DRL

247

02 Mar 2021

A survey on Variational Autoencoders from a GreenAI perspectiveSN Computer Science (SN Comput. Sci.), 2021

193

01 Mar 2021

M6: A Chinese Multimodal Pretrainer

Rui Men

...

Yong Li

Jialin Li

Jingren Zhou

J. Tang

Hongxia Yang

VLM MoE

347

147

01 Mar 2021

Zero-Shot Text-to-Image GenerationInternational Conference on Machine Learning (ICML), 2021

829

6,024

24 Feb 2021

Unsupervised Brain Anomaly Detection and Segmentation with TransformersInternational Conference on Medical Imaging with Deep Learning (MIDL), 2021

Sebastien Ourselin

149

23 Feb 2021

Anytime Sampling for Autoregressive Models via Ordered AutoencodingInternational Conference on Learning Representations (ICLR), 2021

189

23 Feb 2021

Uncertainty Estimation Using Riemannian Model Dynamics for Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2021

Guy Tennenholtz

Shie Mannor

OffRL

227

22 Feb 2021

Improving Lossless Compression Rates via Monte Carlo Bits-Back CodingInternational Conference on Machine Learning (ICML), 2021

Yangjun Ruan

Karen Ullrich

Daniel de Souza Severo

Ashish Khisti

238

22 Feb 2021

Measuring the Stability of Learned Features

Kris Sankaran

OOD

20 Feb 2021

Preventing Oversmoothing in VAE via Generalized Variance ParameterizationNeurocomputing (Neurocomputing), 2021

219

17 Feb 2021

Enhancing into the codec: Noise Robust Speech Coding with Vector-Quantized AutoencodersIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

200

12 Feb 2021

VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention

183

12 Feb 2021

Self-Supervised VQ-VAE for One-Shot Music Style TransferIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

224

10 Feb 2021

Inducing Meaningful Units from Character Sequences with Dynamic Capacity Slot Attention

Melika Behjati

James Henderson

OCL

171

01 Feb 2021

Generative Spoken Language Modeling from Raw AudioTransactions of the Association for Computational Linguistics (TACL), 2021

Yossi Adi

...

643

439

01 Feb 2021

CNN with large memory layers

223

27 Jan 2021

Disentangled Sequence Clustering for Human Intention InferenceIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021

Mark Zolotas

Y. Demiris

DRL

338

23 Jan 2021

Hierarchical disentangled representation learning for singing voice conversionIEEE International Joint Conference on Neural Network (IJCNN), 2021

181

18 Jan 2021

Cauchy-Schwarz Regularized AutoencoderJournal of machine learning research (JMLR), 2021

245

06 Jan 2021

HAVANA: Hierarchical and Variation-Normalized Autoencoder for Person Re-identification

267

06 Jan 2021

Psychoacoustic Calibration of Loss Functions for Efficient End-to-End Neural Audio CodingIEEE Signal Processing Letters (IEEE SPL), 2020

206

31 Dec 2020

Discovering Dialog Structure Graph for Open-Domain Dialog Generation

193

31 Dec 2020

Text-Free Image-to-Speech Synthesis Using Learned Segmental UnitsAnnual Meeting of the Association for Computational Linguistics (ACL), 2020

191

31 Dec 2020

Interpretable NLG for Task-oriented Dialogue Systems with Heterogeneous Rendering MachinesAAAI Conference on Artificial Intelligence (AAAI), 2020

Yangming Li

Kaisheng Yao

188

29 Dec 2020

A Survey on Visual TransformerIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020

...

1.1K

3,127

23 Dec 2020

Motif-Driven Contrastive Learning of Graph RepresentationsIEEE Transactions on Knowledge and Data Engineering (TKDE), 2020

244

23 Dec 2020

OBoW: Online Bag-of-Visual-Words Generation for Self-Supervised LearningComputer Vision and Pattern Recognition (CVPR), 2020

297

21 Dec 2020

Exploring Fluent Query Reformulations with Text-to-Text Transformers and Reinforcement Learning

Jerry Zikun Chen

S. Yu

Haoran Wang

913

18 Dec 2020

Taming Transformers for High-Resolution Image SynthesisComputer Vision and Pattern Recognition (CVPR), 2020

740

3,822

17 Dec 2020

The effectiveness of unsupervised subword modeling with autoregressive and cross-lingual phone-aware networksIEEE Open Journal of Signal Processing (JOSP), 2020

Siyuan Feng

O. Scharenborg

SSL

222

17 Dec 2020

Computational principles of intelligence: learning and reasoning with neural networks

Abel Torres Montoya

PINN AI4CE

117

17 Dec 2020

Planning from Pixels in Atari with Learned Symbolic RepresentationsAAAI Conference on Artificial Intelligence (AAAI), 2020

Andrea Dittadi

Frederik K. Drachmann

Thomas Bolander

358

16 Dec 2020

Unsupervised Learning of Global Factors in Deep Generative ModelsPattern Recognition (Pattern Recognit.), 2020

I. Peis

Pablo M. Olmos

Antonio Artés-Rodríguez

BDL DRL

220

15 Dec 2020

Towards unsupervised phone and word segmentation using self-supervised vector-quantized neural networksInterspeech (Interspeech), 2020

Herman Kamper

Benjamin van Niekerk

SSL MQ

297

14 Dec 2020