Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.11676
Cited By
Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge
24 May 2020
Andros Tjandra
S. Sakti
Satoshi Nakamura
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge"
22 / 22 papers shown
Title
LAST: Language Model Aware Speech Tokenization
A. Turetzky
Yossi Adi
83
3
0
05 Sep 2024
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data
Mateusz Lajszczak
Guillermo Cámbara
Yang Li
Fatih Beyhan
Arent van Korlaar
...
Bartosz Putrycz
Soledad López Gambino
Kayeon Yoo
Elena Sokolova
Thomas Drugman
LM&MA
113
88
0
12 Feb 2024
Modeling Spatio-temporal Dynamical Systems with Neural Discrete Learning and Levels-of-Experts
Kun Wang
Hao Wu
Guibin Zhang
Sihang Li
Yuxuan Liang
Yuankai Wu
Roger Zimmermann
Yang Wang
77
11
0
06 Feb 2024
Disentanglement via Latent Quantization
Kyle Hsu
W. Dorrell
James C. R. Whittington
Jiajun Wu
Chelsea Finn
DRL
163
27
0
28 May 2023
Textually Pretrained Speech Language Models
Michael Hassid
Tal Remez
Tu Nguyen
Itai Gat
Alexis Conneau
...
Alexandre Défossez
Gabriel Synnaeve
Emmanuel Dupoux
Roy Schwartz
Yossi Adi
VLM
SyDa
131
61
0
22 May 2023
Self-Organising Neural Discrete Representation Learning à la Kohonen
Kazuki Irie
Róbert Csordás
Jürgen Schmidhuber
SSL
79
1
0
15 Feb 2023
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement
Wei-Ning Hsu
Tal Remez
Bowen Shi
Jacob Donley
Yossi Adi
DiffM
93
12
0
21 Dec 2022
Artificial intelligence approaches for materials-by-design of energetic materials: state-of-the-art, challenges, and future directions
Joseph B. Choi
Phong C. H. Nguyen
O. Sen
H. Udaykumar
Stephen Seung-Yeob Baek
PINN
AI4CE
71
13
0
15 Nov 2022
Deep Reinforcement Learning with Vector Quantized Encoding
Liang Zhang
Justin Lieffers
A. Pyarelal
OffRL
58
2
0
12 Nov 2022
Learning an Artificial Language for Knowledge-Sharing in Multilingual Translation
Danni Liu
Jan Niehues
37
5
0
02 Nov 2022
Self-supervised language learning from raw audio: Lessons from the Zero Resource Speech Challenge
Ewan Dunbar
Nicolas Hamilakis
Emmanuel Dupoux
SSL
84
30
0
27 Oct 2022
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning
Tzu-hsun Feng
Annie Dong
Ching-Feng Yeh
Shu-Wen Yang
Tzu-Quan Lin
...
Xuankai Chang
Shinji Watanabe
Abdel-rahman Mohamed
Shang-Wen Li
Hung-yi Lee
ELM
SSL
102
35
0
16 Oct 2022
Augmentation Invariant Discrete Representation for Generative Spoken Language Modeling
Itai Gat
Felix Kreuk
Tu Nguyen
Ann Lee
Jade Copet
Gabriel Synnaeve
Emmanuel Dupoux
Yossi Adi
75
11
0
30 Sep 2022
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL
AI4TS
289
368
0
21 May 2022
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization
Yuhta Takida
Takashi Shibuya
Wei-Hsiang Liao
Chieh-Hsin Lai
Junki Ohmura
Toshimitsu Uesaka
Naoki Murata
Shusuke Takahashi
Toshiyuki Kumakura
Yuki Mitsufuji
BDL
85
67
0
16 May 2022
Discrete representations in neural models of spoken language
Bertrand Higy
Lieke Gelderloos
Afra Alishahi
Grzegorz Chrupała
140
6
0
12 May 2021
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations
Adam Polyak
Yossi Adi
Jade Copet
Eugene Kharitonov
Kushal Lakhotia
Wei-Ning Hsu
Abdel-rahman Mohamed
Emmanuel Dupoux
130
318
0
01 Apr 2021
Inducing Meaningful Units from Character Sequences with Dynamic Capacity Slot Attention
Melika Behjati
James Henderson
OCL
52
1
0
01 Feb 2021
Generative Spoken Language Modeling from Raw Audio
Kushal Lakhotia
Evgeny Kharitonov
Wei-Ning Hsu
Yossi Adi
Adam Polyak
...
Tu Nguyen
Jade Copet
Alexei Baevski
A. Mohamed
Emmanuel Dupoux
AuLLM
292
366
0
01 Feb 2021
The effectiveness of unsupervised subword modeling with autoregressive and cross-lingual phone-aware networks
Siyuan Feng
O. Scharenborg
SSL
54
3
0
17 Dec 2020
Unsupervised Learning of Disentangled Speech Content and Style Representation
Andros Tjandra
Ruoming Pang
Yu Zhang
Shigeki Karita
BDL
DRL
73
15
0
24 Oct 2020
The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units
Ewan Dunbar
Julien Karadayi
Mathieu Bernard
Xuan-Nga Cao
Robin Algayres
Lucas Ondel
Laurent Besacier
S. Sakti
Emmanuel Dupoux
SSL
123
61
0
12 Oct 2020
1