ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.11676
  4. Cited By
Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis:
  ZeroSpeech 2020 Challenge

Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge

24 May 2020
Andros Tjandra
S. Sakti
Satoshi Nakamura
ArXiv (abs)PDFHTML

Papers citing "Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge"

22 / 22 papers shown
Title
LAST: Language Model Aware Speech Tokenization
LAST: Language Model Aware Speech Tokenization
A. Turetzky
Yossi Adi
83
3
0
05 Sep 2024
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model
  on 100K hours of data
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data
Mateusz Lajszczak
Guillermo Cámbara
Yang Li
Fatih Beyhan
Arent van Korlaar
...
Bartosz Putrycz
Soledad López Gambino
Kayeon Yoo
Elena Sokolova
Thomas Drugman
LM&MA
113
88
0
12 Feb 2024
Modeling Spatio-temporal Dynamical Systems with Neural Discrete Learning
  and Levels-of-Experts
Modeling Spatio-temporal Dynamical Systems with Neural Discrete Learning and Levels-of-Experts
Kun Wang
Hao Wu
Guibin Zhang
Sihang Li
Yuxuan Liang
Yuankai Wu
Roger Zimmermann
Yang Wang
77
11
0
06 Feb 2024
Disentanglement via Latent Quantization
Disentanglement via Latent Quantization
Kyle Hsu
W. Dorrell
James C. R. Whittington
Jiajun Wu
Chelsea Finn
DRL
163
27
0
28 May 2023
Textually Pretrained Speech Language Models
Textually Pretrained Speech Language Models
Michael Hassid
Tal Remez
Tu Nguyen
Itai Gat
Alexis Conneau
...
Alexandre Défossez
Gabriel Synnaeve
Emmanuel Dupoux
Roy Schwartz
Yossi Adi
VLMSyDa
131
61
0
22 May 2023
Self-Organising Neural Discrete Representation Learning à la Kohonen
Self-Organising Neural Discrete Representation Learning à la Kohonen
Kazuki Irie
Róbert Csordás
Jürgen Schmidhuber
SSL
79
1
0
15 Feb 2023
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for
  Universal and Generalized Speech Enhancement
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement
Wei-Ning Hsu
Tal Remez
Bowen Shi
Jacob Donley
Yossi Adi
DiffM
93
12
0
21 Dec 2022
Artificial intelligence approaches for materials-by-design of energetic
  materials: state-of-the-art, challenges, and future directions
Artificial intelligence approaches for materials-by-design of energetic materials: state-of-the-art, challenges, and future directions
Joseph B. Choi
Phong C. H. Nguyen
O. Sen
H. Udaykumar
Stephen Seung-Yeob Baek
PINNAI4CE
71
13
0
15 Nov 2022
Deep Reinforcement Learning with Vector Quantized Encoding
Deep Reinforcement Learning with Vector Quantized Encoding
Liang Zhang
Justin Lieffers
A. Pyarelal
OffRL
58
2
0
12 Nov 2022
Learning an Artificial Language for Knowledge-Sharing in Multilingual
  Translation
Learning an Artificial Language for Knowledge-Sharing in Multilingual Translation
Danni Liu
Jan Niehues
37
5
0
02 Nov 2022
Self-supervised language learning from raw audio: Lessons from the Zero
  Resource Speech Challenge
Self-supervised language learning from raw audio: Lessons from the Zero Resource Speech Challenge
Ewan Dunbar
Nicolas Hamilakis
Emmanuel Dupoux
SSL
84
30
0
27 Oct 2022
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of
  Self-Supervised Speech Representation Learning
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning
Tzu-hsun Feng
Annie Dong
Ching-Feng Yeh
Shu-Wen Yang
Tzu-Quan Lin
...
Xuankai Chang
Shinji Watanabe
Abdel-rahman Mohamed
Shang-Wen Li
Hung-yi Lee
ELMSSL
102
35
0
16 Oct 2022
Augmentation Invariant Discrete Representation for Generative Spoken
  Language Modeling
Augmentation Invariant Discrete Representation for Generative Spoken Language Modeling
Itai Gat
Felix Kreuk
Tu Nguyen
Ann Lee
Jade Copet
Gabriel Synnaeve
Emmanuel Dupoux
Yossi Adi
75
11
0
30 Sep 2022
Self-Supervised Speech Representation Learning: A Review
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSLAI4TS
289
368
0
21 May 2022
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed
  Stochastic Quantization
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization
Yuhta Takida
Takashi Shibuya
Wei-Hsiang Liao
Chieh-Hsin Lai
Junki Ohmura
Toshimitsu Uesaka
Naoki Murata
Shusuke Takahashi
Toshiyuki Kumakura
Yuki Mitsufuji
BDL
85
67
0
16 May 2022
Discrete representations in neural models of spoken language
Discrete representations in neural models of spoken language
Bertrand Higy
Lieke Gelderloos
Afra Alishahi
Grzegorz Chrupała
140
6
0
12 May 2021
Speech Resynthesis from Discrete Disentangled Self-Supervised
  Representations
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations
Adam Polyak
Yossi Adi
Jade Copet
Eugene Kharitonov
Kushal Lakhotia
Wei-Ning Hsu
Abdel-rahman Mohamed
Emmanuel Dupoux
130
318
0
01 Apr 2021
Inducing Meaningful Units from Character Sequences with Dynamic Capacity
  Slot Attention
Inducing Meaningful Units from Character Sequences with Dynamic Capacity Slot Attention
Melika Behjati
James Henderson
OCL
52
1
0
01 Feb 2021
Generative Spoken Language Modeling from Raw Audio
Generative Spoken Language Modeling from Raw Audio
Kushal Lakhotia
Evgeny Kharitonov
Wei-Ning Hsu
Yossi Adi
Adam Polyak
...
Tu Nguyen
Jade Copet
Alexei Baevski
A. Mohamed
Emmanuel Dupoux
AuLLM
292
366
0
01 Feb 2021
The effectiveness of unsupervised subword modeling with autoregressive
  and cross-lingual phone-aware networks
The effectiveness of unsupervised subword modeling with autoregressive and cross-lingual phone-aware networks
Siyuan Feng
O. Scharenborg
SSL
54
3
0
17 Dec 2020
Unsupervised Learning of Disentangled Speech Content and Style
  Representation
Unsupervised Learning of Disentangled Speech Content and Style Representation
Andros Tjandra
Ruoming Pang
Yu Zhang
Shigeki Karita
BDLDRL
73
15
0
24 Oct 2020
The Zero Resource Speech Challenge 2020: Discovering discrete subword
  and word units
The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units
Ewan Dunbar
Julien Karadayi
Mathieu Bernard
Xuan-Nga Cao
Robin Algayres
Lucas Ondel
Laurent Besacier
S. Sakti
Emmanuel Dupoux
SSL
123
61
0
12 Oct 2020
1