Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge

24 May 2020

Papers citing "Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge"

LAST: Language Model Aware Speech Tokenization A. Turetzky Yossi Adi 83 3 0 05 Sep 2024
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data Mateusz Lajszczak Guillermo Cámbara Yang Li Fatih Beyhan Arent van Korlaar ... Bartosz Putrycz Soledad López Gambino Kayeon Yoo Elena Sokolova Thomas Drugman LM&MA 113 88 0 12 Feb 2024
Modeling Spatio-temporal Dynamical Systems with Neural Discrete Learning and Levels-of-Experts Kun Wang Hao Wu Guibin Zhang Sihang Li Yuxuan Liang Yuankai Wu Roger Zimmermann Yang Wang 77 11 0 06 Feb 2024
Disentanglement via Latent Quantization Kyle Hsu W. Dorrell James C. R. Whittington Jiajun Wu Chelsea Finn DRL 163 27 0 28 May 2023
Textually Pretrained Speech Language Models Michael Hassid Tal Remez Tu Nguyen Itai Gat Alexis Conneau ... Alexandre Défossez Gabriel Synnaeve Emmanuel Dupoux Roy Schwartz Yossi Adi VLM SyDa 131 61 0 22 May 2023
Self-Organising Neural Discrete Representation Learning à la Kohonen Kazuki Irie Róbert Csordás Jürgen Schmidhuber SSL 79 1 0 15 Feb 2023
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement Wei-Ning Hsu Tal Remez Bowen Shi Jacob Donley Yossi Adi DiffM 93 12 0 21 Dec 2022
Artificial intelligence approaches for materials-by-design of energetic materials: state-of-the-art, challenges, and future directions Joseph B. Choi Phong C. H. Nguyen O. Sen H. Udaykumar Stephen Seung-Yeob Baek PINN AI4CE 71 13 0 15 Nov 2022
Deep Reinforcement Learning with Vector Quantized Encoding Liang Zhang Justin Lieffers A. Pyarelal OffRL 58 2 0 12 Nov 2022
Learning an Artificial Language for Knowledge-Sharing in Multilingual Translation Danni Liu Jan Niehues 37 5 0 02 Nov 2022
Self-supervised language learning from raw audio: Lessons from the Zero Resource Speech Challenge Ewan Dunbar Nicolas Hamilakis Emmanuel Dupoux SSL 84 30 0 27 Oct 2022
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning Tzu-hsun Feng Annie Dong Ching-Feng Yeh Shu-Wen Yang Tzu-Quan Lin ... Xuankai Chang Shinji Watanabe Abdel-rahman Mohamed Shang-Wen Li Hung-yi Lee ELM SSL 102 35 0 16 Oct 2022
Augmentation Invariant Discrete Representation for Generative Spoken Language Modeling Itai Gat Felix Kreuk Tu Nguyen Ann Lee Jade Copet Gabriel Synnaeve Emmanuel Dupoux Yossi Adi 75 11 0 30 Sep 2022
Self-Supervised Speech Representation Learning: A Review Abdel-rahman Mohamed Hung-yi Lee Lasse Borgholt Jakob Drachmann Havtorn Joakim Edin ... Shang-Wen Li Karen Livescu Lars Maaløe Tara N. Sainath Shinji Watanabe SSL AI4TS 289 368 0 21 May 2022
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization Yuhta Takida Takashi Shibuya Wei-Hsiang Liao Chieh-Hsin Lai Junki Ohmura Toshimitsu Uesaka Naoki Murata Shusuke Takahashi Toshiyuki Kumakura Yuki Mitsufuji BDL 85 67 0 16 May 2022
Discrete representations in neural models of spoken language Bertrand Higy Lieke Gelderloos Afra Alishahi Grzegorz Chrupała 140 6 0 12 May 2021
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations Adam Polyak Yossi Adi Jade Copet Eugene Kharitonov Kushal Lakhotia Wei-Ning Hsu Abdel-rahman Mohamed Emmanuel Dupoux 130 318 0 01 Apr 2021
Inducing Meaningful Units from Character Sequences with Dynamic Capacity Slot Attention Melika Behjati James Henderson OCL 52 1 0 01 Feb 2021
Generative Spoken Language Modeling from Raw Audio Kushal Lakhotia Evgeny Kharitonov Wei-Ning Hsu Yossi Adi Adam Polyak ... Tu Nguyen Jade Copet Alexei Baevski A. Mohamed Emmanuel Dupoux AuLLM 292 366 0 01 Feb 2021
The effectiveness of unsupervised subword modeling with autoregressive and cross-lingual phone-aware networks Siyuan Feng O. Scharenborg SSL 54 3 0 17 Dec 2020
Unsupervised Learning of Disentangled Speech Content and Style Representation Andros Tjandra Ruoming Pang Yu Zhang Shigeki Karita BDL DRL 73 15 0 24 Oct 2020
The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units Ewan Dunbar Julien Karadayi Mathieu Bernard Xuan-Nga Cao Robin Algayres Lucas Ondel Laurent Besacier S. Sakti Emmanuel Dupoux SSL 123 61 0 12 Oct 2020