Learning Disentangled Phone and Speaker Representations in a
Semi-Supervised VQ-VAE Paradigm

v1v2 (latest)

Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm

21 October 2020

Jennifer Williams

Junichi Yamagishi

ArXiv (abs)PDF HTML

Papers citing "Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm"

14 / 14 papers shown

Title
Vector Quantized-Elites: Unsupervised and Problem-Agnostic Quality-Diversity Optimization Constantinos Tsakonas Konstantinos Chatzilygeroudis 69 0 0 10 Apr 2025
Improving Voice Quality in Speech Anonymization With Just Perception-Informed Losses Suhita Ghosh Tim Thiele Frederic Lorbeer Frank Dreyer Sebastian Stober 58 0 0 20 Oct 2024
Exploratory Evaluation of Speech Content Masking Jennifer Williams Karla Pizzi Paul-Gauthier Noé Sneha Das 63 3 0 08 Jan 2024
StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis Xueyuan Chen Xi Wang Shaofei Zhang Lei He Zhiyong Wu Xixin Wu Helen M. Meng 71 8 0 19 Dec 2023
A Two-Stage Training Framework for Joint Speech Compression and Enhancement Jiayi Huang Zeyu Yan Wenbin Jiang Fei Wen 49 1 0 08 Sep 2023
Learn to Sing by Listening: Building Controllable Virtual Singer by Unsupervised Learning from Voice Recordings Wei Xue Yiwen Wang Qi-fei Liu Yi-Ting Guo 59 1 0 09 May 2023
Disentanglement of Latent Representations via Causal Interventions Gaël Gendron Michael Witbrock Gillian Dobbie OOD CML CoGe 131 2 0 02 Feb 2023
Disentangled Feature Learning for Real-Time Neural Speech Coding Xue Jiang Xiulian Peng Yuan Zhang Yan Lu SSL DRL 94 12 0 22 Nov 2022
Latent-Domain Predictive Neural Speech Coding Xue Jiang Xiulian Peng Huaying Xue Yuan Zhang Yan Lu 73 18 0 18 Jul 2022
Towards Error-Resilient Neural Speech Coding Huaying Xue Xiulian Peng Xue Jiang Yan Lu 60 7 0 03 Jul 2022
Dictionary Attacks on Speaker Verification Mirko Marras Pawel Korus Anubhav Jain N. Memon AAML 72 10 0 24 Apr 2022
End-to-End Neural Speech Coding for Real-Time Communications Xue Jiang Xiulian Peng Chengyu Zheng Huaying Xue Yuan Zhang Yan Lu 92 30 0 24 Jan 2022
Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance Hieu-Thi Luong Junichi Yamagishi 80 0 0 25 Jun 2021
OCTOPUS: Overcoming Performance andPrivatization Bottlenecks in Distributed Learning Shuo Wang Surya Nepal Kristen Moore M. Grobler Carsten Rudolph A. Abuadbba FedML 67 8 0 03 May 2021