ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.10727
  4. Cited By
Learning Disentangled Phone and Speaker Representations in a
  Semi-Supervised VQ-VAE Paradigm
v1v2 (latest)

Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm

21 October 2020
Jennifer Williams
Yi Zhao
Erica Cooper
Junichi Yamagishi
    SSL
ArXiv (abs)PDFHTML

Papers citing "Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm"

14 / 14 papers shown
Title
Vector Quantized-Elites: Unsupervised and Problem-Agnostic Quality-Diversity Optimization
Vector Quantized-Elites: Unsupervised and Problem-Agnostic Quality-Diversity Optimization
Constantinos Tsakonas
Konstantinos Chatzilygeroudis
69
0
0
10 Apr 2025
Improving Voice Quality in Speech Anonymization With Just
  Perception-Informed Losses
Improving Voice Quality in Speech Anonymization With Just Perception-Informed Losses
Suhita Ghosh
Tim Thiele
Frederic Lorbeer
Frank Dreyer
Sebastian Stober
58
0
0
20 Oct 2024
Exploratory Evaluation of Speech Content Masking
Exploratory Evaluation of Speech Content Masking
Jennifer Williams
Karla Pizzi
Paul-Gauthier Noé
Sneha Das
63
3
0
08 Jan 2024
StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based
  Pre-training for Expressive Audiobook Speech Synthesis
StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis
Xueyuan Chen
Xi Wang
Shaofei Zhang
Lei He
Zhiyong Wu
Xixin Wu
Helen M. Meng
71
8
0
19 Dec 2023
A Two-Stage Training Framework for Joint Speech Compression and
  Enhancement
A Two-Stage Training Framework for Joint Speech Compression and Enhancement
Jiayi Huang
Zeyu Yan
Wenbin Jiang
Fei Wen
49
1
0
08 Sep 2023
Learn to Sing by Listening: Building Controllable Virtual Singer by
  Unsupervised Learning from Voice Recordings
Learn to Sing by Listening: Building Controllable Virtual Singer by Unsupervised Learning from Voice Recordings
Wei Xue
Yiwen Wang
Qi-fei Liu
Yi-Ting Guo
59
1
0
09 May 2023
Disentanglement of Latent Representations via Causal Interventions
Disentanglement of Latent Representations via Causal Interventions
Gaël Gendron
Michael Witbrock
Gillian Dobbie
OODCMLCoGe
131
2
0
02 Feb 2023
Disentangled Feature Learning for Real-Time Neural Speech Coding
Disentangled Feature Learning for Real-Time Neural Speech Coding
Xue Jiang
Xiulian Peng
Yuan Zhang
Yan Lu
SSLDRL
94
12
0
22 Nov 2022
Latent-Domain Predictive Neural Speech Coding
Latent-Domain Predictive Neural Speech Coding
Xue Jiang
Xiulian Peng
Huaying Xue
Yuan Zhang
Yan Lu
73
18
0
18 Jul 2022
Towards Error-Resilient Neural Speech Coding
Towards Error-Resilient Neural Speech Coding
Huaying Xue
Xiulian Peng
Xue Jiang
Yan Lu
60
7
0
03 Jul 2022
Dictionary Attacks on Speaker Verification
Dictionary Attacks on Speaker Verification
Mirko Marras
Pawel Korus
Anubhav Jain
N. Memon
AAML
72
10
0
24 Apr 2022
End-to-End Neural Speech Coding for Real-Time Communications
End-to-End Neural Speech Coding for Real-Time Communications
Xue Jiang
Xiulian Peng
Chengyu Zheng
Huaying Xue
Yuan Zhang
Yan Lu
92
30
0
24 Jan 2022
Preliminary study on using vector quantization latent spaces for TTS/VC
  systems with consistent performance
Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance
Hieu-Thi Luong
Junichi Yamagishi
80
0
0
25 Jun 2021
OCTOPUS: Overcoming Performance andPrivatization Bottlenecks in
  Distributed Learning
OCTOPUS: Overcoming Performance andPrivatization Bottlenecks in Distributed Learning
Shuo Wang
Surya Nepal
Kristen Moore
M. Grobler
Carsten Rudolph
A. Abuadbba
FedML
67
8
0
03 May 2021
1