Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders

29 August 2018

Papers citing "Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders"

21 / 21 papers shown

LatentVoiceGrad: Nonparallel Voice Conversion with Latent Diffusion/Flow-Matching Models

IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025

116

10 Sep 2025

Towards General-Purpose Text-Instruction-Guided Voice Conversion

Automatic Speech Recognition & Understanding (ASRU), 2023

Hung-yi Lee

293

25 Sep 2023

AVQVC: One-shot Voice Conversion by Vector Quantization with applying contrastive learning

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

200

21 Feb 2022

Zero-shot Voice Conversion via Self-supervised Prosody Representation Learning

Shijun Wang

Dimche Kostadinov

Damian Borth

276

27 Oct 2021

Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion

Interspeech, 2021

Zongyang Du

Berrak Sisman

Kun Zhou

Haizhou Li

199

20 Oct 2021

Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer

Automatic Speech Recognition & Understanding (ASRU), 2021

Zongyang Du

Berrak Sisman

Kun Zhou

Haizhou Li

220

08 Jul 2021

Adversarially learning disentangled speech representations for robust multi-factor voice conversion

Interspeech, 2021

Jie Wang

Jingbei Li

Xintao Zhao

Zhiyong Wu

Shiyin Kang

Helen Meng

DRL

318

30 Jan 2021

AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance Normalization

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

284

122

31 Oct 2020

GAZEV: GAN-Based Zero-Shot Voice Conversion over Non-parallel Speech Corpus

Interspeech, 2020

Zining Zhang

Bingsheng He

Zhenjie Zhang

146

24 Oct 2020

VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics

293

06 Oct 2020

An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning

IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020

Haizhou Li

450

389

09 Aug 2020

Unsupervised Cross-Domain Speech-to-Speech Conversion with Time-Frequency Consistency

109

15 May 2020

Unsupervised Speech Decomposition via Triple Information Bottleneck

Kaizhi Qian

209

200

23 Apr 2020

F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Kaizhi Qian

Zeyu Jin

M. Hasegawa-Johnson

G. J. Mysore

161

114

15 Apr 2020

Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion

IEEE Transactions on Emerging Topics in Computational Intelligence (IEEE TETCI), 2020

221

22 Jan 2020

MoEVC: A Mixture-of-experts Voice Conversion System with Sparse Gating Mechanism for Accelerating Online Computation

100

27 Dec 2019

ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech

Xin Wang

...

424

05 Nov 2019

Towards Fine-Grained Prosody Control for Voice Conversion

International Symposium on Chinese Spoken Language Processing (ISCSLP), 2019

Zheng Lian

Zhengqi Wen

157

24 Oct 2019

AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

International Conference on Machine Learning (ICML), 2019

Kaizhi Qian

379

521

14 May 2019

Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion

Interspeech, 2019

Wen-Chin Huang

Yi-Chiao Wu

Chen-Chou Lo

Patrick Lumban Tobing

216

02 May 2019

Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion

Wen-Chin Huang

Yi-Chiao Wu

Hsin-Te Hwang

Patrick Lumban Tobing

157

27 Nov 2018