Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1808.09634
Cited By
Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders
29 August 2018
Wen-Chin Huang
Hsin-Te Hwang
Yu-Huai Peng
Yu Tsao
H. Wang
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders"
21 / 21 papers shown
LatentVoiceGrad: Nonparallel Voice Conversion with Latent Diffusion/Flow-Matching Models
IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Yuto Kondo
DiffM
116
0
0
10 Sep 2025
Towards General-Purpose Text-Instruction-Guided Voice Conversion
Automatic Speech Recognition & Understanding (ASRU), 2023
Chun-Yi Kuan
Chen-An Li
Tsung-Yuan Hsu
Tzu-Quan Lin
Ho-Lam Chung
Kai-Wei Chang
Shuo-yiin Chang
Hung-yi Lee
293
13
0
25 Sep 2023
AVQVC: One-shot Voice Conversion by Vector Quantization with applying contrastive learning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Huaizhen Tang
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
200
62
0
21 Feb 2022
Zero-shot Voice Conversion via Self-supervised Prosody Representation Learning
Shijun Wang
Dimche Kostadinov
Damian Borth
276
11
0
27 Oct 2021
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion
Interspeech (Interspeech), 2021
Zongyang Du
Berrak Sisman
Kun Zhou
Haizhou Li
199
32
0
20 Oct 2021
Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer
Automatic Speech Recognition & Understanding (ASRU), 2021
Zongyang Du
Berrak Sisman
Kun Zhou
Haizhou Li
220
25
0
08 Jul 2021
Adversarially learning disentangled speech representations for robust multi-factor voice conversion
Interspeech (Interspeech), 2021
Jie Wang
Jingbei Li
Xintao Zhao
Zhiyong Wu
Shiyin Kang
Helen Meng
DRL
318
32
0
30 Jan 2021
AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance Normalization
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Yen-Hao Chen
Da-Yi Wu
Tsung-Han Wu
Hung-yi Lee
284
122
0
31 Oct 2020
GAZEV: GAN-Based Zero-Shot Voice Conversion over Non-parallel Speech Corpus
Interspeech (Interspeech), 2020
Zining Zhang
Bingsheng He
Zhenjie Zhang
146
21
0
24 Oct 2020
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
Shogo Seki
DiffM
293
25
0
06 Oct 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
450
389
0
09 Aug 2020
Unsupervised Cross-Domain Speech-to-Speech Conversion with Time-Frequency Consistency
M. A. Khan
Fabien Cardinaux
Stefan Uhlich
Marc Ferras
Asja Fischer
109
1
0
15 May 2020
Unsupervised Speech Decomposition via Triple Information Bottleneck
Kaizhi Qian
Yang Zhang
Shiyu Chang
David D. Cox
M. Hasegawa-Johnson
209
200
0
23 Apr 2020
F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Kaizhi Qian
Zeyu Jin
M. Hasegawa-Johnson
G. J. Mysore
161
114
0
15 Apr 2020
Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion
IEEE Transactions on Emerging Topics in Computational Intelligence (IEEE TETCI), 2020
Wen-Chin Huang
Hao Luo
Hsin-Te Hwang
Chen-Chou Lo
Yu-Huai Peng
Yu Tsao
Hsin-Min Wang
DRL
221
43
0
22 Jan 2020
MoEVC: A Mixture-of-experts Voice Conversion System with Sparse Gating Mechanism for Accelerating Online Computation
Yu-Tao Chang
Yuan-Hong Yang
Yu-Huai Peng
Syu-Siang Wang
T. Chi
Yu Tsao
Hsin-Min Wang
MoE
100
0
0
27 Dec 2019
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech
Xin Wang
Junichi Yamagishi
Massimiliano Todisco
Héctor Delgado
A. Nautsch
...
J. Bonastre
Avashna Govender
S. Ronanki
Jing-Xuan Zhang
Zhenhua Ling
424
13
0
05 Nov 2019
Towards Fine-Grained Prosody Control for Voice Conversion
International Symposium on Chinese Spoken Language Processing (ISCSLP), 2019
Zheng Lian
Zhengqi Wen
157
20
0
24 Oct 2019
AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
International Conference on Machine Learning (ICML), 2019
Kaizhi Qian
Yang Zhang
Shiyu Chang
Xuesong Yang
M. Hasegawa-Johnson
379
521
0
14 May 2019
Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion
Interspeech (Interspeech), 2019
Wen-Chin Huang
Yi-Chiao Wu
Chen-Chou Lo
Patrick Lumban Tobing
Tomoki Hayashi
Kazuhiro Kobayashi
Tomoki Toda
Yu Tsao
H. Wang
DRL
216
13
0
02 May 2019
Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion
Wen-Chin Huang
Yi-Chiao Wu
Hsin-Te Hwang
Patrick Lumban Tobing
Tomoki Hayashi
Kazuhiro Kobayashi
Tomoki Toda
Yu Tsao
H. Wang
157
20
0
27 Nov 2018
1
Page 1 of 1