ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1808.09634
  4. Cited By
Voice Conversion Based on Cross-Domain Features Using Variational Auto
  Encoders

Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders

29 August 2018
Wen-Chin Huang
Hsin-Te Hwang
Yu-Huai Peng
Yu Tsao
H. Wang
ArXiv (abs)PDFHTML

Papers citing "Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders"

21 / 21 papers shown
LatentVoiceGrad: Nonparallel Voice Conversion with Latent Diffusion/Flow-Matching Models
LatentVoiceGrad: Nonparallel Voice Conversion with Latent Diffusion/Flow-Matching ModelsIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Yuto Kondo
DiffM
116
0
0
10 Sep 2025
Towards General-Purpose Text-Instruction-Guided Voice Conversion
Towards General-Purpose Text-Instruction-Guided Voice ConversionAutomatic Speech Recognition & Understanding (ASRU), 2023
Chun-Yi Kuan
Chen-An Li
Tsung-Yuan Hsu
Tzu-Quan Lin
Ho-Lam Chung
Kai-Wei Chang
Shuo-yiin Chang
Hung-yi Lee
293
13
0
25 Sep 2023
AVQVC: One-shot Voice Conversion by Vector Quantization with applying
  contrastive learning
AVQVC: One-shot Voice Conversion by Vector Quantization with applying contrastive learningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Huaizhen Tang
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
200
62
0
21 Feb 2022
Zero-shot Voice Conversion via Self-supervised Prosody Representation
  Learning
Zero-shot Voice Conversion via Self-supervised Prosody Representation Learning
Shijun Wang
Dimche Kostadinov
Damian Borth
276
11
0
27 Oct 2021
Disentanglement of Emotional Style and Speaker Identity for Expressive
  Voice Conversion
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice ConversionInterspeech (Interspeech), 2021
Zongyang Du
Berrak Sisman
Kun Zhou
Haizhou Li
199
32
0
20 Oct 2021
Expressive Voice Conversion: A Joint Framework for Speaker Identity and
  Emotional Style Transfer
Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style TransferAutomatic Speech Recognition & Understanding (ASRU), 2021
Zongyang Du
Berrak Sisman
Kun Zhou
Haizhou Li
220
25
0
08 Jul 2021
Adversarially learning disentangled speech representations for robust
  multi-factor voice conversion
Adversarially learning disentangled speech representations for robust multi-factor voice conversionInterspeech (Interspeech), 2021
Jie Wang
Jingbei Li
Xintao Zhao
Zhiyong Wu
Shiyin Kang
Helen Meng
DRL
318
32
0
30 Jan 2021
AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and
  Adaptive Instance Normalization
AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance NormalizationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Yen-Hao Chen
Da-Yi Wu
Tsung-Han Wu
Hung-yi Lee
284
122
0
31 Oct 2020
GAZEV: GAN-Based Zero-Shot Voice Conversion over Non-parallel Speech
  Corpus
GAZEV: GAN-Based Zero-Shot Voice Conversion over Non-parallel Speech CorpusInterspeech (Interspeech), 2020
Zining Zhang
Bingsheng He
Zhenjie Zhang
146
21
0
24 Oct 2020
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed
  Langevin Dynamics
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
Shogo Seki
DiffM
293
25
0
06 Oct 2020
An Overview of Voice Conversion and its Challenges: From Statistical
  Modeling to Deep Learning
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep LearningIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
450
389
0
09 Aug 2020
Unsupervised Cross-Domain Speech-to-Speech Conversion with
  Time-Frequency Consistency
Unsupervised Cross-Domain Speech-to-Speech Conversion with Time-Frequency Consistency
M. A. Khan
Fabien Cardinaux
Stefan Uhlich
Marc Ferras
Asja Fischer
109
1
0
15 May 2020
Unsupervised Speech Decomposition via Triple Information Bottleneck
Unsupervised Speech Decomposition via Triple Information Bottleneck
Kaizhi Qian
Yang Zhang
Shiyu Chang
David D. Cox
M. Hasegawa-Johnson
209
200
0
23 Apr 2020
F0-consistent many-to-many non-parallel voice conversion via conditional
  autoencoder
F0-consistent many-to-many non-parallel voice conversion via conditional autoencoderIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Kaizhi Qian
Zeyu Jin
M. Hasegawa-Johnson
G. J. Mysore
161
114
0
15 Apr 2020
Unsupervised Representation Disentanglement using Cross Domain Features
  and Adversarial Learning in Variational Autoencoder based Voice Conversion
Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice ConversionIEEE Transactions on Emerging Topics in Computational Intelligence (IEEE TETCI), 2020
Wen-Chin Huang
Hao Luo
Hsin-Te Hwang
Chen-Chou Lo
Yu-Huai Peng
Yu Tsao
Hsin-Min Wang
DRL
221
43
0
22 Jan 2020
MoEVC: A Mixture-of-experts Voice Conversion System with Sparse Gating
  Mechanism for Accelerating Online Computation
MoEVC: A Mixture-of-experts Voice Conversion System with Sparse Gating Mechanism for Accelerating Online Computation
Yu-Tao Chang
Yuan-Hong Yang
Yu-Huai Peng
Syu-Siang Wang
T. Chi
Yu Tsao
Hsin-Min Wang
MoE
100
0
0
27 Dec 2019
ASVspoof 2019: A large-scale public database of synthesized, converted
  and replayed speech
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech
Xin Wang
Junichi Yamagishi
Massimiliano Todisco
Héctor Delgado
A. Nautsch
...
J. Bonastre
Avashna Govender
S. Ronanki
Jing-Xuan Zhang
Zhenhua Ling
424
13
0
05 Nov 2019
Towards Fine-Grained Prosody Control for Voice Conversion
Towards Fine-Grained Prosody Control for Voice ConversionInternational Symposium on Chinese Spoken Language Processing (ISCSLP), 2019
Zheng Lian
Zhengqi Wen
157
20
0
24 Oct 2019
AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder LossInternational Conference on Machine Learning (ICML), 2019
Kaizhi Qian
Yang Zhang
Shiyu Chang
Xuesong Yang
M. Hasegawa-Johnson
379
521
0
14 May 2019
Investigation of F0 conditioning and Fully Convolutional Networks in
  Variational Autoencoder based Voice Conversion
Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice ConversionInterspeech (Interspeech), 2019
Wen-Chin Huang
Yi-Chiao Wu
Chen-Chou Lo
Patrick Lumban Tobing
Tomoki Hayashi
Kazuhiro Kobayashi
Tomoki Toda
Yu Tsao
H. Wang
DRL
216
13
0
02 May 2019
Refined WaveNet Vocoder for Variational Autoencoder Based Voice
  Conversion
Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion
Wen-Chin Huang
Yi-Chiao Wu
Hsin-Te Hwang
Patrick Lumban Tobing
Tomoki Hayashi
Kazuhiro Kobayashi
Tomoki Toda
Yu Tsao
H. Wang
157
20
0
27 Nov 2018
1
Page 1 of 1