ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1704.00849
  4. Cited By
Voice Conversion from Unaligned Corpora using Variational Autoencoding
  Wasserstein Generative Adversarial Networks

Voice Conversion from Unaligned Corpora using Variational Autoencoding Wasserstein Generative Adversarial Networks

4 April 2017
Chin-Cheng Hsu
Hsin-Te Hwang
Yi-Chiao Wu
Yu Tsao
H. Wang
    DRL
ArXivPDFHTML

Papers citing "Voice Conversion from Unaligned Corpora using Variational Autoencoding Wasserstein Generative Adversarial Networks"

41 / 41 papers shown
Title
Generative Adversarial Network based Voice Conversion: Techniques, Challenges, and Recent Advancements
Generative Adversarial Network based Voice Conversion: Techniques, Challenges, and Recent Advancements
Sandipan Dhar
N. D. Jana
Swagatam Das
43
0
0
27 Apr 2025
A Large-Scale Evaluation of Speech Foundation Models
A Large-Scale Evaluation of Speech Foundation Models
Shu-Wen Yang
Heng-Jui Chang
Zili Huang
Andy T. Liu
Cheng-I Jeff Lai
...
Kushal Lakhotia
Shang-Wen Li
Abdelrahman Mohamed
Shinji Watanabe
Hung-yi Lee
38
19
0
15 Apr 2024
Delivering Speaking Style in Low-resource Voice Conversion with
  Multi-factor Constraints
Delivering Speaking Style in Low-resource Voice Conversion with Multi-factor Constraints
Zhichao Wang
Xinsheng Wang
Linfu Xie
Yuan-Jui Chen
Qiao Tian
Yuping Wang
22
5
0
16 Nov 2022
EmoFake: An Initial Dataset for Emotion Fake Audio Detection
EmoFake: An Initial Dataset for Emotion Fake Audio Detection
Yan Zhao
Jiangyan Yi
J. Tao
Chenglong Wang
Xiaohui Zhang
Yongfeng Dong
24
9
0
10 Nov 2022
DisC-VC: Disentangled and F0-Controllable Neural Voice Conversion
DisC-VC: Disentangled and F0-Controllable Neural Voice Conversion
Chihiro Watanabe
Hirokazu Kameoka
DRL
24
0
0
20 Oct 2022
End-to-End Zero-Shot Voice Conversion with Location-Variable
  Convolutions
End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions
Wonjune Kang
M. Hasegawa-Johnson
D. Roy
24
8
0
19 May 2022
Disentangleing Content and Fine-grained Prosody Information via Hybrid
  ASR Bottleneck Features for Voice Conversion
Disentangleing Content and Fine-grained Prosody Information via Hybrid ASR Bottleneck Features for Voice Conversion
Xintao Zhao
Feng Liu
Changhe Song
Zhiyong Wu
Shiyin Kang
Deyi Tuo
H. Meng
11
20
0
24 Mar 2022
Training Generative Adversarial Networks with Adaptive Composite
  Gradient
Training Generative Adversarial Networks with Adaptive Composite Gradient
Huiqing Qi
Fang Li
Shengli Tan
Xiangyun Zhang
GAN
21
3
0
10 Nov 2021
Disentanglement of Emotional Style and Speaker Identity for Expressive
  Voice Conversion
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion
Zongyang Du
Berrak Sisman
Kun Zhou
Haizhou Li
11
24
0
20 Oct 2021
StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized
  by Automatic Speech Recognition
StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition
Shoki Sakamoto
Akira Taniguchi
T. Taniguchi
Hirokazu Kameoka
BDL
17
5
0
10 Aug 2021
SVSNet: An End-to-end Speaker Voice Similarity Assessment Model
SVSNet: An End-to-end Speaker Voice Similarity Assessment Model
Cheng-Hung Hu
Yu-Huai Peng
Junichi Yamagishi
Yu Tsao
Hsin-Min Wang
24
5
0
20 Jul 2021
VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised
  Speech Representation Disentanglement for One-shot Voice Conversion
VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion
Disong Wang
Liqun Deng
Y. Yeung
Xiao Chen
Xunying Liu
H. Meng
DRL
14
136
0
18 Jun 2021
Emotional Voice Conversion: Theory, Databases and ESD
Emotional Voice Conversion: Theory, Databases and ESD
Kun Zhou
Berrak Sisman
Rui Liu
Haizhou Li
23
167
0
31 May 2021
Adversarial Disentanglement of Speaker Representation for
  Attribute-Driven Privacy Preservation
Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation
Paul-Gauthier Noé
Mohammad MohammadAmini
D. Matrouf
Titouan Parcollet
Andreas Nautsch
J. Bonastre
8
27
0
08 Dec 2020
VAW-GAN for Disentanglement and Recomposition of Emotional Elements in
  Speech
VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech
Kun Zhou
Berrak Sisman
Haizhou Li
DRL
11
40
0
03 Nov 2020
AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and
  Adaptive Instance Normalization
AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance Normalization
Yen-Hao Chen
Da-Yi Wu
Tsung-Han Wu
Hung-yi Lee
13
107
0
31 Oct 2020
CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram
  Conversion
CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Nobukatsu Hojo
21
78
0
22 Oct 2020
An Overview of Voice Conversion and its Challenges: From Statistical
  Modeling to Deep Learning
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
27
316
0
09 Aug 2020
Adversarial representation learning for private speech generation
Adversarial representation learning for private speech generation
David Ericsson
Adam Östberg
Edvin Listo Zec
John Martinsson
Olof Mogren
19
16
0
16 Jun 2020
A Survey on Generative Adversarial Networks: Variants, Applications, and
  Training
A Survey on Generative Adversarial Networks: Variants, Applications, and Training
Abdul Jabbar
Xi Li
Bourahla Omar
25
266
0
09 Jun 2020
Many-to-Many Voice Transformer Network
Many-to-Many Voice Transformer Network
Hirokazu Kameoka
Wen-Chin Huang
Kou Tanaka
Takuhiro Kaneko
Nobukatsu Hojo
T. Toda
ViT
17
30
0
18 May 2020
Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice
  Conversion
Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion
Kun Zhou
Berrak Sisman
Mingyang Zhang
Haizhou Li
17
52
0
13 May 2020
F0-consistent many-to-many non-parallel voice conversion via conditional
  autoencoder
F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder
Kaizhi Qian
Zeyu Jin
M. Hasegawa-Johnson
G. J. Mysore
15
107
0
15 Apr 2020
Many-to-Many Voice Conversion using Conditional Cycle-Consistent
  Adversarial Networks
Many-to-Many Voice Conversion using Conditional Cycle-Consistent Adversarial Networks
Shindong Lee
Bonggu Ko
Keonnyeong Lee
In-Chul Yoo
Dongsuk Yook
GAN
17
33
0
15 Feb 2020
Transforming Spectrum and Prosody for Emotional Voice Conversion with
  Non-Parallel Training Data
Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data
Kun Zhou
Berrak Sisman
Haizhou Li
17
66
0
01 Feb 2020
A Review on Generative Adversarial Networks: Algorithms, Theory, and
  Applications
A Review on Generative Adversarial Networks: Algorithms, Theory, and Applications
Jie Gui
Zhenan Sun
Yonggang Wen
Dacheng Tao
Jieping Ye
EGVM
26
817
0
20 Jan 2020
DNN-based cross-lingual voice conversion using Bottleneck Features
DNN-based cross-lingual voice conversion using Bottleneck Features
M. K. Reddy
K. S. Rao
21
4
0
09 Sep 2019
Non-Parallel Sequence-to-Sequence Voice Conversion with Disentangled
  Linguistic and Speaker Representations
Non-Parallel Sequence-to-Sequence Voice Conversion with Disentangled Linguistic and Speaker Representations
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
22
99
0
25 Jun 2019
Using generative modelling to produce varied intonation for speech
  synthesis
Using generative modelling to produce varied intonation for speech synthesis
Zack Hodari
O. Watts
Simon King
21
29
0
10 Jun 2019
TTS Skins: Speaker Conversion via ASR
TTS Skins: Speaker Conversion via ASR
Adam Polyak
Lior Wolf
Yaniv Taigman
13
27
0
18 Apr 2019
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection
Massimiliano Todisco
Xin Wang
Ville Vestman
Md. Sahidullah
Héctor Delgado
A. Nautsch
Junichi Yamagishi
Nicholas W. D. Evans
Tomi Kinnunen
Kong Aik Lee
14
593
0
09 Apr 2019
Learning a Generative Model of Cancer Metastasis
Learning a Generative Model of Cancer Metastasis
Benjamin Kompa
Beau Coker
MedIm
AI4CE
11
0
0
17 Jan 2019
Unpaired Speech Enhancement by Acoustic and Adversarial Supervision for
  Speech Recognition
Unpaired Speech Enhancement by Acoustic and Adversarial Supervision for Speech Recognition
Jonathan Lee
Michael Laskey
Bo-Kyeong Kim
A. Aswani
Soo-Young Lee
13
17
0
06 Nov 2018
Nonparallel Emotional Speech Conversion
Nonparallel Emotional Speech Conversion
Jian Gao
Deep Chakraborty
H. Tembine
Olaitan Olaleye
14
68
0
03 Nov 2018
ACVAE-VC: Non-parallel many-to-many voice conversion with auxiliary
  classifier variational autoencoder
ACVAE-VC: Non-parallel many-to-many voice conversion with auxiliary classifier variational autoencoder
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
DRL
6
59
0
13 Aug 2018
Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN
  over Phoneme Posteriorgram Sequences
Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences
Cheng-chieh Yeh
Po-Chun Hsu
Ju-Chieh Chou
Hung-yi Lee
Lin-Shan Lee
25
23
0
09 Aug 2018
StarGAN-VC: Non-parallel many-to-many voice conversion with star
  generative adversarial networks
StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
15
370
0
06 Jun 2018
Boosting Noise Robustness of Acoustic Model via Deep Adversarial
  Training
Boosting Noise Robustness of Acoustic Model via Deep Adversarial Training
B. Liu
Shuai Nie
Yaping Zhang
Dengfeng Ke
Shan Liang
Wenju Liu
21
25
0
02 May 2018
The Voice Conversion Challenge 2018: Promoting Development of Parallel
  and Nonparallel Methods
The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods
Jaime Lorenzo-Trueba
Junichi Yamagishi
T. Toda
Daisuke Saito
F. Villavicencio
Tomi Kinnunen
Zhenhua Ling
14
318
0
12 Apr 2018
High-quality nonparallel voice conversion based on cycle-consistent
  adversarial network
High-quality nonparallel voice conversion based on cycle-consistent adversarial network
Fuming Fang
Junichi Yamagishi
Isao Echizen
Jaime Lorenzo-Trueba
GAN
20
136
0
02 Apr 2018
A Multi-Discriminator CycleGAN for Unsupervised Non-Parallel Speech
  Domain Adaptation
A Multi-Discriminator CycleGAN for Unsupervised Non-Parallel Speech Domain Adaptation
Ehsan Hosseini-Asl
Yingbo Zhou
Caiming Xiong
R. Socher
16
54
0
27 Mar 2018
1