CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion

9 April 2019

Papers citing "CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion"


Generative Adversarial Network based Voice Conversion: Techniques, Challenges, and Recent Advancements Sandipan Dhar N. D. Jana Swagatam Das 50 0 0 27 Apr 2025
Supervising 3D Talking Head Avatars with Analysis-by-Audio-Synthesis Radek Daněček Carolin Schmitt Senya Polikovsky Michael J. Black 38 0 0 18 Apr 2025
ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution Shifts Ashi Garg Zexin Cai Lin Zhang Leibny Paola García-Perera Kevin Duh Sanjeev Khudanpur Matthew Wiesner Nicholas Andrews 77 0 0 08 Feb 2025
Voice Conversion-based Privacy through Adversarial Information Hiding J. Webber O. Watts G. Henter Jennifer Williams Simon King 45 0 0 23 Sep 2024
A Novel Deep Learning Technique for Morphology Preserved Fetal ECG Extraction from Mother ECG using 1D-CycleGAN Promit Basak A.K.M. Nazmus Sakib M. Chowdhury N. Al-Emadi Huseyin Cagatay Yalcin S. Pedersen S. Mahmud S. Kiranyaz S. Al-Maadeed 20 18 0 25 Sep 2023
Data Redaction from Conditional Generative Models Zhifeng Kong Kamalika Chaudhuri KELM 26 7 0 18 May 2023
MetaSpeech: Speech Effects Switch Along with Environment for Metaverse Xulong Zhang Jianzong Wang Ning Cheng Jing Xiao 24 1 0 25 Oct 2022
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era Andreas Triantafyllopoulos Björn W. Schuller Gökçe İymen M. Sezgin Xiangheng He ... Shuo Liu Silvan Mertes Elisabeth André Ruibo Fu Jianhua Tao 22 53 0 06 Oct 2022
The Sound of Silence: Efficiency of First Digit Features in Synthetic Audio Detection Daniele Mari Federica Latora Simone Milani 21 11 0 06 Oct 2022
Dance Style Transfer with Cross-modal Transformer Wenjie Yin Hang Yin Kim Baraka Danica Kragic Mårten Björkman 50 23 0 19 Aug 2022
End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions Wonjune Kang M. Hasegawa-Johnson D. Roy 37 8 0 19 May 2022
ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers Kaizhi Qian Yang Zhang Heting Gao Junrui Ni Cheng-I Jeff Lai David D. Cox M. Hasegawa-Johnson Shiyu Chang DRL 30 110 0 20 Apr 2022
Learning the Beauty in Songs: Neural Singing Voice Beautifier Jinglin Liu Chengxi Li Yi Ren Zhiying Zhu Zhou Zhao DiffM 35 16 0 27 Feb 2022
Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models Jen-Hao Rick Chang A. Shrivastava H. Koppula Xiaoshuai Zhang Oncel Tuzel DiffM 51 16 0 06 Oct 2021
MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames Takuhiro Kaneko Hirokazu Kameoka Kou Tanaka Nobukatsu Hojo 38 57 0 25 Feb 2021
Low-resource expressive text-to-speech using data augmentation Goeric Huybrechts Thomas Merritt Giulia Comini Bartek Perz Raahil Shah Jaime Lorenzo-Trueba 26 50 0 11 Nov 2020
Semi-supervised Learning for Singing Synthesis Timbre J. Bonada Merlijn Blaauw 27 4 0 05 Nov 2020
AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance Normalization Yen-Hao Chen Da-Yi Wu Tsung-Han Wu Hung-yi Lee 34 107 0 31 Oct 2020
Unsupervised Learning of Disentangled Speech Content and Style Representation Andros Tjandra Ruoming Pang Yu Zhang Shigeki Karita BDL DRL 23 15 0 24 Oct 2020
GAZEV: GAN-Based Zero-Shot Voice Conversion over Non-parallel Speech Corpus Zining Zhang Bingsheng He Zhenjie Zhang 24 19 0 24 Oct 2020
CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion Takuhiro Kaneko Hirokazu Kameoka Kou Tanaka Nobukatsu Hojo 29 78 0 22 Oct 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning Berrak Sisman Junichi Yamagishi Simon King Haizhou Li BDL 45 318 0 09 Aug 2020
Many-to-Many Voice Conversion using Conditional Cycle-Consistent Adversarial Networks Shindong Lee Bonggu Ko Keonnyeong Lee In-Chul Yoo Dongsuk Yook GAN 30 34 0 15 Feb 2020
Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data Kun Zhou Berrak Sisman Haizhou Li 27 66 0 01 Feb 2020
Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion Wen-Chin Huang Hao Luo Hsin-Te Hwang Chen-Chou Lo Yu-Huai Peng Yu Tsao Hsin-Min Wang DRL 17 42 0 22 Jan 2020
Emotion Filtering at the Edge Ranya Aloufi Hamed Haddadi David E. Boyle 13 19 0 18 Sep 2019
Non-Parallel Sequence-to-Sequence Voice Conversion with Disentangled Linguistic and Speaker Representations Jing-Xuan Zhang Zhenhua Ling Lirong Dai 22 99 0 25 Jun 2019
ET-GAN: Cross-Language Emotion Transfer Based on Cycle-Consistent Generative Adversarial Networks Xiaoqi Jia Jianwei Tai Hang Zhou Yakai Li Weijuan Zhang Haichao Du Qingjia Huang GAN 22 6 0 27 May 2019
Nonparallel Emotional Speech Conversion Jian Gao Deep Chakraborty H. Tembine Olaitan Olaleye 22 68 0 03 Nov 2018
Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network Wenzhe Shi Jose Caballero Ferenc Huszár J. Totz Andrew P. Aitken Rob Bishop Daniel Rueckert Zehan Wang SupR 234 5,181 0 16 Sep 2016