From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint

10 May 2020

Papers citing "From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint"

33 / 33 papers shown

Title
Voice Cloning: Comprehensive Survey Hussam Azzuni Abdulmotaleb El Saddik VLM 39 0 0 01 May 2025
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning Shuai Wang Zheng-Shou Chen Kong Aik Lee Yan-min Qian Haizhou Li 37 4 0 21 Jul 2024
HSVRS: A Virtual Reality System of the Hide-and-Seek Game to Enhance Gaze Fixation Ability for Autistic Children Chengyan Yu Shihuan Wang Dong Zhang Yingying Zhang Chao-qun Cen Zhixiang you Xiaobing Zou Hongzhu Deng Ming Li 31 0 0 20 Oct 2023
Timbre-reserved Adversarial Attack in Speaker Identification Qing Wang Jixun Yao Li Lyna Zhang Pengcheng Guo Linfu Xie AAML 27 4 0 02 Sep 2023
The DKU-DUKEECE System for the Manipulation Region Location Task of ADD 2023 Zexin Cai Weiqing Wang Yikang Wang Ming Li 22 6 0 20 Aug 2023
An analysis on the effects of speaker embedding choice in non auto-regressive TTS Adriana Stan Johannah O'Mahony 32 0 0 19 Jul 2023
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation Rongjie Huang Huadai Liu Xize Cheng Yi Ren Lin Li ... Jinzheng He Lichao Zhang Jinglin Liu Xiaoyue Yin Zhou Zhao 67 8 0 24 May 2023
Personalized Lightweight Text-to-Speech: Voice Cloning with Adaptive Structured Pruning Sung-Feng Huang Chia-Ping Chen Zhi-Sheng Chen Yu-Pao Tsai Hung-yi Lee 20 2 0 21 Mar 2023
Waveform Boundary Detection for Partially Spoofed Audio Zexin Cai Weiqing Wang Ming Li 19 25 0 01 Nov 2022
Speaker consistency loss and step-wise optimization for semi-supervised joint training of TTS and ASR using unpaired text data Naoki Makishima Satoshi Suzuki Atsushi Ando Ryo Masumura 142 4 0 11 Jul 2022
TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS Xulong Zhang Jianzong Wang Ning Cheng Jing Xiao 19 14 0 24 May 2022
Karaoker: Alignment-free singing voice synthesis with speech training data Panos Kakoulidis Nikolaos Ellinas G. Vamvoukakis K. Markopoulos June Sig Sung Gunu Jho Pirros Tsiakoulis Aimilios Chalamandaris 10 3 0 08 Apr 2022
A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis Rishabh Jain Mariam Yiwere Dan Bigioi Peter Corcoran H. Cucu 17 14 0 22 Mar 2022
Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module Adam Gabry's Goeric Huybrechts M. Ribeiro C. Chien Julian Roth Giulia Comini Roberto Barra-Chicote Bartek Perz Jaime Lorenzo-Trueba 28 21 0 16 Feb 2022
The MSXF TTS System for ICASSP 2022 ADD Challenge Chunyong Yang Pengfei Liu Yanli Chen Hongbin Wang Min Liu 10 0 0 27 Jan 2022
Generating Adversarial Samples For Training Wake-up Word Detection Systems Against Confusing Words Haoxu Wang Yan Jia Zeqing Zhao Xuyang Wang Junjie Wang Ming Li AAML 14 0 0 01 Jan 2022
Meta-TTS: Meta-Learning for Few-Shot Speaker Adaptive Text-to-Speech Sung-Feng Huang Chyi-Jiunn Lin Da-Rong Liu Yi-Chen Chen Hung-yi Lee 8 56 0 07 Nov 2021
SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines Haozhe Zhang Zexin Cai Xiaoyi Qin Ming Li 52 15 0 06 Nov 2021
Unet-TTS: Improving Unseen Speaker and Style Transfer in One-shot Voice Cloning Rui Li dong Pu Minnie Huang Bill Huang 50 14 0 23 Sep 2021
AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person Xinsheng Wang Qicong Xie Jihua Zhu Lei Xie O. Scharenborg 25 16 0 09 Aug 2021
Speech2Video: Cross-Modal Distillation for Speech to Video Generation Shijing Si Jianzong Wang Xiaoyang Qu Ning Cheng Wenqi Wei Xinghua Zhu Jing Xiao VGen 16 15 0 10 Jul 2021
Msdtron: a high-capability multi-speaker speech synthesis system for diverse data using characteristic information Qinghua Wu Quanbo Shen Jian Luan YuJun Wang 30 3 0 07 Jul 2021
A Survey on Neural Speech Synthesis Xu Tan Tao Qin Frank Soong Tie-Yan Liu AI4TS 18 352 0 29 Jun 2021
Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis Beáta Lőrincz Adriana Stan M. Giurgiu 21 6 0 03 Jun 2021
Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss Yaogen Yang Haozhe Zhang Xiaoyi Qin Shanshan Liang Huahua Cui Mingyang Xu Ming Li 53 4 0 22 Apr 2021
Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech C. Chien Jheng-hao Lin Chien-yu Huang Po-Chun Hsu Hung-yi Lee 16 68 0 06 Mar 2021
Exploring Voice Conversion based Data Augmentation in Text-Dependent Speaker Verification Xiaoyi Qin Yaogen Yang Lin Yang Xuyang Wang Junjie Wang Ming Li 16 0 0 21 Nov 2020
Optimizing voice conversion network with cycle consistency loss of speaker identity Hongqiang Du Xiaohai Tian Lei Xie Haizhou Li 13 17 0 17 Nov 2020
Training Wake Word Detection with Synthesized Speech Data on Confusion Words Yan Jia Zexin Cai Murong Ma Zeqing Zhao Xuyang Wang Junjie Wang Ming Li 11 1 0 03 Nov 2020
AISHELL-3: A Multi-speaker Mandarin TTS Corpus and the Baselines Yao Shi Hui Bu Xin Xu Shaojing Zhang Ming Li 16 218 0 22 Oct 2020
Expressive TTS Training with Frame and Style Reconstruction Loss Rui Liu Berrak Sisman Guanglai Gao Haizhou Li 24 73 0 04 Aug 2020
VoxCeleb2: Deep Speaker Recognition Joon Son Chung Arsha Nagrani Andrew Zisserman 224 2,234 0 14 Jun 2018
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis Ye Jia Yu Zhang Ron J. Weiss Quan Wang Jonathan Shen ... Z. Chen Patrick Nguyen Ruoming Pang Ignacio López Moreno Yonghui Wu 207 820 0 12 Jun 2018