ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.13595
  4. Cited By
Adversarial Feature Learning and Unsupervised Clustering based Speech
  Synthesis for Found Data with Acoustic and Textual Noise

Adversarial Feature Learning and Unsupervised Clustering based Speech Synthesis for Found Data with Acoustic and Textual Noise

IEEE Signal Processing Letters (IEEE SPL), 2020
28 April 2020
Shan Yang
Yuxuan Wang
Lei Xie
ArXiv (abs)PDFHTML

Papers citing "Adversarial Feature Learning and Unsupervised Clustering based Speech Synthesis for Found Data with Acoustic and Textual Noise"

7 / 7 papers shown
Preserving background sound in noise-robust voice conversion via
  multi-task learning
Preserving background sound in noise-robust voice conversion via multi-task learningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Jixun Yao
Yi Lei
Qing Wang
Pengcheng Guo
Ziqian Ning
Linfu Xie
Hai Li
Junhui Liu
Danming Xie
243
16
0
06 Nov 2022
AccentSpeech: Learning Accent from Crowd-sourced Data for Target Speaker
  TTS with Accents
AccentSpeech: Learning Accent from Crowd-sourced Data for Target Speaker TTS with AccentsInternational Symposium on Chinese Spoken Language Processing (ISCSLP), 2022
Yongmao Zhang
Zhichao Wang
Pei-Yin Yang
Hongshen Sun
Zhisheng Wang
Linfu Xie
172
7
0
31 Oct 2022
Noise-robust voice conversion with domain adversarial training
Noise-robust voice conversion with domain adversarial trainingNeural Networks (NN), 2022
Hongqiang Du
Lei Xie
Haizhou Li
229
20
0
26 Jan 2022
DDS: A new device-degraded speech dataset for speech enhancement
DDS: A new device-degraded speech dataset for speech enhancement
Haoyu Li
Junichi Yamagishi
295
10
0
16 Sep 2021
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units
Text-Free Image-to-Speech Synthesis Using Learned Segmental UnitsAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Wei-Ning Hsu
David Harwath
Christopher Song
James R. Glass
CLIP
220
74
0
31 Dec 2020
Accent and Speaker Disentanglement in Many-to-many Voice Conversion
Accent and Speaker Disentanglement in Many-to-many Voice ConversionInternational Symposium on Chinese Spoken Language Processing (ISCSLP), 2020
Zhichao Wang
Wenshuo Ge
Xiong Wang
Shan Yang
Wendong Gan
Haitao Chen
Hai Li
Lei Xie
Xiulin Li
CVBM
298
36
0
17 Nov 2020
Data Efficient Voice Cloning from Noisy Samples with Domain Adversarial
  Training
Data Efficient Voice Cloning from Noisy Samples with Domain Adversarial TrainingInterspeech (Interspeech), 2020
Jian Cong
Shan Yang
Lei Xie
Guoqiao Yu
Guanglu Wan
216
33
0
10 Aug 2020
1
Page 1 of 1