ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.12813
  4. Cited By
Disentangleing Content and Fine-grained Prosody Information via Hybrid
  ASR Bottleneck Features for Voice Conversion

Disentangleing Content and Fine-grained Prosody Information via Hybrid ASR Bottleneck Features for Voice Conversion

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
24 March 2022
Xintao Zhao
Feng Liu
Changhe Song
Zhiyong Wu
Shiyin Kang
Deyi Tuo
Helen Meng
ArXiv (abs)PDFHTMLGithub (5089★)

Papers citing "Disentangleing Content and Fine-grained Prosody Information via Hybrid ASR Bottleneck Features for Voice Conversion"

18 / 18 papers shown
FabasedVC: Enhancing Voice Conversion with Text Modality Fusion and Phoneme-Level SSL Features
FabasedVC: Enhancing Voice Conversion with Text Modality Fusion and Phoneme-Level SSL Features
Wenyu Wang
Zhetao Hu
Yiquan Zhou
Jiacheng Xu
Z. F. Wu
Chen Li
Shihao Li
97
0
0
13 Nov 2025
CoMelSinger: Discrete Token-Based Zero-Shot Singing Synthesis With Structured Melody Control and Guidance
CoMelSinger: Discrete Token-Based Zero-Shot Singing Synthesis With Structured Melody Control and Guidance
Junchuan Zhao
Wei Zeng
Tianle Lyu
Ye Wang
220
2
0
24 Sep 2025
Prosody-Adaptable Audio Codecs for Zero-Shot Voice Conversion via In-Context Learning
Prosody-Adaptable Audio Codecs for Zero-Shot Voice Conversion via In-Context Learning
Junchuan Zhao
Xintong Wang
Ye Wang
190
5
0
21 May 2025
AVENet: Disentangling Features by Approximating Average Features for Voice Conversion
AVENet: Disentangling Features by Approximating Average Features for Voice Conversion
Wenyu Wang
Yiquan Zhou
Jihua Zhu
Hongwu Ding
Jiacheng Xu
Shihao Li
DRL
210
0
0
08 Apr 2025
Singing Voice Conversion with Accompaniment Using Self-Supervised Representation-Based Melody Features
Singing Voice Conversion with Accompaniment Using Self-Supervised Representation-Based Melody FeaturesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Wei Chen
Binzhu Sha
Jing Yang
Zhuo Wang
Fan Fan
Zhikai Wu
529
2
0
07 Feb 2025
Takin-VC: Expressive Zero-Shot Voice Conversion via Adaptive Hybrid Content Encoding and Enhanced Timbre Modeling
Takin-VC: Expressive Zero-Shot Voice Conversion via Adaptive Hybrid Content Encoding and Enhanced Timbre ModelingAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Yuguang Yang
Yu Pan
Jixun Yao
Xiang Zhang
Jianhao Ye
Hongbin Zhou
Lei Xie
Lei Ma
Jianjun Zhao
225
0
0
02 Oct 2024
Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models
Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models
Sijing Chen
Qi Liu
Laipeng He
Tianwei He
Wendi He
...
Huimin Zhang
Xiang Zhang
Guangcheng Zhao
Hongbin Zhou
Pengpeng Zou
355
14
0
18 Sep 2024
RobustSVC: HuBERT-based Melody Extractor and Adversarial Learning for
  Robust Singing Voice Conversion
RobustSVC: HuBERT-based Melody Extractor and Adversarial Learning for Robust Singing Voice ConversionInternational Symposium on Chinese Spoken Language Processing (ISCSLP), 2024
Wei Chen
Xintao Zhao
Jun Chen
Binzhu Sha
Zhiwei Lin
Zhiyong Wu
317
3
0
10 Sep 2024
EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with
  IFUB Estimator and Joint Text-Guided Consistent Learning
EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning
Ziqi Liang
Jianzong Wang
Xulong Zhang
Yong Zhang
Ning Cheng
Jing Xiao
163
3
0
30 Apr 2024
Learning Disentangled Speech Representations with Contrastive Learning
  and Time-Invariant Retrieval
Learning Disentangled Speech Representations with Contrastive Learning and Time-Invariant RetrievalIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Yimin Deng
Huaizhen Tang
Xulong Zhang
Ning Cheng
Jing Xiao
Jianzong Wang
DRL
350
2
0
16 Jan 2024
CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control
  and Contrastive Learning with Negative Samples Augmentation
CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation
Yimin Deng
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
324
3
0
15 Nov 2023
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge
  Distillation and Hybrid Predictive Coding
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive CodingInterspeech (Interspeech), 2023
Ziqian Ning
Yuepeng Jiang
Pengcheng Zhu
Jixun Yao
Shuai Wang
Linfu Xie
Mengxiao Bi
215
15
0
21 May 2023
Adversarial Speaker Disentanglement Using Unannotated External Data for
  Self-supervised Representation Based Voice Conversion
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice ConversionIEEE International Conference on Multimedia and Expo (ICME), 2023
Xintao Zhao
Shuai Wang
Yang Chao
Zhiyong Wu
Helen Meng
183
5
0
16 May 2023
Voice conversion with limited data and limitless data augmentations
Voice conversion with limited data and limitless data augmentations
Olga Slizovskaia
Jordi Janer
Pritish Chandna
Oscar Mayor
128
1
0
27 Dec 2022
Improved disentangled speech representations using contrastive learning
  in factorized hierarchical variational autoencoder
Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoderEuropean Signal Processing Conference (EUSIPCO), 2022
Yuying Xie
Thomas Arildsen
Zheng-Hua Tan
228
3
0
15 Nov 2022
Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion
  of Bottleneck and Perturbation Features
Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation FeaturesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Ziqian Ning
Qicong Xie
Pengcheng Zhu
Zhichao Wang
Liumeng Xue
Jixun Yao
Linfu Xie
Mengxiao Bi
185
28
0
09 Nov 2022
Streaming Voice Conversion Via Intermediate Bottleneck Features And
  Non-streaming Teacher Guidance
Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher GuidanceIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Yuan-Jui Chen
Ming Tu
Tang-Chun Li
Xin Li
Qiuqiang Kong
Jiaxin Li
Zhichao Wang
Qiao Tian
Yuping Wang
Yuxuan Wang
293
17
0
27 Oct 2022
Mockingjay: Unsupervised Speech Representation Learning with Deep
  Bidirectional Transformer Encoders
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer EncodersIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Andy T. Liu
Shu-Wen Yang
Po-Han Chi
Po-Chun Hsu
Hung-yi Lee
SSL
645
394
0
25 Oct 2019
1
Page 1 of 1