ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.05325
  4. Cited By
LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice
  Conversion with Singer Guidance

LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice Conversion with Singer Guidance

Interspeech (Interspeech), 2024
8 June 2024
Shihao Chen
Yu Gu
Jie Zhang
Na Li
Rilin Chen
Liping Chen
Lirong Dai
    DiffM
ArXiv (abs)PDFHTML

Papers citing "LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice Conversion with Singer Guidance"

9 / 9 papers shown
Title
CartoonSing: Unifying Human and Nonhuman Timbres in Singing Generation
CartoonSing: Unifying Human and Nonhuman Timbres in Singing Generation
Jionghao Han
Jiatong Shi
Zhuoyan Tao
Yuxun Tang
Yiwen Zhao
Gus Xia
Shinji Watanabe
84
0
0
26 Nov 2025
HQ-SVC: Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios
HQ-SVC: Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios
Bingsong Bai
Yizhong Geng
Fengping Wang
Cong Wang
Puyuan Guo
Yingming Gao
Ya Li
171
1
0
11 Nov 2025
DiffRhythm+: Controllable and Flexible Full-Length Song Generation with Preference Optimization
DiffRhythm+: Controllable and Flexible Full-Length Song Generation with Preference Optimization
Huakang Chen
Yuepeng Jiang
Guobin Ma
Chunbo Hao
Shuai Wang
Jixun Yao
Ziqian Ning
Meng Meng
Jian Luan
Lei Xie
DiffM
172
6
0
17 Jul 2025
SingNet: Towards a Large-Scale, Diverse, and In-the-Wild Singing Voice Dataset
SingNet: Towards a Large-Scale, Diverse, and In-the-Wild Singing Voice Dataset
Yicheng Gu
Chaoren Wang
Jing Zhang
Xueyao Zhang
Zihao Fang
Haorui He
Zhizheng Wu
193
7
0
14 May 2025
A Survey on Cross-Modal Interaction Between Music and Multimodal Data
A Survey on Cross-Modal Interaction Between Music and Multimodal Data
Sifei Li
Mining Tan
Feier Shen
Minyan Luo
Zijiao Yin
Fan Tang
Weiming Dong
Changsheng Xu
249
1
0
17 Apr 2025
USM-VC: Mitigating Timbre Leakage with Universal Semantic Mapping Residual Block for Voice Conversion
USM-VC: Mitigating Timbre Leakage with Universal Semantic Mapping Residual Block for Voice Conversion
Na Li
Chuke Wang
Yu Gu
Zhifeng Li
386
0
0
11 Apr 2025
Seed-Music: A Unified Framework for High Quality and Controlled Music
  Generation
Seed-Music: A Unified Framework for High Quality and Controlled Music Generation
Ye Bai
Haonan Chen
Jitong Chen
Zhuo Chen
Yi Deng
...
Hang Zhao
Ziyi Zhao
Dejian Zhong
Shicen Zhou
Pei Zou
DiffM
234
15
0
13 Sep 2024
LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling
LHQ-SVC: Lightweight and High Quality Singing Voice Conversion ModelingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Yubo Huang
Xin Lai
Muyang Ye
Anran Zhu
Zixi Wang
Jingzehua Xu
Shuai Zhang
Zhiyuan Zhou
Weijie Niu
236
5
0
13 Sep 2024
LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with
  Inference Acceleration via Latent Consistency Distillation
LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency DistillationInternational Symposium on Chinese Spoken Language Processing (ISCSLP), 2024
Shihao Chen
Yu Gu
Jianwei Cui
Jie Zhang
Rilin Chen
Lirong Dai
101
3
0
22 Aug 2024
1