v1v2 (latest)

DeepSinger: Singing Voice Synthesis with Data Mined From the Web

Knowledge Discovery and Data Mining (KDD), 2020

9 July 2020

Xu Tan

Zhou Zhao

Papers citing "DeepSinger: Singing Voice Synthesis with Data Mined From the Web"

50 / 50 papers shown

DiTSinger: Scaling Singing Voice Synthesis with Diffusion Transformer and Implicit Alignment

...

417

10 Oct 2025

CoMelSinger: Discrete Token-Based Zero-Shot Singing Synthesis With Structured Melody Control and Guidance

215

24 Sep 2025

Mamba2 Meets Silence: Robust Vocal Source Separation for Sparse Regions

Euiyeon Kim

Yong-Hoon Choi

Mamba

358

20 Aug 2025

SingNet: Towards a Large-Scale, Diverse, and In-the-Wild Singing Voice Dataset

297

14 May 2025

TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow MatchingAAAI Conference on Artificial Intelligence (AAAI), 2025

1.1K

18 Feb 2025

RDSinger: Reference-based Diffusion Network for Singing Voice Synthesis

269

29 Oct 2024

ConSinger: Efficient High-Fidelity Singing Voice Generation with Minimal StepsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

343

20 Oct 2024

SiFiSinger: A High-Fidelity End-to-End Singing Voice Synthesizer based on Source-filter ModelIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Chao Weng

198

16 Oct 2024

Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models

Rajesh Sharma

S. R Mahadeva Prasanna

242

21 Sep 2024

MakeSinger: A Semi-Supervised Training Method for Data-Efficient Singing Voice Synthesis via Classifier-free Diffusion GuidanceInterspeech (Interspeech), 2024

Nam Soo Kim

345

10 Jun 2024

RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text

353

30 May 2024

Robust Singing Voice Transcription Serves SynthesisAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Rongjie Huang

Zhou Zhao

402

16 May 2024

Singing Voice Data Scaling-up: An Introduction to ACE-Opencpop and ACE-KiSing

Jiatong Shi

Yueqian Lin

Xinyi Bai

Keyi Zhang

Yuning Wu

Yuxun Tang

Yifeng Yu

Qin Jin

Shinji Watanabe

326

31 Jan 2024

FreGrad: Lightweight and Fast Frequency-aware Diffusion Vocoder

Ji-Hoon Kim

Joon Son Chung

305

18 Jan 2024

MusicAgent: An AI Agent for Music Understanding and Generation with Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Dingyao Yu

Kaitao Song

Peiling Lu

Tianyu He

Xu Tan

Wei Ye

Shikun Zhang

Jiang Bian

LLMAG

399

18 Oct 2023

BiSinger: Bilingual Singing Voice SynthesisAutomatic Speech Recognition & Understanding (ASRU), 2023

268

25 Sep 2023

FSD: An Initial Chinese Dataset for Fake Song DetectionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Yuankun Xie

259

05 Sep 2023

Enhancing the vocal range of single-speaker singing voice synthesis with melody-unsupervised pre-trainingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Zhiyong Wu

163

01 Sep 2023

Towards Improving the Expressiveness of Singing Voice Synthesis with BERT Derived Semantic InformationInterspeech (Interspeech), 2022

Zhiyong Wu

Shiyin Kang

Helen Meng

271

31 Aug 2023

Elucidate Gender Fairness in Singing Voice TranscriptionACM Multimedia (ACM MM), 2023

Xiangming Gu

Weizhen Zeng

Ye Wang

296

05 Aug 2023

A Systematic Exploration of Joint-training for Singing Voice SynthesisInternational Symposium on Chinese Spoken Language Processing (ISCSLP), 2023

Yuning Wu

Yifeng Yu

Jiatong Shi

Tao Qian

Qin Jin

291

05 Aug 2023

RMSSinger: Realistic-Music-Score based Singing Voice SynthesisAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Rongjie Huang

Zhou Zhao

283

18 May 2023

Translate the Beauty in Songs: Jointly Learning to Align Melody and Translate LyricsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

233

28 Mar 2023

PHONEix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation with Phoneme Distribution PredictorIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Yuning Wu

Jiatong Shi

Tao Qian

Dongji Gao

Qin Jin

239

15 Mar 2023

UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice SynthesisAAAI Conference on Artificial Intelligence (AAAI), 2022

Shan Yang

256

03 Dec 2022

VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing SynthesizerInterspeech (Interspeech), 2022

328

05 Nov 2022

Singing Voice Synthesis with Vibrato Modeling and Latent Energy RepresentationIEEE International Workshop on Multimedia Signal Processing (MMSP), 2022

236

02 Nov 2022

DDSP-based Singing Vocoders: A New Subtractive-based Synthesizer and A Comprehensive EvaluationInternational Society for Music Information Retrieval Conference (ISMIR), 2022

310

09 Aug 2022

SUSing: SU-net for Singing Voice SynthesisIEEE International Joint Conference on Neural Network (IJCNN), 2022

241

24 May 2022

SingAug: Data Augmentation for Singing Voice Synthesis with Cycle-consistent Training StrategyInterspeech (Interspeech), 2022

Shuai Guo

Jiatong Shi

Tao Qian

Shinji Watanabe

Qin Jin

320

31 Mar 2022

WeSinger: Data-augmented Singing Voice Synthesis with Auxiliary LossesInterspeech (Interspeech), 2022

667

21 Mar 2022

Learning the Beauty in Songs: Neural Singing Voice BeautifierAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Zhou Zhao

286

27 Feb 2022

Deep Performer: Score-to-Audio Music Performance SynthesisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Hao-Wen Dong

Cong Zhou

Taylor Berg-Kirkpatrick

Julian McAuley

354

12 Feb 2022

Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale CorpusACM Multimedia (MM), 2021

Rongjie Huang

Zhou Zhao

289

129

20 Dec 2021

Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control

...

Aimilios Chalamandaris

201

17 Nov 2021

RefineGAN: Universally Generating Waveform Better than Ground Truth with Highly Accurate Pitch and Intensity ResponsesInterspeech (Interspeech), 2021

Shengyuan Xu

Wenxiao Zhao

Jing Guo

349

01 Nov 2021

VISinger: Variational Inference with Adversarial Learning for End-to-End Singing Voice Synthesis

Yongmao Zhang

Jian Cong

Heyang Xue

Lei Xie

Pengcheng Zhu

Mengxiao Bi

297

102

17 Oct 2021

SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation

Rongjie Huang

Zhou Zhao

440

14 Oct 2021

A Melody-Unsupervision Model for Singing Voice Synthesis

Soonbeom Choi

Juhan Nam

188

13 Oct 2021

KaraSinger: Score-Free Singing Voice Synthesis with VQ-VAE using Mel-spectrogramsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Chien-Feng Liao

Jen-Yu Liu

Yi-Hsuan Yang

226

08 Oct 2021

Sinsy: A Deep Neural Network-Based Singing Voice Synthesis SystemIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021

282

05 Aug 2021

Synchronising speech segments with musical beats in Mandarin and English singingInterspeech (Interspeech), 2021

Cong Zhang

Jian Zhu

100

18 Jun 2021

EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model

Rongjie Huang

Zhou Zhao

227

17 Jun 2021

WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution

Kexun Zhang

Yi Ren

Changliang Xu

Zhou Zhao

249

16 Jun 2021

DiffSinger: Singing Voice Synthesis via Shallow Diffusion MechanismAAAI Conference on Artificial Intelligence (AAAI), 2021

Zhou Zhao

664

345

06 May 2021

Semi-supervised Learning for Singing Synthesis Timbre

J. Bonada

Merlijn Blaauw

215

05 Nov 2020

Sequence-to-sequence Singing Voice Synthesis with Perceptual Entropy Loss

Jiatong Shi

238

22 Oct 2020

HiFiSinger: Towards High-Fidelity Neural Singing Voice Synthesis

Xu Tan

254

111

03 Sep 2020

PopMAG: Pop Music Accompaniment Generation

Xu Tan

Zhou Zhao

269

137

18 Aug 2020

PJS: phoneme-balanced Japanese singing voice corpus

Junya Koguchi

Shinnosuke Takamichi

204

04 Jun 2020