Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2007.04590
Cited By
v1
v2 (latest)
DeepSinger: Singing Voice Synthesis with Data Mined From the Web
Knowledge Discovery and Data Mining (KDD), 2020
9 July 2020
Yi Ren
Xu Tan
Tao Qin
Jian Luan
Zhou Zhao
Tie-Yan Liu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"DeepSinger: Singing Voice Synthesis with Data Mined From the Web"
50 / 50 papers shown
DiTSinger: Scaling Singing Voice Synthesis with Diffusion Transformer and Implicit Alignment
Zongcai Du
Guilin Deng
Xiaofeng Guo
Xin Gao
Linke Li
...
Fubo Han
Siyu Yang
Peng Liu
Pan Zhong
Qiang Fu
DiffM
411
1
0
10 Oct 2025
CoMelSinger: Discrete Token-Based Zero-Shot Singing Synthesis With Structured Melody Control and Guidance
Junchuan Zhao
Wei Zeng
Tianle Lyu
Ye Wang
215
2
0
24 Sep 2025
Mamba2 Meets Silence: Robust Vocal Source Separation for Sparse Regions
Euiyeon Kim
Yong-Hoon Choi
Mamba
358
0
0
20 Aug 2025
SingNet: Towards a Large-Scale, Diverse, and In-the-Wild Singing Voice Dataset
Yicheng Gu
Chaoren Wang
Jing Zhang
Xueyao Zhang
Zihao Fang
Haorui He
Zhizheng Wu
297
8
0
14 May 2025
TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching
AAAI Conference on Artificial Intelligence (AAAI), 2025
Wenxiang Guo
Yu Zhang
Changhao Pan
Rongjie Huang
Li Tang
Ruiqi Li
Zhiqing Hong
Yongqi Wang
Zhou Zhao
1.0K
18
0
18 Feb 2025
RDSinger: Reference-based Diffusion Network for Singing Voice Synthesis
Kehan Sui
Jinxu Xiang
Fang Jin
DiffM
258
2
0
29 Oct 2024
ConSinger: Efficient High-Fidelity Singing Voice Generation with Minimal Steps
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Yulin Song
Guorui Sang
Jing Yu
Chuangbai Xiao
DiffM
343
1
0
20 Oct 2024
SiFiSinger: A High-Fidelity End-to-End Singing Voice Synthesizer based on Source-filter Model
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Jianwei Cui
Yu Gu
Chao Weng
Jie Zhang
Liping Chen
Lirong Dai
197
9
0
16 Oct 2024
Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models
Orchid Chetia Phukan
Sarthak Jain
Swarup Ranjan Behera
Arun Balaji Buduru
Rajesh Sharma
S. R Mahadeva Prasanna
242
2
0
21 Sep 2024
MakeSinger: A Semi-Supervised Training Method for Data-Efficient Singing Voice Synthesis via Classifier-free Diffusion Guidance
Interspeech (Interspeech), 2024
Semin Kim
Myeonghun Jeong
Hyeonseung Lee
Minchan Kim
Byoung Jin Choi
Nam Soo Kim
VLM
DiffM
341
4
0
10 Jun 2024
RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text
Jiaben Chen
Xin Yan
Yihang Chen
Siyuan Cen
Zixin Wang
Qinwei Ma
Haoyu Zhen
Kaizhi Qian
Lie Lu
Chuang Gan
346
3
0
30 May 2024
Robust Singing Voice Transcription Serves Synthesis
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Ruiqi Li
Yu Zhang
Yongqi Wang
Zhiqing Hong
Rongjie Huang
Zhou Zhao
401
19
0
16 May 2024
Singing Voice Data Scaling-up: An Introduction to ACE-Opencpop and ACE-KiSing
Jiatong Shi
Yueqian Lin
Xinyi Bai
Keyi Zhang
Yuning Wu
Yuxun Tang
Yifeng Yu
Qin Jin
Shinji Watanabe
316
17
0
31 Jan 2024
FreGrad: Lightweight and Fast Frequency-aware Diffusion Vocoder
Tan Dat Nguyen
Ji-Hoon Kim
Youngjoon Jang
Jaehun Kim
Joon Son Chung
DiffM
305
21
0
18 Jan 2024
MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Dingyao Yu
Kaitao Song
Peiling Lu
Tianyu He
Xu Tan
Wei Ye
Shikun Zhang
Jiang Bian
LLMAG
399
27
0
18 Oct 2023
BiSinger: Bilingual Singing Voice Synthesis
Automatic Speech Recognition & Understanding (ASRU), 2023
Huali Zhou
Yueqian Lin
Yao Shi
Peng Sun
Ming Li
265
7
0
25 Sep 2023
FSD: An Initial Chinese Dataset for Fake Song Detection
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yuankun Xie
Jingjing Zhou
Xiaolin Lu
Zhenghao Jiang
Yuxin Yang
Haonan Cheng
Long Ye
256
21
0
05 Sep 2023
Enhancing the vocal range of single-speaker singing voice synthesis with melody-unsupervised pre-training
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Shaohuan Zhou
Xu Li
Zhiyong Wu
Yin Shan
Helen Meng
163
2
0
01 Sep 2023
Towards Improving the Expressiveness of Singing Voice Synthesis with BERT Derived Semantic Information
Interspeech (Interspeech), 2022
Shaohuan Zhou
Shunwei Lei
Weiya You
Deyi Tuo
Yuren You
Zhiyong Wu
Shiyin Kang
Helen Meng
266
4
0
31 Aug 2023
Elucidate Gender Fairness in Singing Voice Transcription
ACM Multimedia (ACM MM), 2023
Xiangming Gu
Weizhen Zeng
Ye Wang
291
4
0
05 Aug 2023
A Systematic Exploration of Joint-training for Singing Voice Synthesis
International Symposium on Chinese Spoken Language Processing (ISCSLP), 2023
Yuning Wu
Yifeng Yu
Jiatong Shi
Tao Qian
Qin Jin
288
7
0
05 Aug 2023
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Jinzheng He
Jinglin Liu
Zhenhui Ye
Rongjie Huang
Chenye Cui
Huadai Liu
Zhou Zhao
DiffM
283
31
0
18 May 2023
Translate the Beauty in Songs: Jointly Learning to Align Melody and Translate Lyrics
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Chengxi Li
Kai Fan
Jiajun Bu
Boxing Chen
Zhongqiang Huang
Zhi Yu
232
9
0
28 Mar 2023
PHONEix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation with Phoneme Distribution Predictor
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yuning Wu
Jiatong Shi
Tao Qian
Dongji Gao
Qin Jin
237
5
0
15 Mar 2023
UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis
AAAI Conference on Artificial Intelligence (AAAI), 2022
Yinjiao Lei
Shan Yang
Xinsheng Wang
Qicong Xie
Jixun Yao
Linfu Xie
Jane Polak Scowcroft
DiffM
247
15
0
03 Dec 2022
VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer
Interspeech (Interspeech), 2022
Yongmao Zhang
Heyang Xue
Hanzhao Li
Linfu Xie
Tingwei Guo
Ruixiong Zhang
Caixia Gong
DiffM
VLM
328
45
0
05 Nov 2022
Singing Voice Synthesis with Vibrato Modeling and Latent Energy Representation
IEEE International Workshop on Multimedia Signal Processing (MMSP), 2022
Yingjie Song
Wei Song
Wei Zhang
Zhengchen Zhang
Dan Zeng
Zhi Liu
Yang Yu
DiffM
236
8
0
02 Nov 2022
DDSP-based Singing Vocoders: A New Subtractive-based Synthesizer and A Comprehensive Evaluation
International Society for Music Information Retrieval Conference (ISMIR), 2022
Da-Yi Wu
Wen-Yi Hsiao
Fu-Rong Yang
Oscar D. Friedman
Warren Jackson
Scott Bruzenak
Yi-Wen Liu
Yi-Hsuan Yang
DiffM
301
28
0
09 Aug 2022
SUSing: SU-net for Singing Voice Synthesis
IEEE International Joint Conference on Neural Network (IJCNN), 2022
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
241
13
0
24 May 2022
SingAug: Data Augmentation for Singing Voice Synthesis with Cycle-consistent Training Strategy
Interspeech (Interspeech), 2022
Shuai Guo
Jiatong Shi
Tao Qian
Shinji Watanabe
Qin Jin
320
16
0
31 Mar 2022
WeSinger: Data-augmented Singing Voice Synthesis with Auxiliary Losses
Interspeech (Interspeech), 2022
Zewang Zhang
Yibin Zheng
Xinhui Li
Li Lu
657
21
0
21 Mar 2022
Learning the Beauty in Songs: Neural Singing Voice Beautifier
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jinglin Liu
Chengxi Li
Yi Ren
Zhiying Zhu
Zhou Zhao
DiffM
283
24
0
27 Feb 2022
Deep Performer: Score-to-Audio Music Performance Synthesis
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Hao-Wen Dong
Cong Zhou
Taylor Berg-Kirkpatrick
Julian McAuley
353
26
0
12 Feb 2022
Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus
ACM Multimedia (MM), 2021
Rongjie Huang
Feiyang Chen
Yi Ren
Jinglin Liu
Chenye Cui
Zhou Zhao
288
129
0
20 Dec 2021
Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control
K. Markopoulos
Nikolaos Ellinas
Alexandra Vioni
Myrsini Christidou
Panos Kakoulidis
...
Georgia Maniati
June Sig Sung
Hyoungmin Park
Pirros Tsiakoulis
Aimilios Chalamandaris
200
2
0
17 Nov 2021
RefineGAN: Universally Generating Waveform Better than Ground Truth with Highly Accurate Pitch and Intensity Responses
Interspeech (Interspeech), 2021
Shengyuan Xu
Wenxiao Zhao
Jing Guo
346
14
0
01 Nov 2021
VISinger: Variational Inference with Adversarial Learning for End-to-End Singing Voice Synthesis
Yongmao Zhang
Jian Cong
Heyang Xue
Lei Xie
Pengcheng Zhu
Mengxiao Bi
297
102
0
17 Oct 2021
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation
Rongjie Huang
Chenye Cui
Feiyang Chen
Yi Ren
Jinglin Liu
Zhou Zhao
Baoxing Huai
N. Yuan
GAN
440
71
0
14 Oct 2021
A Melody-Unsupervision Model for Singing Voice Synthesis
Soonbeom Choi
Juhan Nam
184
16
0
13 Oct 2021
KaraSinger: Score-Free Singing Voice Synthesis with VQ-VAE using Mel-spectrograms
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Chien-Feng Liao
Jen-Yu Liu
Yi-Hsuan Yang
222
6
0
08 Oct 2021
Sinsy: A Deep Neural Network-Based Singing Voice Synthesis System
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Yukiya Hono
Kei Hashimoto
Keiichiro Oura
Yoshihiko Nankaku
K. Tokuda
280
48
0
05 Aug 2021
Synchronising speech segments with musical beats in Mandarin and English singing
Interspeech (Interspeech), 2021
Cong Zhang
Jian Zhu
98
0
0
18 Jun 2021
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model
Chenye Cui
Yi Ren
Jinglin Liu
Feiyang Chen
Rongjie Huang
Ming Lei
Zhou Zhao
221
37
0
17 Jun 2021
WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution
Kexun Zhang
Yi Ren
Changliang Xu
Zhou Zhao
241
37
0
16 Jun 2021
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
AAAI Conference on Artificial Intelligence (AAAI), 2021
Jinglin Liu
Chengxi Li
Yi Ren
Feiyang Chen
Zhou Zhao
DiffM
662
342
0
06 May 2021
Semi-supervised Learning for Singing Synthesis Timbre
J. Bonada
Merlijn Blaauw
198
4
0
05 Nov 2020
Sequence-to-sequence Singing Voice Synthesis with Perceptual Entropy Loss
Jiatong Shi
Shuai Guo
Nan Huo
Yuekai Zhang
Qin Jin
232
33
0
22 Oct 2020
HiFiSinger: Towards High-Fidelity Neural Singing Voice Synthesis
Jiawei Chen
Xu Tan
Jian Luan
Tao Qin
Tie-Yan Liu
VLM
252
111
0
03 Sep 2020
PopMAG: Pop Music Accompaniment Generation
Yi Ren
Jinzheng He
Xu Tan
Tao Qin
Zhou Zhao
Tie-Yan Liu
267
136
0
18 Aug 2020
PJS: phoneme-balanced Japanese singing voice corpus
Junya Koguchi
Shinnosuke Takamichi
196
28
0
04 Jun 2020
1
Page 1 of 1