ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2105.02446
  4. Cited By
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

6 May 2021
Jinglin Liu
Chengxi Li
Yi Ren
Feiyang Chen
Zhou Zhao
    DiffM
ArXivPDFHTML

Papers citing "DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism"

50 / 159 papers shown
Title
HiddenSinger: High-Quality Singing Voice Synthesis via Neural Audio
  Codec and Latent Diffusion Models
HiddenSinger: High-Quality Singing Voice Synthesis via Neural Audio Codec and Latent Diffusion Models
Ji-Sang Hwang
Sang-Hoon Lee
Seong-Whan Lee
DiffM
14
8
0
12 Jun 2023
On the Design Fundamentals of Diffusion Models: A Survey
On the Design Fundamentals of Diffusion Models: A Survey
Ziyi Chang
G. Koulieris
Hubert P. H. Shum
DiffM
27
52
0
07 Jun 2023
Interpretable Style Transfer for Text-to-Speech with ControlVAE and
  Diffusion Bridge
Interpretable Style Transfer for Text-to-Speech with ControlVAE and Diffusion Bridge
Wenhao Guan
Tao Li
Yishuang Li
Hukai Huang
Q. Hong
Lin Li
DiffM
21
6
0
07 Jun 2023
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive
  Bias
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias
Ziyue Jiang
Yi Ren
Zhe Ye
Jinglin Liu
Chen Zhang
...
Rongjie Huang
Chunfeng Wang
Xiang Yin
Zejun Ma
Zhou Zhao
DiffM
21
73
0
06 Jun 2023
Latent Optimal Paths by Gumbel Propagation for Variational Bayesian
  Dynamic Programming
Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming
Xinlei Niu
Christian J. Walder
J. Zhang
Charles Patrick Martin
BDL
6
0
0
05 Jun 2023
Detector Guidance for Multi-Object Text-to-Image Generation
Detector Guidance for Multi-Object Text-to-Image Generation
Luping Liu
Zijian Zhang
Yi Ren
Rongjie Huang
Xiang Yin
Zhou Zhao
DiffM
22
9
0
04 Jun 2023
Make-A-Voice: Unified Voice Synthesis With Discrete Representation
Make-A-Voice: Unified Voice Synthesis With Discrete Representation
Rongjie Huang
Chunlei Zhang
Yongqiang Wang
Dongchao Yang
Lu Liu
Zhenhui Ye
Ziyue Jiang
Chao Weng
Zhou Zhao
Dong Yu
DiffM
23
26
0
30 May 2023
Diverse and Expressive Speech Prosody Prediction with Denoising
  Diffusion Probabilistic Model
Diverse and Expressive Speech Prosody Prediction with Denoising Diffusion Probabilistic Model
Xiang Li
Songxiang Liu
Max W. Y. Lam
Zhiyong Wu
Chao Weng
H. Meng
DiffM
21
5
0
26 May 2023
Optimal Linear Subspace Search: Learning to Construct Fast and
  High-Quality Schedulers for Diffusion Models
Optimal Linear Subspace Search: Learning to Construct Fast and High-Quality Schedulers for Diffusion Models
Zhongjie Duan
Chengyu Wang
Cen Chen
Jun Huang
Weining Qian
DiffM
16
12
0
24 May 2023
CLAPSpeech: Learning Prosody from Text Context with Contrastive
  Language-Audio Pre-training
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training
Zhe Ye
Rongjie Huang
Yi Ren
Ziyue Jiang
Jinglin Liu
Jinzheng He
Xiang Yin
Zhou Zhao
CLIP
26
20
0
18 May 2023
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
Jinzheng He
Jinglin Liu
Zhenhui Ye
Rongjie Huang
Chenye Cui
Huadai Liu
Zhou Zhao
DiffM
14
19
0
18 May 2023
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency
  Model
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
Zhe Ye
Wei Xue
Xuejiao Tan
Jie Chen
Qi-fei Liu
Yi-Ting Guo
DiffM
28
40
0
11 May 2023
Learn to Sing by Listening: Building Controllable Virtual Singer by
  Unsupervised Learning from Voice Recordings
Learn to Sing by Listening: Building Controllable Virtual Singer by Unsupervised Learning from Voice Recordings
Wei Xue
Yiwen Wang
Qi-fei Liu
Yi-Ting Guo
16
1
0
09 May 2023
Improved Techniques for Maximum Likelihood Estimation for Diffusion ODEs
Improved Techniques for Maximum Likelihood Estimation for Diffusion ODEs
Kaiwen Zheng
Cheng Lu
Jianfei Chen
Jun Zhu
DiffM
22
26
0
06 May 2023
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking
  Head
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Rongjie Huang
Mingze Li
Dongchao Yang
Jiatong Shi
Xuankai Chang
...
Jia-Bin Huang
Jinglin Liu
Yixiang Ren
Zhou Zhao
Shinji Watanabe
LM&MA
AuLLM
24
196
0
25 Apr 2023
Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling
  in Offline Reinforcement Learning
Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning
Cheng Lu
Huayu Chen
Jianfei Chen
Hang Su
Chongxuan Li
Jun Zhu
DiffM
OffRL
19
58
0
25 Apr 2023
Deep Audio-Visual Singing Voice Transcription based on Self-Supervised
  Learning Models
Deep Audio-Visual Singing Voice Transcription based on Self-Supervised Learning Models
Xiangming Gu
Weizhen Zeng
Jianan Zhang
Longshen Ou
Ye Wang
32
6
0
24 Apr 2023
DiffVoice: Text-to-Speech with Latent Diffusion
DiffVoice: Text-to-Speech with Latent Diffusion
Zhijun Liu
Yiwei Guo
K. Yu
DiffM
22
22
0
23 Apr 2023
NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot
  Speech and Singing Synthesizers
NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers
Kai Shen
Zeqian Ju
Xu Tan
Yanqing Liu
Yichong Leng
Lei He
Tao Qin
Sheng Zhao
Jiang Bian
DiffM
15
221
0
18 Apr 2023
Translate the Beauty in Songs: Jointly Learning to Align Melody and
  Translate Lyrics
Translate the Beauty in Songs: Jointly Learning to Align Melody and Translate Lyrics
Chengxi Li
Kai Fan
Jiajun Bu
Boxing Chen
Zhongqiang Huang
Zhi Yu
11
7
0
28 Mar 2023
PHONEix: Acoustic Feature Processing Strategy for Enhanced Singing
  Pronunciation with Phoneme Distribution Predictor
PHONEix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation with Phoneme Distribution Predictor
Yuning Wu
Jiatong Shi
Tao Qian
Dongji Gao
Qin Jin
17
5
0
15 Mar 2023
An investigation into the adaptability of a diffusion-based TTS model
An investigation into the adaptability of a diffusion-based TTS model
Haolin Chen
Philip N. Garner
DiffM
31
1
0
03 Mar 2023
Continuous-Time Functional Diffusion Processes
Continuous-Time Functional Diffusion Processes
Giulio Franzese
Dario Rossi
Simone Rossi
Markus Heinonen
Maurizio Filippone
Pietro Michiardi
29
24
0
01 Mar 2023
DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization
DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization
Zhiqing Sun
Yiming Yang
DiffM
22
117
0
16 Feb 2023
ReDi: Efficient Learning-Free Diffusion Inference via Trajectory
  Retrieval
ReDi: Efficient Learning-Free Diffusion Inference via Trajectory Retrieval
Kexun Zhang
Xianjun Yang
William Yang Wang
Lei Li
DiffM
35
8
0
05 Feb 2023
DiffSTG: Probabilistic Spatio-Temporal Graph Forecasting with Denoising
  Diffusion Models
DiffSTG: Probabilistic Spatio-Temporal Graph Forecasting with Denoising Diffusion Models
Haomin Wen
Youfang Lin
Yutong Xia
Huaiyu Wan
Qingsong Wen
Roger Zimmermann
Yuxuan Liang
DiffM
23
78
0
31 Jan 2023
Cross-domain Neural Pitch and Periodicity Estimation
Cross-domain Neural Pitch and Periodicity Estimation
Max Morrison
Caedon Hsieh
Nathan Pruyne
Bryan Pardo
15
16
0
28 Jan 2023
DiffSDS: A language diffusion model for protein backbone inpainting
  under geometric conditions and constraints
DiffSDS: A language diffusion model for protein backbone inpainting under geometric conditions and constraints
Zhangyang Gao
Cheng Tan
Stan Z. Li
DiffM
16
19
0
22 Jan 2023
Msanii: High Fidelity Music Synthesis on a Shoestring Budget
Msanii: High Fidelity Music Synthesis on a Shoestring Budget
Kinyugo Maina
19
5
0
16 Jan 2023
ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to
  Speech
ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech
Ze Chen
Yihan Wu
Yichong Leng
Jiawei Chen
Haohe Liu
...
Ke Wang
Lei He
Sheng Zhao
Jiang Bian
Danilo P. Mandic
DiffM
20
22
0
30 Dec 2022
UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice
  Synthesis
UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis
Yinjiao Lei
Shan Yang
Xinsheng Wang
Qicong Xie
Jixun Yao
Linfu Xie
Dan Su
DiffM
11
8
0
03 Dec 2022
EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label
  Guidance
EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance
Yiwei Guo
Chenpeng Du
Xie Chen
K. Yu
DiffM
52
39
0
17 Nov 2022
NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
Hyeong-Seok Choi
Jinhyeok Yang
Juheon Lee
Hyeongju Kim
16
46
0
17 Nov 2022
VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by
  Digital Signal Processing Synthesizer
VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer
Yongmao Zhang
Heyang Xue
Hanzhao Li
Linfu Xie
Tingwei Guo
Ruixiong Zhang
Caixia Gong
DiffM
VLM
12
28
0
05 Nov 2022
DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic
  Models
DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models
Cheng Lu
Yuhao Zhou
Fan Bao
Jianfei Chen
Chongxuan Li
Jun Zhu
DiffM
30
551
0
02 Nov 2022
SDMuse: Stochastic Differential Music Editing and Generation via Hybrid
  Representation
SDMuse: Stochastic Differential Music Editing and Generation via Hybrid Representation
Chen Zhang
Yi Ren
Kecheng Zhang
Shuicheng Yan
DiffM
6
13
0
01 Nov 2022
NNSVS: A Neural Network-Based Singing Voice Synthesis Toolkit
NNSVS: A Neural Network-Based Singing Voice Synthesis Toolkit
Ryuichi Yamamoto
Reo Yoneyama
T. Toda
71
11
0
28 Oct 2022
Robust One-Shot Singing Voice Conversion
Robust One-Shot Singing Voice Conversion
Naoya Takahashi
M. Singh
Yuki Mitsufuji
DiffM
15
8
0
20 Oct 2022
OCD: Learning to Overfit with Conditional Diffusion Models
OCD: Learning to Overfit with Conditional Diffusion Models
Shahar Lutati
Lior Wolf
DiffM
13
8
0
02 Oct 2022
A Survey on Generative Diffusion Model
A Survey on Generative Diffusion Model
Hanqun Cao
Cheng Tan
Zhangyang Gao
Yilun Xu
Guangyong Chen
Pheng-Ann Heng
Stan Z. Li
MedIm
37
205
0
06 Sep 2022
Diffusion Models: A Comprehensive Survey of Methods and Applications
Diffusion Models: A Comprehensive Survey of Methods and Applications
Ling Yang
Zhilong Zhang
Yingxia Shao
Shenda Hong
Runsheng Xu
Yue Zhao
Wentao Zhang
Bin Cui
Ming-Hsuan Yang
DiffM
MedIm
224
1,296
0
02 Sep 2022
DDSP-based Singing Vocoders: A New Subtractive-based Synthesizer and A
  Comprehensive Evaluation
DDSP-based Singing Vocoders: A New Subtractive-based Synthesizer and A Comprehensive Evaluation
Da-Yi Wu
Wen-Yi Hsiao
Fu-Rong Yang
Oscar D. Friedman
Warren Jackson
Scott Bruzenak
Yi-Wen Liu
Yi-Hsuan Yang
DiffM
23
24
0
09 Aug 2022
ProDiff: Progressive Fast Diffusion Model For High-Quality
  Text-to-Speech
ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech
Rongjie Huang
Zhou Zhao
Huadai Liu
Jinglin Liu
Chenye Cui
Yi Ren
DiffM
44
193
0
13 Jul 2022
Entropy-driven Sampling and Training Scheme for Conditional Diffusion
  Generation
Entropy-driven Sampling and Training Scheme for Conditional Diffusion Generation
Sheng-liang Li
Guangcong Zheng
Haibo Wang
Taiping Yao
Yang Chen
Shoudong Ding
Xi Li
DiffM
13
22
0
23 Jun 2022
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for
  Text-to-Speech
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech
Ziyue Jiang
Zhe Su
Zhou Zhao
Qian Yang
Yi Ren
Jinglin Liu
Zhe Ye
24
4
0
05 Jun 2022
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain
  Text-to-Speech
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech
Rongjie Huang
Yi Ren
Jinglin Liu
Chenye Cui
Zhou Zhao
OODD
VLM
115
34
0
15 May 2022
Muskits: an End-to-End Music Processing Toolkit for Singing Voice
  Synthesis
Muskits: an End-to-End Music Processing Toolkit for Singing Voice Synthesis
Jiatong Shi
Shuai Guo
Tao Qian
Nan Huo
Tomoki Hayashi
...
Xuankai Chang
Hua-Wei Li
Peter Wu
Shinji Watanabe
Qin Jin
VLM
9
26
0
09 May 2022
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech
  Synthesis
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis
Rongjie Huang
Max W. Y. Lam
J. Wang
Dan Su
Dong Yu
Yi Ren
Zhou Zhao
DiffM
11
164
0
21 Apr 2022
Learn2Sing 2.0: Diffusion and Mutual Information-Based Target Speaker
  SVS by Learning from Singing Teacher
Learn2Sing 2.0: Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher
Heyang Xue
Xinsheng Wang
Yongmao Zhang
Lei Xie
Pengcheng Zhu
Mengxiao Bi
DiffM
11
11
0
30 Mar 2022
Contrastive Learning with Positive-Negative Frame Mask for Music
  Representation
Contrastive Learning with Positive-Negative Frame Mask for Music Representation
D. Yao
Zhou Zhao
Shengyu Zhang
Jieming Zhu
Yudong Zhu
Rui Zhang
Xiuqiang He
12
21
0
17 Mar 2022
Previous
1234
Next