2009.09761
DiffWave: A Versatile Diffusion Model for Audio Synthesis
International Conference on Learning Representations (ICLR), 2020
21 September 2020
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM
BDL
Papers citing
"DiffWave: A Versatile Diffusion Model for Audio Synthesis"
50 / 1,135 papers shown
It's Raw! Audio Generation with State-Space Models
International Conference on Machine Learning (ICML), 2022
Karan Goel
Albert Gu
Chris Donahue
Christopher Ré
276
235
0
20 Feb 2022
Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial Auto-Encoders
International Conference on Learning Representations (ICLR), 2022
Huangjie Zheng
Pengcheng He
Weizhu Chen
Mingyuan Zhou
DiffM
311
55
0
19 Feb 2022
Conditional Diffusion Probabilistic Model for Speech Enhancement
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Yen-Ju Lu
Zhongqiu Wang
Shinji Watanabe
Alexander Richard
Cheng Yu
Yu Tsao
DiffM
231
264
0
10 Feb 2022
InferGrad: Improving Diffusion Models for Vocoder by Considering Inference in Training
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zehua Chen
Xu Tan
Ke Wang
Shifeng Pan
Danilo Mandic
Lei He
Sheng Zhao
DiffM
191
35
0
08 Feb 2022
Score-based Generative Modeling of Graphs via the System of Stochastic Differential Equations
International Conference on Machine Learning (ICML), 2022
Jaehyeong Jo
Seul Lee
Sung Ju Hwang
DiffM
362
297
0
05 Feb 2022
ItôWave: Itô Stochastic Differential Equation Is All You Need For Wave Generation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Shoule Wu
Ziqiang Shi
DiffM
777
9
0
29 Jan 2022
DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Songxiang Liu
Jane Polak Scowcroft
Dong Yu
DiffM
309
75
0
28 Jan 2022
J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis
Interspeech, 2022
Shinnosuke Takamichi
Wataru Nakata
Naoko Tanji
Hiroshi Saruwatari
AuLLM
135
8
0
26 Jan 2022
Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models
International Conference on Learning Representations (ICLR), 2022
Fan Bao
Chongxuan Li
Jun Zhu
Bo Zhang
DiffM
374
390
0
17 Jan 2022
Audio representations for deep learning in sound synthesis: A review
ACS/IEEE International Conference on Computer Systems and Applications (AICCSA), 2021
Anastasia Natsiou
Seán O'Leary
AI4TS
156
26
0
07 Jan 2022
A sinusoidal signal reconstruction method for the inversion of the mel-spectrogram
IEEE International Symposium on Multimedia (ISM), 2021
Anastasia Natsiou
Seán O'Leary
104
3
0
07 Jan 2022
Quasi-Taylor Samplers for Diffusion Generative Models based on Ideal Derivatives
Hideyuki Tachibana
Mocho Go
Muneyoshi Inahara
Yotaro Katayama
Yotaro Watanabe
DiffM
298
4
0
26 Dec 2021
High-Resolution Image Synthesis with Latent Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2021
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
DiffM
3.1K
21,434
0
20 Dec 2021
Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus
ACM Multimedia (MM), 2021
Rongjie Huang
Feiyang Chen
Yi Ren
Jinglin Liu
Chenye Cui
Zhou Zhao
224
126
0
20 Dec 2021
Soundify: Matching Sound Effects to Video
ACM Symposium on User Interface Software and Technology (UIST), 2021
David Chuan-En Lin
Anastasis Germanidis
Cristobal Valenzuela
Yining Shi
Nikolas Martelaro
308
21
0
17 Dec 2021
Tackling the Generative Learning Trilemma with Denoising Diffusion GANs
Zhisheng Xiao
Karsten Kreis
Arash Vahdat
DiffM
435
678
0
15 Dec 2021
Score-Based Generative Modeling with Critically-Damped Langevin Diffusion
Tim Dockhorn
Arash Vahdat
Karsten Kreis
DiffM
685
272
0
14 Dec 2021
A Conditional Point Diffusion-Refinement Paradigm for 3D Point Cloud Completion
Zhaoyang Lyu
Zhifeng Kong
Xudong Xu
Liang Pan
Dahua Lin
DiffM
BDL
605
152
0
07 Dec 2021
VocBench: A Neural Vocoder Benchmark for Speech Synthesis
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Ehab A. AlBadawy
Andrew Gibiansky
Qing He
Jilong Wu
Ming-Ching Chang
Siwei Lyu
180
17
0
06 Dec 2021
Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation
Minghui Hu
Yujie Wang
Tat-Jen Cham
Jianfei Yang
P. N. Suganthan
DiffM
174
52
0
03 Dec 2021
SegDiff: Image Segmentation with Diffusion Probabilistic Models
Tomer Amit
Tal Shaharbany
Eliya Nachmani
Lior Wolf
DiffM
365
397
0
01 Dec 2021
Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance
Heeseung Kim
Sungwon Kim
Sungroh Yoon
DiffM
BDL
333
127
0
23 Nov 2021
More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech
Computer Vision and Pattern Recognition (CVPR), 2021
Michael Hassid
Michelle Tadmor Ramanovich
Brendan Shillingford
Miaosen Wang
Ye Jia
Tal Remez
DiffM
213
21
0
19 Nov 2021
Palette: Image-to-Image Diffusion Models
International Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), 2021
Chitwan Saharia
William Chan
Huiwen Chang
Chris A. Lee
Jonathan Ho
Tim Salimans
David J. Fleet
Mohammad Norouzi
DiffM
VLM
1.2K
2,033
0
10 Nov 2021
Estimating High Order Gradients of the Data Distribution by Denoising
Chenlin Meng
Yang Song
Wenzhe Li
Stefano Ermon
DiffM
217
64
0
08 Nov 2021
WaveFake: A Data Set to Facilitate Audio Deepfake Detection
Joel Frank
Lea Schönherr
DiffM
336
185
0
04 Nov 2021
Likelihood Training of Schrödinger Bridge using Forward-Backward SDEs Theory
International Conference on Learning Representations (ICLR), 2021
T. Chen
Guan-Horng Liu
Evangelos A. Theodorou
DiffM
OT
697
229
0
21 Oct 2021
Diffusion Normalizing Flow
Qinsheng Zhang
Yongxin Chen
DiffM
212
105
0
14 Oct 2021
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation
Rongjie Huang
Chenye Cui
Feiyang Chen
Yi Ren
Jinglin Liu
Zhou Zhao
Baoxing Huai
N. Yuan
GAN
342
70
0
14 Oct 2021
SpecSinGAN: Sound Effect Variation Synthesis Using Single-Image GANs
Adrián Barahona-Ríos
Tom Collins
GAN
147
7
0
14 Oct 2021
Denoising Diffusion Gamma Models
Eliya Nachmani
S. Robin
Lior Wolf
DiffM
VLM
219
34
0
10 Oct 2021
Score-based diffusion models for accelerated MRI
Hyungjin Chung
Jong Chul Ye
DiffM
MedIm
546
506
0
08 Oct 2021
EdiTTS: Score-based Editing for Controllable Text-to-Speech
Jaesung Tae
Hyeongju Kim
Taesu Kim
DiffM
409
47
0
06 Oct 2021
Networked Time Series Prediction with Incomplete Data via Generative Adversarial Network
Yichen Zhu
Bo Jiang
Haiming Jin
Mengtian Zhang
Feng Gao
Jianqiang Huang
Tao Lin
Xinbing Wang
GNN
AI4TS
287
8
0
05 Oct 2021
Autoregressive Diffusion Models
Emiel Hoogeboom
Alexey A. Gritsenko
Jasmijn Bastings
Ben Poole
Rianne van den Berg
Tim Salimans
DiffM
528
199
0
05 Oct 2021
On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis
Cheng-I Jeff Lai
Erica Cooper
Yang Zhang
Shiyu Chang
Kaizhi Qian
...
Yung-Sung Chuang
Alexander H. Liu
Junichi Yamagishi
David D. Cox
James R. Glass
190
7
0
04 Oct 2021
Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme
Vadim Popov
Ivan Vovk
Vladimir Gogoryan
Tasnima Sadekova
Mikhail Kudinov
Jiansheng Wei
DiffM
BDL
310
177
0
28 Sep 2021
MSR-NV: Neural Vocoder Using Multiple Sampling Rates
Kentaro Mitsui
Kei Sawada
255
1
0
28 Sep 2021
Bilateral Denoising Diffusion Models
Max W. Y. Lam
Jun Wang
Rongjie Huang
Jane Polak Scowcroft
Dong Yu
DiffM
209
45
0
26 Aug 2021
ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models
IEEE International Conference on Computer Vision (ICCV), 2021
Jooyoung Choi
Sungwon Kim
Yonghyun Jeong
Youngjune Gwon
Sungroh Yoon
DiffM
678
875
0
06 Aug 2021
A Benchmarking Initiative for Audio-Domain Music Generation Using the Freesound Loop Dataset
Tun-Min Hung
Bo-Yu Chen
Yen-Tung Yeh
Yi-Hsuan Yang
241
12
0
03 Aug 2021
Toward Spatially Unbiased Generative Models
Jooyoung Choi
Jungbeom Lee
Yonghyun Jeong
Sungroh Yoon
DiffM
466
17
0
03 Aug 2021
A Study on Speech Enhancement Based on Diffusion Probabilistic Model
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021
Yen-Ju Lu
Yu Tsao
Shinji Watanabe
DiffM
255
91
0
25 Jul 2021
CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation
Neural Information Processing Systems (NeurIPS), 2021
Y. Tashiro
Jiaming Song
Yang Song
Stefano Ermon
BDL
DiffM
412
835
0
07 Jul 2021
Structured Denoising Diffusion Models in Discrete State-Spaces
Jacob Austin
Daniel D. Johnson
Jonathan Ho
Daniel Tarlow
Rianne van den Berg
DiffM
905
1,386
0
07 Jul 2021
Variational Diffusion Models
Diederik P. Kingma
Tim Salimans
Ben Poole
Jonathan Ho
DiffM
926
1,372
0
01 Jul 2021
On the Generative Utility of Cyclic Conditionals
Neural Information Processing Systems (NeurIPS), 2021
Yu Xie
Haoyue Tang
Tao Qin
Jintao Wang
Tie-Yan Liu
238
4
0
30 Jun 2021
A Survey on Neural Speech Synthesis
Xu Tan
Tao Qin
Frank Soong
Tie-Yan Liu
AI4TS
350
435
0
29 Jun 2021
Distilling the Knowledge from Conditional Normalizing Flows
Dmitry Baranchuk
Vladimir Aliev
Artem Babenko
BDL
229
4
0
24 Jun 2021
ScoreGrad: Multivariate Probabilistic Time Series Forecasting with Continuous Energy-based Generative Models
Tijin Yan
Hongwei Zhang
Tong Zhou
Yufeng Zhan
Yuanqing Xia
DiffM
AI4TS
274
49
0
18 Jun 2021