2009.09761
DiffWave: A Versatile Diffusion Model for Audio Synthesis
21 September 2020
Zhifeng Kong
Wei Ping
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM
BDL
Papers citing
"DiffWave: A Versatile Diffusion Model for Audio Synthesis"
50 / 977 papers shown
Title
Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation
Ye Zhu
Yuehua Wu
Kyle Olszewski
Jian Ren
Sergey Tulyakov
Yan Yan
DiffM
20
47
0
15 Jun 2022
Estimating the Optimal Covariance with Imperfect Mean in Diffusion Probabilistic Models
Fan Bao
Chongxuan Li
Jiacheng Sun
Jun Zhu
Bo Zhang
DiffM
21
72
0
15 Jun 2022
Adversarial Audio Synthesis with Complex-valued Polynomial Networks
Yongtao Wu
Grigorios G. Chrysos
V. Cevher
DiffM
4
4
0
14 Jun 2022
Multi-instrument Music Synthesis with Spectrogram Diffusion
Curtis Hawthorne
Ian Simon
Adam Roberts
Neil Zeghidour
Josh Gardner
Ethan Manilow
Jesse Engel
DiffM
21
48
0
11 Jun 2022
How Much is Enough? A Study on Diffusion Times in Score-based Generative Models
Giulio Franzese
Simone Rossi
Lixuan Yang
A. Finamore
Dario Rossi
Maurizio Filippone
Pietro Michiardi
DiffM
13
46
0
10 Jun 2022
BigVGAN: A Universal Neural Vocoder with Large-Scale Training
Sang-gil Lee
Wei Ping
Boris Ginsburg
Bryan Catanzaro
Sungroh Yoon
17
224
0
09 Jun 2022
Neural Diffusion Processes
Vincent Dutordoir
Alan D. Saul
Zoubin Ghahramani
F. Simpson
DiffM
38
37
0
08 Jun 2022
Universal Speech Enhancement with Score-based Diffusion
Joan Serra
Santiago Pascual
Jordi Pons
R. O. Araz
D. Scaini
DiffM
17
95
0
07 Jun 2022
Zero-Shot Voice Conditioning for Denoising Diffusion TTS Models
Alon Levkovitch
Eliya Nachmani
Lior Wolf
DiffM
19
29
0
05 Jun 2022
Score-Based Generative Models Detect Manifolds
Jakiw Pidstrigach
DiffM
24
70
0
02 Jun 2022
DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder
Jie Shi
Chenfei Wu
Jian Liang
Xiang Liu
Nan Duan
DiffM
9
25
0
01 Jun 2022
Elucidating the Design Space of Diffusion-Based Generative Models
Tero Karras
M. Aittala
Timo Aila
S. Laine
DiffM
50
1,826
0
01 Jun 2022
Improved Vector Quantized Diffusion Models
Zhicong Tang
Shuyang Gu
Jianmin Bao
Dong Chen
Fang Wen
DiffM
178
63
0
31 May 2022
Few-Shot Diffusion Models
Giorgio Giannone
Didrik Nielsen
Ole Winther
DiffM
183
49
0
30 May 2022
Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data
Sungwon Kim
Heeseung Kim
Sungroh Yoon
DiffM
196
52
0
30 May 2022
BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis
Yichong Leng
Zehua Chen
Junliang Guo
Haohe Liu
Jiawei Chen
...
Lei He
Xiang-Yang Li
Tao Qin
Sheng Zhao
Tie-Yan Liu
DiffM
51
58
0
30 May 2022
Diffusion-LM Improves Controllable Text Generation
Xiang Lisa Li
John Thickstun
Ishaan Gulrajani
Percy Liang
Tatsunori B. Hashimoto
AI4CE
173
773
0
27 May 2022
Accelerating Diffusion Models via Early Stop of the Diffusion Process
Zhaoyang Lyu
Xu Xudong
Ceyuan Yang
Dahua Lin
Bo Dai
DiffM
193
92
0
25 May 2022
The ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts
Alice Baird
Panagiotis Tzirakis
Gauthier Gidel
Marco Jiralerspong
Eilif B. Muller
Kory W. Mathewson
Björn Schuller
Erik Cambria
D. Keltner
Alan S. Cowen
VLM
28
30
0
03 May 2022
Parallel Synthesis for Autoregressive Speech Generation
Po-Chun Hsu
Da-Rong Liu
Andy T. Liu
Hung-yi Lee
34
5
0
25 Apr 2022
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis
Rongjie Huang
Max W. Y. Lam
J. Wang
Dan Su
Dong Yu
Yi Ren
Zhou Zhao
DiffM
28
164
0
21 Apr 2022
A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond
Yisheng Xiao
Lijun Wu
Junliang Guo
Juntao Li
M. Zhang
Tao Qin
Tie-Yan Liu
3DV
MedIm
AI4CE
30
82
0
20 Apr 2022
A Post Auto-regressive GAN Vocoder Focused on Spectrum Fracture
Zhe-ming Lu
Mengnan He
Ruixiong Zhang
Caixia Gong
GAN
9
2
0
12 Apr 2022
The Sillwood Technologies System for the VoiceMOS Challenge 2022
Jiameng Gao
18
0
0
08 Apr 2022
Video Diffusion Models
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffM
VGen
44
1,504
0
07 Apr 2022
Perception Prioritized Training of Diffusion Models
Jooyoung Choi
Jungbeom Lee
Chaehun Shin
Sungwon Kim
Hyunwoo J. Kim
Sungroh Yoon
DiffM
22
232
0
01 Apr 2022
Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain
Simon Welker
Julius Richter
Timo Gerkmann
DiffM
22
110
0
31 Mar 2022
SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping
Yuma Koizumi
Heiga Zen
Kohei Yatabe
Nanxin Chen
M. Bacchiani
DiffM
23
45
0
31 Mar 2022
Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion
Tianpei Gu
Guangyi Chen
Junlong Li
Chunze Lin
Yongming Rao
Jie Zhou
Jiwen Lu
DiffM
VGen
27
192
0
25 Mar 2022
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
Max W. Y. Lam
J. Wang
Dan Su
Dong Yu
DiffM
29
92
0
25 Mar 2022
On the link between conscious function and general intelligence in humans and machines
Arthur Juliani
Kai Arulkumaran
Shuntaro Sasai
Ryota Kanai
34
24
0
24 Mar 2022
Diffusion Probabilistic Modeling for Video Generation
Ruihan Yang
Prakhar Srivastava
Stephan Mandt
DiffM
VGen
36
255
0
16 Mar 2022
A Survey on Deep Graph Generation: Methods and Applications
Yanqiao Zhu
Yuanqi Du
Yinkai Wang
Yichen Xu
Jieyu Zhang
Qiang Liu
Shu Wu
3DV
GNN
31
67
0
13 Mar 2022
Score-Based Generative Models for Molecule Generation
Dwaraknath Gnaneshwar
Bharath Ramsundar
Dhairya Gandhi
Rachel C. Kurchin
V. Viswanathan
DiffM
22
11
0
07 Mar 2022
NeuralDPS: Neural Deterministic Plus Stochastic Model with Multiband Excitation for Noise-Controllable Waveform Generation
Tao Wang
Ruibo Fu
Jiangyan Yi
J. Tao
Zhengqi Wen
9
2
0
05 Mar 2022
Measurement-conditioned Denoising Diffusion Probabilistic Model for Under-sampled Medical Image Reconstruction
Yutong Xie
Quanzheng Li
DiffM
MedIm
24
87
0
05 Mar 2022
iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform
Takuhiro Kaneko
Kou Tanaka
Hirokazu Kameoka
Shogo Seki
15
60
0
04 Mar 2022
Wavebender GAN: An architecture for phonetically meaningful speech manipulation
Gustavo Teodoro Döhler Beck
Ulme Wennberg
Zofia Malisz
G. Henter
AI4CE
19
8
0
22 Feb 2022
Pseudo Numerical Methods for Diffusion Models on Manifolds
Luping Liu
Yi Ren
Zhijie Lin
Zhou Zhao
DiffM
35
619
0
20 Feb 2022
It's Raw! Audio Generation with State-Space Models
Karan Goel
Albert Gu
Chris Donahue
Christopher Ré
14
185
0
20 Feb 2022
Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial Auto-Encoders
Huangjie Zheng
Pengcheng He
Weizhu Chen
Mingyuan Zhou
DiffM
17
44
0
19 Feb 2022
Conditional Diffusion Probabilistic Model for Speech Enhancement
Yen-Ju Lu
Zhongqiu Wang
Shinji Watanabe
Alexander Richard
Cheng Yu
Yu Tsao
DiffM
21
174
0
10 Feb 2022
InferGrad: Improving Diffusion Models for Vocoder by Considering Inference in Training
Zehua Chen
Xu Tan
Ke Wang
Shifeng Pan
Danilo P. Mandic
Lei He
Sheng Zhao
DiffM
18
28
0
08 Feb 2022
Score-based Generative Modeling of Graphs via the System of Stochastic Differential Equations
Jaehyeong Jo
Seul Lee
Sung Ju Hwang
DiffM
22
210
0
05 Feb 2022
ItôWave: Itô Stochastic Differential Equation Is All You Need For Wave Generation
Shoule Wu
Ziqiang Shi
DiffM
245
9
0
29 Jan 2022
DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Songxiang Liu
Dan Su
Dong Yu
DiffM
68
65
0
28 Jan 2022
J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis
Shinnosuke Takamichi
Wataru Nakata
Naoko Tanji
Hiroshi Saruwatari
AuLLM
17
6
0
26 Jan 2022
Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models
Fan Bao
Chongxuan Li
Jun Zhu
Bo Zhang
DiffM
30
337
0
17 Jan 2022
Audio representations for deep learning in sound synthesis: A review
Anastasia Natsiou
Seán O'Leary
AI4TS
14
18
0
07 Jan 2022
A sinusoidal signal reconstruction method for the inversion of the mel-spectrogram
Anastasia Natsiou
Seán O'Leary
9
3
0
07 Jan 2022