v1v2v3 (latest)

DiffWave: A Versatile Diffusion Model for Audio Synthesis

International Conference on Learning Representations (ICLR), 2020

21 September 2020

Papers citing "DiffWave: A Versatile Diffusion Model for Audio Synthesis"

50 / 1,135 papers shown

It's Raw! Audio Generation with State-Space ModelsInternational Conference on Machine Learning (ICML), 2022

276

235

20 Feb 2022

Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial Auto-EncodersInternational Conference on Learning Representations (ICLR), 2022

Huangjie Zheng

Pengcheng He

Weizhu Chen

Mingyuan Zhou

DiffM

311

19 Feb 2022

Conditional Diffusion Probabilistic Model for Speech EnhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

231

264

10 Feb 2022

InferGrad: Improving Diffusion Models for Vocoder by Considering Inference in TrainingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Xu Tan

Shifeng Pan

191

08 Feb 2022

Score-based Generative Modeling of Graphs via the System of Stochastic Differential EquationsInternational Conference on Machine Learning (ICML), 2022

362

297

05 Feb 2022

ItôWave: Itô Stochastic Differential Equation Is All You Need For Wave GenerationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Shoule Wu

Ziqiang Shi

DiffM

777

29 Jan 2022

DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

309

28 Jan 2022

J-MAC: Japanese multi-speaker audiobook corpus for speech synthesisInterspeech (Interspeech), 2022

Hiroshi Saruwatari

135

26 Jan 2022

Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic ModelsInternational Conference on Learning Representations (ICLR), 2022

Jun Zhu

374

390

17 Jan 2022

Audio representations for deep learning in sound synthesis: A reviewACS/IEEE International Conference on Computer Systems and Applications (AICCSA), 2021

Anastasia Natsiou

Seán O'Leary

AI4TS

156

07 Jan 2022

A sinusoidal signal reconstruction method for the inversion of the mel-spectrogramIEEE International Symposium on Multimedia (ISM), 2021

Anastasia Natsiou

Seán O'Leary

104

07 Jan 2022

Quasi-Taylor Samplers for Diffusion Generative Models based on Ideal Derivatives

298

26 Dec 2021

High-Resolution Image Synthesis with Latent Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2021

3.1K

21,434

20 Dec 2021

Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale CorpusACM Multimedia (MM), 2021

Rongjie Huang

Zhou Zhao

224

126

20 Dec 2021

Soundify: Matching Sound Effects to VideoACM Symposium on User Interface Software and Technology (UIST), 2021

308

17 Dec 2021

Tackling the Generative Learning Trilemma with Denoising Diffusion GANs

435

678

15 Dec 2021

Score-Based Generative Modeling with Critically-Damped Langevin Diffusion

685

272

14 Dec 2021

A Conditional Point Diffusion-Refinement Paradigm for 3D Point Cloud Completion

605

152

07 Dec 2021

VocBench: A Neural Vocoder Benchmark for Speech SynthesisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Ehab A. AlBadawy

180

06 Dec 2021

Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation

174

03 Dec 2021

SegDiff: Image Segmentation with Diffusion Probabilistic Models

Lior Wolf

365

397

01 Dec 2021

Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance

333

127

23 Nov 2021

More than Words: In-the-Wild Visually-Driven Prosody for Text-to-SpeechComputer Vision and Pattern Recognition (CVPR), 2021

Michael Hassid

Michelle Tadmor Ramanovich

213

19 Nov 2021

Palette: Image-to-Image Diffusion ModelsInternational Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), 2021

David J. Fleet

1.2K

2,033

10 Nov 2021

Estimating High Order Gradients of the Data Distribution by Denoising

217

08 Nov 2021

WaveFake: A Data Set to Facilitate Audio Deepfake Detection

Joel Frank

Lea Schonherr

DiffM

336

185

04 Nov 2021

Likelihood Training of Schrödinger Bridge using Forward-Backward SDEs TheoryInternational Conference on Learning Representations (ICLR), 2021

T. Chen

Guan-Horng Liu

Evangelos A. Theodorou

DiffM OT

697

229

21 Oct 2021

Diffusion Normalizing Flow

Qinsheng Zhang

Yongxin Chen

DiffM

212

105

14 Oct 2021

SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation

Rongjie Huang

Zhou Zhao

342

14 Oct 2021

SpecSinGAN: Sound Effect Variation Synthesis Using Single-Image GANs

Adrián Barahona-Ríos

Tom Collins

GAN

147

14 Oct 2021

Denoising Diffusion Gamma Models

Eliya Nachmani

S. Robin

Lior Wolf

DiffM VLM

219

10 Oct 2021

Score-based diffusion models for accelerated MRI

Hyungjin Chung

Jong Chul Ye

DiffM MedIm

546

506

08 Oct 2021

EdiTTS: Score-based Editing for Controllable Text-to-Speech

Jaesung Tae

Hyeongju Kim

Taesu Kim

DiffM

409

06 Oct 2021

Networked Time Series Prediction with Incomplete Data via Generative Adversarial Network

Xinbing Wang

287

05 Oct 2021

Autoregressive Diffusion Models

528

199

05 Oct 2021

On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis

Kaizhi Qian

...

190

04 Oct 2021

Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme

310

177

28 Sep 2021

MSR-NV: Neural Vocoder Using Multiple Sampling Rates

Kentaro Mitsui

Kei Sawada

255

28 Sep 2021

Bilateral Denoising Diffusion Models

Rongjie Huang

209

26 Aug 2021

ILVR: Conditioning Method for Denoising Diffusion Probabilistic ModelsIEEE International Conference on Computer Vision (ICCV), 2021

678

875

06 Aug 2021

A Benchmarking Initiative for Audio-Domain Music Generation Using the Freesound Loop Dataset

241

03 Aug 2021

Toward Spatially Unbiased Generative Models

466

03 Aug 2021

A Study on Speech Enhancement Based on Diffusion Probabilistic ModelAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021

255

25 Jul 2021

CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series ImputationNeural Information Processing Systems (NeurIPS), 2021

412

835

07 Jul 2021

Structured Denoising Diffusion Models in Discrete State-Spaces

905

1,386

07 Jul 2021

Variational Diffusion Models

926

1,372

01 Jul 2021

On the Generative Utility of Cyclic ConditionalsNeural Information Processing Systems (NeurIPS), 2021

238

30 Jun 2021

A Survey on Neural Speech Synthesis

Xu Tan

350

435

29 Jun 2021

Distilling the Knowledge from Conditional Normalizing Flows

229

24 Jun 2021

ScoreGrad: Multivariate Probabilistic Time Series Forecasting with Continuous Energy-based Generative Models

274

18 Jun 2021