v1v2v3 (latest)

DiffWave: A Versatile Diffusion Model for Audio Synthesis

International Conference on Learning Representations (ICLR), 2020

21 September 2020

Papers citing "DiffWave: A Versatile Diffusion Model for Audio Synthesis"

50 / 1,135 papers shown

AdaCat: Adaptive Categorical Discretization for Autoregressive ModelsConference on Uncertainty in Artificial Intelligence (UAI), 2022

Qiyang Li

Ajay Jain

Pieter Abbeel

OffRL

197

03 Aug 2022

DeScoD-ECG: Deep Score-Based Diffusion Model for ECG Baseline Wander and Noise RemovalIEEE journal of biomedical and health informatics (IEEE JBHI), 2022

226

31 Jul 2022

Classifier-Free Diffusion Guidance

Jonathan Ho

Tim Salimans

FaML

476

5,385

26 Jul 2022

A Proposal for Foley Sound Synthesis Challenge

127

21 Jul 2022

Diffsound: Discrete Diffusion Model for Text-to-sound GenerationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Dongchao Yang

Helin Wang

Dong Yu

285

382

20 Jul 2022

ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-SpeechACM Multimedia (ACM MM), 2022

Rongjie Huang

Zhou Zhao

269

236

13 Jul 2022

Entropy-driven Sampling and Training Scheme for Conditional Diffusion GenerationEuropean Conference on Computer Vision (ECCV), 2022

Xi Li

333

23 Jun 2022

Generative Modelling With Inverse Heat DissipationInternational Conference on Learning Representations (ICLR), 2022

Severi Rissanen

Markus Heinonen

Arno Solin

DiffM

841

152

21 Jun 2022

A Flexible Diffusion ModelInternational Conference on Machine Learning (ICML), 2022

193

17 Jun 2022

NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling RatesInterspeech (Interspeech), 2022

Seungu Han

Junhyeok Lee

DiffM

308

17 Jun 2022

Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score MatchingInternational Conference on Machine Learning (ICML), 2022

Kaiwen Zheng

Jianfei Chen

Jun Zhu

271

103

16 Jun 2022

Discrete Contrastive Diffusion for Cross-Modal Music and Image GenerationInternational Conference on Learning Representations (ICLR), 2022

Yan Yan

383

15 Jun 2022

Estimating the Optimal Covariance with Imperfect Mean in Diffusion Probabilistic ModelsInternational Conference on Machine Learning (ICML), 2022

Jun Zhu

200

15 Jun 2022

Adversarial Audio Synthesis with Complex-valued Polynomial Networks

320

14 Jun 2022

Multi-instrument Music Synthesis with Spectrogram DiffusionInternational Society for Music Information Retrieval Conference (ISMIR), 2022

248

11 Jun 2022

How Much is Enough? A Study on Diffusion Times in Score-based Generative Models

234

10 Jun 2022

BigVGAN: A Universal Neural Vocoder with Large-Scale TrainingInternational Conference on Learning Representations (ICLR), 2022

Boris Ginsburg

311

388

09 Jun 2022

Neural Diffusion ProcessesInternational Conference on Machine Learning (ICML), 2022

376

08 Jun 2022

Universal Speech Enhancement with Score-based Diffusion

408

130

07 Jun 2022

Zero-Shot Voice Conditioning for Denoising Diffusion TTS ModelsInterspeech (Interspeech), 2022

210

05 Jun 2022

Score-Based Generative Models Detect ManifoldsNeural Information Processing Systems (NeurIPS), 2022

Jakiw Pidstrigach

DiffM

476

111

02 Jun 2022

DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder

Jian Liang

212

01 Jun 2022

Elucidating the Design Space of Diffusion-Based Generative ModelsNeural Information Processing Systems (NeurIPS), 2022

974

2,803

01 Jun 2022

Improved Vector Quantized Diffusion Models

Jianmin Bao

436

31 May 2022

Few-Shot Diffusion Models

361

30 May 2022

Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data

416

30 May 2022

BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio SynthesisNeural Information Processing Systems (NeurIPS), 2022

Junliang Guo

...

Xiang-Yang Li

312

30 May 2022

Diffusion-LM Improves Controllable Text GenerationNeural Information Processing Systems (NeurIPS), 2022

Xiang Lisa Li

John Thickstun

Ishaan Gulrajani

Abigail Z. Jacobs

Tatsunori B. Hashimoto

AI4CE

514

1,115

27 May 2022

Accelerating Diffusion Models via Early Stop of the Diffusion Process

561

125

25 May 2022

The ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts

Björn Schuller

175

03 May 2022

Parallel Synthesis for Autoregressive Speech GenerationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

286

25 Apr 2022

FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech SynthesisInternational Joint Conference on Artificial Intelligence (IJCAI), 2022

Rongjie Huang

Zhou Zhao

157

211

21 Apr 2022

A Survey on Non-Autoregressive Generation for Neural Machine Translation and BeyondIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Junliang Guo

264

115

20 Apr 2022

A Post Auto-regressive GAN Vocoder Focused on Spectrum Fracture

12 Apr 2022

The Sillwood Technologies System for the VoiceMOS Challenge 2022

Jiameng Gao

179

08 Apr 2022

Video Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2022

David J. Fleet

873

2,226

07 Apr 2022

Perception Prioritized Training of Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2022

300

332

01 Apr 2022

Speech Enhancement with Score-Based Generative Models in the Complex STFT DomainInterspeech (Interspeech), 2022

348

149

31 Mar 2022

SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral ShapingInterspeech (Interspeech), 2022

314

31 Mar 2022

Stochastic Trajectory Prediction via Motion Indeterminacy DiffusionComputer Vision and Pattern Recognition (CVPR), 2022

Jie Zhou

819

322

25 Mar 2022

BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech SynthesisInternational Conference on Learning Representations (ICLR), 2022

236

103

25 Mar 2022

On the link between conscious function and general intelligence in humans and machines

296

24 Mar 2022

Diffusion Probabilistic Modeling for Video Generation

624

316

16 Mar 2022

A Survey on Deep Graph Generation: Methods and ApplicationsLOG IN (LOG IN), 2022

380

13 Mar 2022

Score-Based Generative Models for Molecule Generation

Dwaraknath Gnaneshwar

110

07 Mar 2022

NeuralDPS: Neural Deterministic Plus Stochastic Model with Multiband Excitation for Noise-Controllable Waveform GenerationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Tao Wang

Ruibo Fu

Jiangyan Yi

Jianhua Tao

Zhengqi Wen

05 Mar 2022

Measurement-conditioned Denoising Diffusion Probabilistic Model for Under-sampled Medical Image ReconstructionInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022

Yutong Xie

Shijie Zhao

DiffM MedIm

274

121

05 Mar 2022

iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier TransformIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

191

04 Mar 2022

Wavebender GAN: An architecture for phonetically meaningful speech manipulationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Gustavo Teodoro Döhler Beck

178

22 Feb 2022

Pseudo Numerical Methods for Diffusion Models on ManifoldsInternational Conference on Learning Representations (ICLR), 2022

Zhou Zhao

546

805

20 Feb 2022