ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.09761
  4. Cited By
DiffWave: A Versatile Diffusion Model for Audio Synthesis
v1v2v3 (latest)

DiffWave: A Versatile Diffusion Model for Audio Synthesis

International Conference on Learning Representations (ICLR), 2020
21 September 2020
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
    DiffMBDL
ArXiv (abs)PDFHTML

Papers citing "DiffWave: A Versatile Diffusion Model for Audio Synthesis"

50 / 1,135 papers shown
AdaCat: Adaptive Categorical Discretization for Autoregressive Models
AdaCat: Adaptive Categorical Discretization for Autoregressive ModelsConference on Uncertainty in Artificial Intelligence (UAI), 2022
Qiyang Li
Ajay Jain
Pieter Abbeel
OffRL
197
4
0
03 Aug 2022
DeScoD-ECG: Deep Score-Based Diffusion Model for ECG Baseline Wander and
  Noise Removal
DeScoD-ECG: Deep Score-Based Diffusion Model for ECG Baseline Wander and Noise RemovalIEEE journal of biomedical and health informatics (IEEE JBHI), 2022
Huayu Li
G. Ditzler
Janet Roveda
Ao Li
DiffM
226
80
0
31 Jul 2022
Classifier-Free Diffusion Guidance
Classifier-Free Diffusion Guidance
Jonathan Ho
Tim Salimans
FaML
476
5,385
0
26 Jul 2022
A Proposal for Foley Sound Synthesis Challenge
A Proposal for Foley Sound Synthesis Challenge
Keunwoo Choi
Sangshin Oh
Minsung Kang
Brian McFee
127
11
0
21 Jul 2022
Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Diffsound: Discrete Diffusion Model for Text-to-sound GenerationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Dongchao Yang
Jianwei Yu
Helin Wang
Wen Wang
Chao Weng
Yuexian Zou
Dong Yu
DiffM
285
382
0
20 Jul 2022
ProDiff: Progressive Fast Diffusion Model For High-Quality
  Text-to-Speech
ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-SpeechACM Multimedia (ACM MM), 2022
Rongjie Huang
Zhou Zhao
Huadai Liu
Jinglin Liu
Chenye Cui
Yi Ren
DiffM
269
236
0
13 Jul 2022
Entropy-driven Sampling and Training Scheme for Conditional Diffusion
  Generation
Entropy-driven Sampling and Training Scheme for Conditional Diffusion GenerationEuropean Conference on Computer Vision (ECCV), 2022
Sheng-liang Li
Guangcong Zheng
Haibo Wang
Taiping Yao
Yang Chen
Shoudong Ding
Xi Li
DiffM
333
29
0
23 Jun 2022
Generative Modelling With Inverse Heat Dissipation
Generative Modelling With Inverse Heat DissipationInternational Conference on Learning Representations (ICLR), 2022
Severi Rissanen
Markus Heinonen
Arno Solin
DiffM
841
152
0
21 Jun 2022
A Flexible Diffusion Model
A Flexible Diffusion ModelInternational Conference on Machine Learning (ICML), 2022
Weitao Du
Tao Yang
Heidi Zhang
Yuanqi Du
DiffM
193
12
0
17 Jun 2022
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling
  Rates
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling RatesInterspeech (Interspeech), 2022
Seungu Han
Junhyeok Lee
DiffM
308
61
0
17 Jun 2022
Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order
  Denoising Score Matching
Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score MatchingInternational Conference on Machine Learning (ICML), 2022
Cheng Lu
Kaiwen Zheng
Fan Bao
Jianfei Chen
Chongxuan Li
Jun Zhu
DiffM
271
103
0
16 Jun 2022
Discrete Contrastive Diffusion for Cross-Modal Music and Image
  Generation
Discrete Contrastive Diffusion for Cross-Modal Music and Image GenerationInternational Conference on Learning Representations (ICLR), 2022
Ye Zhu
Yuehua Wu
Kyle Olszewski
Jian Ren
Sergey Tulyakov
Yan Yan
DiffM
383
57
0
15 Jun 2022
Estimating the Optimal Covariance with Imperfect Mean in Diffusion
  Probabilistic Models
Estimating the Optimal Covariance with Imperfect Mean in Diffusion Probabilistic ModelsInternational Conference on Machine Learning (ICML), 2022
Fan Bao
Chongxuan Li
Jiacheng Sun
Jun Zhu
Bo Zhang
DiffM
200
86
0
15 Jun 2022
Adversarial Audio Synthesis with Complex-valued Polynomial Networks
Adversarial Audio Synthesis with Complex-valued Polynomial Networks
Yongtao Wu
Grigorios G. Chrysos
Volkan Cevher
DiffM
320
4
0
14 Jun 2022
Multi-instrument Music Synthesis with Spectrogram Diffusion
Multi-instrument Music Synthesis with Spectrogram DiffusionInternational Society for Music Information Retrieval Conference (ISMIR), 2022
Curtis Hawthorne
Ian Simon
Adam Roberts
Neil Zeghidour
Josh Gardner
Ethan Manilow
Jesse Engel
DiffM
248
56
0
11 Jun 2022
How Much is Enough? A Study on Diffusion Times in Score-based Generative
  Models
How Much is Enough? A Study on Diffusion Times in Score-based Generative Models
Giulio Franzese
Simone Rossi
Lixuan Yang
A. Finamore
Dario Rossi
Maurizio Filippone
Pietro Michiardi
DiffM
234
50
0
10 Jun 2022
BigVGAN: A Universal Neural Vocoder with Large-Scale Training
BigVGAN: A Universal Neural Vocoder with Large-Scale TrainingInternational Conference on Learning Representations (ICLR), 2022
Sang-gil Lee
Ming-Yu Liu
Boris Ginsburg
Bryan Catanzaro
Sung-Hoon Yoon
311
388
0
09 Jun 2022
Neural Diffusion Processes
Neural Diffusion ProcessesInternational Conference on Machine Learning (ICML), 2022
Vincent Dutordoir
Alan D. Saul
Zoubin Ghahramani
F. Simpson
DiffM
376
50
0
08 Jun 2022
Universal Speech Enhancement with Score-based Diffusion
Universal Speech Enhancement with Score-based Diffusion
Joan Serrà
Santiago Pascual
Jordi Pons
R. O. Araz
D. Scaini
DiffM
408
130
0
07 Jun 2022
Zero-Shot Voice Conditioning for Denoising Diffusion TTS Models
Zero-Shot Voice Conditioning for Denoising Diffusion TTS ModelsInterspeech (Interspeech), 2022
Alon Levkovitch
Eliya Nachmani
Lior Wolf
DiffM
210
32
0
05 Jun 2022
Score-Based Generative Models Detect Manifolds
Score-Based Generative Models Detect ManifoldsNeural Information Processing Systems (NeurIPS), 2022
Jakiw Pidstrigach
DiffM
476
111
0
02 Jun 2022
DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder
DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder
Jie Shi
Chenfei Wu
Jian Liang
Xiang Liu
Nan Duan
DiffM
212
31
0
01 Jun 2022
Elucidating the Design Space of Diffusion-Based Generative Models
Elucidating the Design Space of Diffusion-Based Generative ModelsNeural Information Processing Systems (NeurIPS), 2022
Tero Karras
M. Aittala
Timo Aila
S. Laine
DiffM
974
2,803
0
01 Jun 2022
Improved Vector Quantized Diffusion Models
Improved Vector Quantized Diffusion Models
Zhicong Tang
Shuyang Gu
Jianmin Bao
Dong Chen
Fang Wen
DiffM
436
73
0
31 May 2022
Few-Shot Diffusion Models
Few-Shot Diffusion Models
Giorgio Giannone
Didrik Nielsen
Ole Winther
DiffM
361
55
0
30 May 2022
Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech
  with Untranscribed Data
Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data
Sungwon Kim
Heeseung Kim
Sung-Hoon Yoon
DiffM
416
62
0
30 May 2022
BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for
  Binaural Audio Synthesis
BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio SynthesisNeural Information Processing Systems (NeurIPS), 2022
Yichong Leng
Zehua Chen
Junliang Guo
Haohe Liu
Jiawei Chen
...
Lei He
Xiang-Yang Li
Tao Qin
Sheng Zhao
Tie-Yan Liu
DiffM
312
77
0
30 May 2022
Diffusion-LM Improves Controllable Text Generation
Diffusion-LM Improves Controllable Text GenerationNeural Information Processing Systems (NeurIPS), 2022
Xiang Lisa Li
John Thickstun
Ishaan Gulrajani
Abigail Z. Jacobs
Tatsunori B. Hashimoto
AI4CE
514
1,115
0
27 May 2022
Accelerating Diffusion Models via Early Stop of the Diffusion Process
Accelerating Diffusion Models via Early Stop of the Diffusion Process
Zhaoyang Lyu
Xu Xudong
Ceyuan Yang
Dahua Lin
Bo Dai
DiffM
561
125
0
25 May 2022
The ICML 2022 Expressive Vocalizations Workshop and Competition:
  Recognizing, Generating, and Personalizing Vocal Bursts
The ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts
Alice Baird
Panagiotis Tzirakis
Gauthier Gidel
Marco Jiralerspong
Eilif B. Muller
Kory W. Mathewson
Björn Schuller
Xiaoshi Zhong
D. Keltner
Alan S. Cowen
VLM
175
30
0
03 May 2022
Parallel Synthesis for Autoregressive Speech Generation
Parallel Synthesis for Autoregressive Speech GenerationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Po-Chun Hsu
Da-Rong Liu
Andy T. Liu
Hung-yi Lee
286
6
0
25 Apr 2022
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech
  Synthesis
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech SynthesisInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Rongjie Huang
Max W. Y. Lam
Jun Wang
Jane Polak Scowcroft
Dong Yu
Yi Ren
Zhou Zhao
DiffM
157
211
0
21 Apr 2022
A Survey on Non-Autoregressive Generation for Neural Machine Translation
  and Beyond
A Survey on Non-Autoregressive Generation for Neural Machine Translation and BeyondIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yisheng Xiao
Lijun Wu
Junliang Guo
Juntao Li
Hao Fei
Tao Qin
Tie-Yan Liu
3DVMedImAI4CE
264
115
0
20 Apr 2022
A Post Auto-regressive GAN Vocoder Focused on Spectrum Fracture
Zhe-ming Lu
Mengnan He
Ruixiong Zhang
Caixia Gong
GAN
87
2
0
12 Apr 2022
The Sillwood Technologies System for the VoiceMOS Challenge 2022
The Sillwood Technologies System for the VoiceMOS Challenge 2022
Jiameng Gao
179
0
0
08 Apr 2022
Video Diffusion Models
Video Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2022
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffMVGen
873
2,226
0
07 Apr 2022
Perception Prioritized Training of Diffusion Models
Perception Prioritized Training of Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2022
Jooyoung Choi
Jungbeom Lee
Chaehun Shin
Sungwon Kim
Hyunwoo J. Kim
Sung-Hoon Yoon
DiffM
300
332
0
01 Apr 2022
Speech Enhancement with Score-Based Generative Models in the Complex
  STFT Domain
Speech Enhancement with Score-Based Generative Models in the Complex STFT DomainInterspeech (Interspeech), 2022
Simon Welker
Julius Richter
Timo Gerkmann
DiffM
348
149
0
31 Mar 2022
SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with
  Adaptive Noise Spectral Shaping
SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral ShapingInterspeech (Interspeech), 2022
Yuma Koizumi
Heiga Zen
Kohei Yatabe
Nanxin Chen
M. Bacchiani
DiffM
314
53
0
31 Mar 2022
Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion
Stochastic Trajectory Prediction via Motion Indeterminacy DiffusionComputer Vision and Pattern Recognition (CVPR), 2022
Tianpei Gu
Guangyi Chen
Junlong Li
Chunze Lin
Yongming Rao
Jie Zhou
Jiwen Lu
DiffMVGen
819
322
0
25 Mar 2022
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality
  Speech Synthesis
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech SynthesisInternational Conference on Learning Representations (ICLR), 2022
Max W. Y. Lam
Jun Wang
Jane Polak Scowcroft
Dong Yu
DiffM
236
103
0
25 Mar 2022
On the link between conscious function and general intelligence in
  humans and machines
On the link between conscious function and general intelligence in humans and machines
Arthur Juliani
Kai Arulkumaran
Shuntaro Sasai
Ryota Kanai
296
28
0
24 Mar 2022
Diffusion Probabilistic Modeling for Video Generation
Diffusion Probabilistic Modeling for Video Generation
Ruihan Yang
Prakhar Srivastava
Stephan Mandt
DiffMVGen
624
316
0
16 Mar 2022
A Survey on Deep Graph Generation: Methods and Applications
A Survey on Deep Graph Generation: Methods and ApplicationsLOG IN (LOG IN), 2022
Yanqiao Zhu
Yuanqi Du
Yinkai Wang
Yichen Xu
Jieyu Zhang
Qiang Liu
Shu Wu
3DVGNN
380
75
0
13 Mar 2022
Score-Based Generative Models for Molecule Generation
Score-Based Generative Models for Molecule Generation
Dwaraknath Gnaneshwar
Bharath Ramsundar
Dhairya Gandhi
Rachel C. Kurchin
V. Viswanathan
DiffM
110
14
0
07 Mar 2022
NeuralDPS: Neural Deterministic Plus Stochastic Model with Multiband
  Excitation for Noise-Controllable Waveform Generation
NeuralDPS: Neural Deterministic Plus Stochastic Model with Multiband Excitation for Noise-Controllable Waveform GenerationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Tao Wang
Ruibo Fu
Jiangyan Yi
Jianhua Tao
Zhengqi Wen
87
2
0
05 Mar 2022
Measurement-conditioned Denoising Diffusion Probabilistic Model for
  Under-sampled Medical Image Reconstruction
Measurement-conditioned Denoising Diffusion Probabilistic Model for Under-sampled Medical Image ReconstructionInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022
Yutong Xie
Shijie Zhao
DiffMMedIm
274
121
0
05 Mar 2022
iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating
  Inverse Short-Time Fourier Transform
iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier TransformIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Takuhiro Kaneko
Kou Tanaka
Hirokazu Kameoka
Shogo Seki
191
86
0
04 Mar 2022
Wavebender GAN: An architecture for phonetically meaningful speech
  manipulation
Wavebender GAN: An architecture for phonetically meaningful speech manipulationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Gustavo Teodoro Döhler Beck
Ulme Wennberg
Zofia Malisz
G. Henter
AI4CE
178
10
0
22 Feb 2022
Pseudo Numerical Methods for Diffusion Models on Manifolds
Pseudo Numerical Methods for Diffusion Models on ManifoldsInternational Conference on Learning Representations (ICLR), 2022
Luping Liu
Yi Ren
Zhijie Lin
Zhou Zhao
DiffM
546
805
0
20 Feb 2022
Previous
123...20212223
Next
Page 21 of 23
Pageof 23