ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.09761
  4. Cited By
DiffWave: A Versatile Diffusion Model for Audio Synthesis

DiffWave: A Versatile Diffusion Model for Audio Synthesis

21 September 2020
Zhifeng Kong
Wei Ping
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
    DiffM
    BDL
ArXivPDFHTML

Papers citing "DiffWave: A Versatile Diffusion Model for Audio Synthesis"

50 / 977 papers shown
Title
Discrete Contrastive Diffusion for Cross-Modal Music and Image
  Generation
Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation
Ye Zhu
Yuehua Wu
Kyle Olszewski
Jian Ren
Sergey Tulyakov
Yan Yan
DiffM
20
47
0
15 Jun 2022
Estimating the Optimal Covariance with Imperfect Mean in Diffusion
  Probabilistic Models
Estimating the Optimal Covariance with Imperfect Mean in Diffusion Probabilistic Models
Fan Bao
Chongxuan Li
Jiacheng Sun
Jun Zhu
Bo Zhang
DiffM
21
72
0
15 Jun 2022
Adversarial Audio Synthesis with Complex-valued Polynomial Networks
Adversarial Audio Synthesis with Complex-valued Polynomial Networks
Yongtao Wu
Grigorios G. Chrysos
V. Cevher
DiffM
4
4
0
14 Jun 2022
Multi-instrument Music Synthesis with Spectrogram Diffusion
Multi-instrument Music Synthesis with Spectrogram Diffusion
Curtis Hawthorne
Ian Simon
Adam Roberts
Neil Zeghidour
Josh Gardner
Ethan Manilow
Jesse Engel
DiffM
21
48
0
11 Jun 2022
How Much is Enough? A Study on Diffusion Times in Score-based Generative
  Models
How Much is Enough? A Study on Diffusion Times in Score-based Generative Models
Giulio Franzese
Simone Rossi
Lixuan Yang
A. Finamore
Dario Rossi
Maurizio Filippone
Pietro Michiardi
DiffM
13
46
0
10 Jun 2022
BigVGAN: A Universal Neural Vocoder with Large-Scale Training
BigVGAN: A Universal Neural Vocoder with Large-Scale Training
Sang-gil Lee
Wei Ping
Boris Ginsburg
Bryan Catanzaro
Sung-Hoon Yoon
17
224
0
09 Jun 2022
Neural Diffusion Processes
Neural Diffusion Processes
Vincent Dutordoir
Alan D. Saul
Zoubin Ghahramani
F. Simpson
DiffM
38
37
0
08 Jun 2022
Universal Speech Enhancement with Score-based Diffusion
Universal Speech Enhancement with Score-based Diffusion
Joan Serra
Santiago Pascual
Jordi Pons
R. O. Araz
D. Scaini
DiffM
17
95
0
07 Jun 2022
Zero-Shot Voice Conditioning for Denoising Diffusion TTS Models
Zero-Shot Voice Conditioning for Denoising Diffusion TTS Models
Alon Levkovitch
Eliya Nachmani
Lior Wolf
DiffM
19
29
0
05 Jun 2022
Score-Based Generative Models Detect Manifolds
Score-Based Generative Models Detect Manifolds
Jakiw Pidstrigach
DiffM
24
70
0
02 Jun 2022
DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder
DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder
Jie Shi
Chenfei Wu
Jian Liang
Xiang Liu
Nan Duan
DiffM
9
25
0
01 Jun 2022
Elucidating the Design Space of Diffusion-Based Generative Models
Elucidating the Design Space of Diffusion-Based Generative Models
Tero Karras
M. Aittala
Timo Aila
S. Laine
DiffM
50
1,826
0
01 Jun 2022
Improved Vector Quantized Diffusion Models
Improved Vector Quantized Diffusion Models
Zhicong Tang
Shuyang Gu
Jianmin Bao
Dong Chen
Fang Wen
DiffM
178
63
0
31 May 2022
Few-Shot Diffusion Models
Few-Shot Diffusion Models
Giorgio Giannone
Didrik Nielsen
Ole Winther
DiffM
183
49
0
30 May 2022
Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech
  with Untranscribed Data
Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data
Sungwon Kim
Heeseung Kim
Sung-Hoon Yoon
DiffM
196
52
0
30 May 2022
BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for
  Binaural Audio Synthesis
BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis
Yichong Leng
Zehua Chen
Junliang Guo
Haohe Liu
Jiawei Chen
...
Lei He
Xiang-Yang Li
Tao Qin
Sheng Zhao
Tie-Yan Liu
DiffM
51
58
0
30 May 2022
Diffusion-LM Improves Controllable Text Generation
Diffusion-LM Improves Controllable Text Generation
Xiang Lisa Li
John Thickstun
Ishaan Gulrajani
Percy Liang
Tatsunori B. Hashimoto
AI4CE
173
773
0
27 May 2022
Accelerating Diffusion Models via Early Stop of the Diffusion Process
Accelerating Diffusion Models via Early Stop of the Diffusion Process
Zhaoyang Lyu
Xu Xudong
Ceyuan Yang
Dahua Lin
Bo Dai
DiffM
193
92
0
25 May 2022
The ICML 2022 Expressive Vocalizations Workshop and Competition:
  Recognizing, Generating, and Personalizing Vocal Bursts
The ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts
Alice Baird
Panagiotis Tzirakis
Gauthier Gidel
Marco Jiralerspong
Eilif B. Muller
Kory W. Mathewson
Björn Schuller
Erik Cambria
D. Keltner
Alan S. Cowen
VLM
28
30
0
03 May 2022
Parallel Synthesis for Autoregressive Speech Generation
Parallel Synthesis for Autoregressive Speech Generation
Po-Chun Hsu
Da-Rong Liu
Andy T. Liu
Hung-yi Lee
34
5
0
25 Apr 2022
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech
  Synthesis
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis
Rongjie Huang
Max W. Y. Lam
J. Wang
Dan Su
Dong Yu
Yi Ren
Zhou Zhao
DiffM
28
164
0
21 Apr 2022
A Survey on Non-Autoregressive Generation for Neural Machine Translation
  and Beyond
A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond
Yisheng Xiao
Lijun Wu
Junliang Guo
Juntao Li
M. Zhang
Tao Qin
Tie-Yan Liu
3DV
MedIm
AI4CE
30
82
0
20 Apr 2022
A Post Auto-regressive GAN Vocoder Focused on Spectrum Fracture
Zhe-ming Lu
Mengnan He
Ruixiong Zhang
Caixia Gong
GAN
9
2
0
12 Apr 2022
The Sillwood Technologies System for the VoiceMOS Challenge 2022
The Sillwood Technologies System for the VoiceMOS Challenge 2022
Jiameng Gao
18
0
0
08 Apr 2022
Video Diffusion Models
Video Diffusion Models
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffM
VGen
44
1,504
0
07 Apr 2022
Perception Prioritized Training of Diffusion Models
Perception Prioritized Training of Diffusion Models
Jooyoung Choi
Jungbeom Lee
Chaehun Shin
Sungwon Kim
Hyunwoo J. Kim
Sung-Hoon Yoon
DiffM
22
232
0
01 Apr 2022
Speech Enhancement with Score-Based Generative Models in the Complex
  STFT Domain
Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain
Simon Welker
Julius Richter
Timo Gerkmann
DiffM
22
110
0
31 Mar 2022
SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with
  Adaptive Noise Spectral Shaping
SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping
Yuma Koizumi
Heiga Zen
Kohei Yatabe
Nanxin Chen
M. Bacchiani
DiffM
23
45
0
31 Mar 2022
Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion
Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion
Tianpei Gu
Guangyi Chen
Junlong Li
Chunze Lin
Yongming Rao
Jie Zhou
Jiwen Lu
DiffM
VGen
27
192
0
25 Mar 2022
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality
  Speech Synthesis
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
Max W. Y. Lam
J. Wang
Dan Su
Dong Yu
DiffM
29
92
0
25 Mar 2022
On the link between conscious function and general intelligence in
  humans and machines
On the link between conscious function and general intelligence in humans and machines
Arthur Juliani
Kai Arulkumaran
Shuntaro Sasai
Ryota Kanai
34
24
0
24 Mar 2022
Diffusion Probabilistic Modeling for Video Generation
Diffusion Probabilistic Modeling for Video Generation
Ruihan Yang
Prakhar Srivastava
Stephan Mandt
DiffM
VGen
36
255
0
16 Mar 2022
A Survey on Deep Graph Generation: Methods and Applications
A Survey on Deep Graph Generation: Methods and Applications
Yanqiao Zhu
Yuanqi Du
Yinkai Wang
Yichen Xu
Jieyu Zhang
Qiang Liu
Shu Wu
3DV
GNN
31
67
0
13 Mar 2022
Score-Based Generative Models for Molecule Generation
Score-Based Generative Models for Molecule Generation
Dwaraknath Gnaneshwar
Bharath Ramsundar
Dhairya Gandhi
Rachel C. Kurchin
V. Viswanathan
DiffM
22
11
0
07 Mar 2022
NeuralDPS: Neural Deterministic Plus Stochastic Model with Multiband
  Excitation for Noise-Controllable Waveform Generation
NeuralDPS: Neural Deterministic Plus Stochastic Model with Multiband Excitation for Noise-Controllable Waveform Generation
Tao Wang
Ruibo Fu
Jiangyan Yi
J. Tao
Zhengqi Wen
9
2
0
05 Mar 2022
Measurement-conditioned Denoising Diffusion Probabilistic Model for
  Under-sampled Medical Image Reconstruction
Measurement-conditioned Denoising Diffusion Probabilistic Model for Under-sampled Medical Image Reconstruction
Yutong Xie
Quanzheng Li
DiffM
MedIm
24
87
0
05 Mar 2022
iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating
  Inverse Short-Time Fourier Transform
iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform
Takuhiro Kaneko
Kou Tanaka
Hirokazu Kameoka
Shogo Seki
15
60
0
04 Mar 2022
Wavebender GAN: An architecture for phonetically meaningful speech
  manipulation
Wavebender GAN: An architecture for phonetically meaningful speech manipulation
Gustavo Teodoro Döhler Beck
Ulme Wennberg
Zofia Malisz
G. Henter
AI4CE
19
8
0
22 Feb 2022
Pseudo Numerical Methods for Diffusion Models on Manifolds
Pseudo Numerical Methods for Diffusion Models on Manifolds
Luping Liu
Yi Ren
Zhijie Lin
Zhou Zhao
DiffM
35
619
0
20 Feb 2022
It's Raw! Audio Generation with State-Space Models
It's Raw! Audio Generation with State-Space Models
Karan Goel
Albert Gu
Chris Donahue
Christopher Ré
14
185
0
20 Feb 2022
Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial
  Auto-Encoders
Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial Auto-Encoders
Huangjie Zheng
Pengcheng He
Weizhu Chen
Mingyuan Zhou
DiffM
17
44
0
19 Feb 2022
Conditional Diffusion Probabilistic Model for Speech Enhancement
Conditional Diffusion Probabilistic Model for Speech Enhancement
Yen-Ju Lu
Zhongqiu Wang
Shinji Watanabe
Alexander Richard
Cheng Yu
Yu Tsao
DiffM
21
174
0
10 Feb 2022
InferGrad: Improving Diffusion Models for Vocoder by Considering
  Inference in Training
InferGrad: Improving Diffusion Models for Vocoder by Considering Inference in Training
Zehua Chen
Xu Tan
Ke Wang
Shifeng Pan
Danilo P. Mandic
Lei He
Sheng Zhao
DiffM
18
28
0
08 Feb 2022
Score-based Generative Modeling of Graphs via the System of Stochastic
  Differential Equations
Score-based Generative Modeling of Graphs via the System of Stochastic Differential Equations
Jaehyeong Jo
Seul Lee
Sung Ju Hwang
DiffM
22
210
0
05 Feb 2022
ItôWave: Itô Stochastic Differential Equation Is All You Need For
  Wave Generation
ItôWave: Itô Stochastic Differential Equation Is All You Need For Wave Generation
Shoule Wu
Ziqiang Shi
DiffM
245
9
0
29 Jan 2022
DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising
  Diffusion GANs
DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Songxiang Liu
Dan Su
Dong Yu
DiffM
68
65
0
28 Jan 2022
J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis
J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis
Shinnosuke Takamichi
Wataru Nakata
Naoko Tanji
Hiroshi Saruwatari
AuLLM
17
6
0
26 Jan 2022
Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in
  Diffusion Probabilistic Models
Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models
Fan Bao
Chongxuan Li
Jun Zhu
Bo Zhang
DiffM
30
337
0
17 Jan 2022
Audio representations for deep learning in sound synthesis: A review
Audio representations for deep learning in sound synthesis: A review
Anastasia Natsiou
Seán O'Leary
AI4TS
14
18
0
07 Jan 2022
A sinusoidal signal reconstruction method for the inversion of the
  mel-spectrogram
A sinusoidal signal reconstruction method for the inversion of the mel-spectrogram
Anastasia Natsiou
Seán O'Leary
9
3
0
07 Jan 2022
Previous
123...17181920
Next