ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.09761
  4. Cited By
DiffWave: A Versatile Diffusion Model for Audio Synthesis
v1v2v3 (latest)

DiffWave: A Versatile Diffusion Model for Audio Synthesis

International Conference on Learning Representations (ICLR), 2020
21 September 2020
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
    DiffMBDL
ArXiv (abs)PDFHTML

Papers citing "DiffWave: A Versatile Diffusion Model for Audio Synthesis"

50 / 1,129 papers shown
Title
How to Backdoor Diffusion Models?
How to Backdoor Diffusion Models?Computer Vision and Pattern Recognition (CVPR), 2022
Sheng-Yen Chou
Pin-Yu Chen
Tsung-Yi Ho
DiffMSILM
381
113
0
11 Dec 2022
Framewise WaveGAN: High Speed Adversarial Vocoder in Time Domain with
  Very Low Computational Complexity
Framewise WaveGAN: High Speed Adversarial Vocoder in Time Domain with Very Low Computational ComplexityIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Ahmed Mustafa
J. Valin
Jan Büthe
Paris Smaragdis
Mike Goodwin
123
6
0
08 Dec 2022
MoFusion: A Framework for Denoising-Diffusion-based Motion Synthesis
MoFusion: A Framework for Denoising-Diffusion-based Motion SynthesisComputer Vision and Pattern Recognition (CVPR), 2022
Rishabh Dabral
Muhammad Hamza Mughal
Vladislav Golyanik
Christian Theobalt
DiffMVGen
245
224
0
08 Dec 2022
Talking Head Generation with Probabilistic Audio-to-Visual Diffusion
  Priors
Talking Head Generation with Probabilistic Audio-to-Visual Diffusion PriorsIEEE International Conference on Computer Vision (ICCV), 2022
Zhentao Yu
Zixin Yin
Deyu Zhou
Duomin Wang
Finn Wong
Baoyuan Wang
DiffM
171
53
0
07 Dec 2022
Diffusion-SDF: Text-to-Shape via Voxelized Diffusion
Diffusion-SDF: Text-to-Shape via Voxelized DiffusionComputer Vision and Pattern Recognition (CVPR), 2022
Muheng Li
Yueqi Duan
Jie Zhou
Jiwen Lu
DiffM
239
147
0
06 Dec 2022
Denoising diffusion probabilistic models for probabilistic energy
  forecasting
Denoising diffusion probabilistic models for probabilistic energy forecasting
Esteban Hernandez Capel
Jonathan Dumas
DiffM
252
21
0
06 Dec 2022
DiffusionInst: Diffusion Model for Instance Segmentation
DiffusionInst: Diffusion Model for Instance SegmentationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zhangxuan Gu
Haoxing Chen
Zhuoer Xu
Jun Lan
Changhua Meng
Weiqiang Wang
DiffM
175
107
0
06 Dec 2022
PhysDiff: Physics-Guided Human Motion Diffusion Model
PhysDiff: Physics-Guided Human Motion Diffusion ModelIEEE International Conference on Computer Vision (ICCV), 2022
Ye Yuan
Jiaming Song
Umar Iqbal
Arash Vahdat
Jan Kautz
VGenDiffM
483
345
0
05 Dec 2022
Diffusion Generative Models in Infinite Dimensions
Diffusion Generative Models in Infinite DimensionsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Gavin Kerrigan
Justin Ley
Padhraic Smyth
DiffM
336
42
0
01 Dec 2022
3D-LDM: Neural Implicit 3D Shape Generation with Latent Diffusion Models
3D-LDM: Neural Implicit 3D Shape Generation with Latent Diffusion Models
Gimin Nam
Mariem Khlifi
Andrew Rodriguez
Alberto Tono
Linqi Zhou
Paul Guerrero
DiffM
225
78
0
01 Dec 2022
Denoising Diffusion for Sampling SAT Solutions
Denoising Diffusion for Sampling SAT Solutions
Kārlis Freivalds
Sergejs Kozlovics
117
3
0
30 Nov 2022
DiffusionBERT: Improving Generative Masked Language Models with
  Diffusion Models
DiffusionBERT: Improving Generative Masked Language Models with Diffusion ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Zhengfu He
Tianxiang Sun
Kuan-Chieh Wang
Xuanjing Huang
Xipeng Qiu
DiffMVLM
191
193
0
28 Nov 2022
Fast Sampling of Diffusion Models via Operator Learning
Fast Sampling of Diffusion Models via Operator LearningInternational Conference on Machine Learning (ICML), 2022
Hongkai Zheng
Weili Nie
Arash Vahdat
Kamyar Azizzadenesheli
Anima Anandkumar
DiffM
317
180
0
24 Nov 2022
Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors
Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors
R. Burgert
Kanchana Ranasinghe
Xiang Li
Michael S. Ryoo
DiffMVLM
234
41
0
23 Nov 2022
Diffusion Denoising Process for Perceptron Bias in Out-of-distribution
  Detection
Diffusion Denoising Process for Perceptron Bias in Out-of-distribution Detection
Luping Liu
Yi Ren
Xize Cheng
Rongjie Huang
Chongxuan Li
Zhou Zhao
145
7
0
21 Nov 2022
Robust Vocal Quality Feature Embeddings for Dysphonic Voice Detection
Robust Vocal Quality Feature Embeddings for Dysphonic Voice DetectionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Jianwei Zhang
J. Liss
Suren Jayasuriya
Visar Berisha
165
8
0
17 Nov 2022
InstructPix2Pix: Learning to Follow Image Editing Instructions
InstructPix2Pix: Learning to Follow Image Editing InstructionsComputer Vision and Pattern Recognition (CVPR), 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
508
2,399
0
17 Nov 2022
Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion
  Models
Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion ModelsACM Transactions on Graphics (TOG), 2022
Simon Alexanderson
Rajmund Nagy
Jonas Beskow
G. Henter
DiffMVGen
249
220
0
17 Nov 2022
EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label
  Guidance
EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label GuidanceIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Yiwei Guo
Chenpeng Du
Xie Chen
K. Yu
DiffM
195
55
0
17 Nov 2022
Challenges in creative generative models for music: a divergence
  maximization perspective
Challenges in creative generative models for music: a divergence maximization perspective
Axel Chemla-Romeu-Santos
P. Esling
223
4
0
16 Nov 2022
Versatile Diffusion: Text, Images and Variations All in One Diffusion
  Model
Versatile Diffusion: Text, Images and Variations All in One Diffusion ModelIEEE International Conference on Computer Vision (ICCV), 2022
Xingqian Xu
Zinan Lin
Eric Zhang
Kai Wang
Humphrey Shi
DiffM
435
238
0
15 Nov 2022
Diffusion Models for Medical Image Analysis: A Comprehensive Survey
Diffusion Models for Medical Image Analysis: A Comprehensive Survey
Amirhossein Kazerouni
Ehsan Khodapanah Aghdam
Moein Heidari
Reza Azad
Mohsen Fayyaz
Ilker Hacihaliloglu
Dorit Merhof
DiffMMedIm
378
534
0
14 Nov 2022
Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image
  Generation
Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image GenerationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Zhihong Pan
Xiaoxia Zhou
Hao Tian
DiffM
129
13
0
14 Nov 2022
DriftRec: Adapting diffusion models to blind JPEG restoration
DriftRec: Adapting diffusion models to blind JPEG restorationIEEE Transactions on Image Processing (IEEE TIP), 2022
Simon Welker
H. Chapman
Timo Gerkmann
DiffM
175
23
0
12 Nov 2022
Few-shot Image Generation with Diffusion Models
Few-shot Image Generation with Diffusion Models
Jin Zhu
Huimin Ma
Jiansheng Chen
Jian Yuan
DiffM
272
28
0
07 Nov 2022
Modeling Temporal Data as Continuous Functions with Stochastic Process
  Diffusion
Modeling Temporal Data as Continuous Functions with Stochastic Process DiffusionInternational Conference on Machine Learning (ICML), 2022
Marin Bilos
Kashif Rasul
Anderson Schneider
Yuriy Nevmyvaka
Stephan Günnemann
DiffM
241
46
0
04 Nov 2022
Cold Diffusion for Speech Enhancement
Cold Diffusion for Speech EnhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Hao Yen
François Germain
Gordon Wichern
Jonathan Le Roux
DiffM
289
54
0
04 Nov 2022
NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for
  Noise-robust Expressive TTS
NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTSInterspeech (Interspeech), 2022
Dongchao Yang
Songxiang Liu
Jianwei Yu
Helin Wang
Chao Weng
Yuexian Zou
DiffMVLM
145
22
0
04 Nov 2022
An optimal control perspective on diffusion-based generative modeling
An optimal control perspective on diffusion-based generative modeling
Julius Berner
Lorenz Richter
Karen Ullrich
DiffM
395
123
0
02 Nov 2022
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert
  Denoisers
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
Yogesh Balaji
Seungjun Nah
Xun Huang
Arash Vahdat
Jiaming Song
...
Timo Aila
S. Laine
Bryan Catanzaro
Tero Karras
Xuan Li
VLMMoE
485
965
0
02 Nov 2022
DSPGAN: a GAN-based universal vocoder for high-fidelity TTS by
  time-frequency domain supervision from DSP
DSPGAN: a GAN-based universal vocoder for high-fidelity TTS by time-frequency domain supervision from DSPIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Kun Song
Yongmao Zhang
Yinjiao Lei
Jian Cong
Hanzhao Li
Linfu Xie
Gang He
Jinfeng Bai
150
22
0
02 Nov 2022
Concrete Score Matching: Generalized Score Matching for Discrete Data
Concrete Score Matching: Generalized Score Matching for Discrete DataNeural Information Processing Systems (NeurIPS), 2022
Chenlin Meng
Kristy Choi
Jiaming Song
Stefano Ermon
DiffM
483
103
0
02 Nov 2022
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for
  Text Generation and Modular Control
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular ControlAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Xiaochuang Han
Sachin Kumar
Yulia Tsvetkov
290
131
0
31 Oct 2022
Guided Conditional Diffusion for Controllable Traffic Simulation
Guided Conditional Diffusion for Controllable Traffic SimulationIEEE International Conference on Robotics and Automation (ICRA), 2022
Ziyuan Zhong
Davis Rempe
Danfei Xu
Yuxiao Chen
Sushant Veer
Tong Che
Baishakhi Ray
Marco Pavone
239
210
0
31 Oct 2022
Robust MelGAN: A robust universal neural vocoder for high-fidelity TTS
Robust MelGAN: A robust universal neural vocoder for high-fidelity TTSInternational Symposium on Chinese Spoken Language Processing (ISCSLP), 2022
Kun Song
Jian Cong
Xinsheng Wang
Yongmao Zhang
Linfu Xie
Ning Jiang
Haiying Wu
128
0
0
31 Oct 2022
Diffusion-based Generative Speech Source Separation
Diffusion-based Generative Speech Source SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Robin Scheibler
Youna Ji
Soo-Whan Chung
J. Byun
Soyeon Choe
Min-Seok Choi
DiffM
318
60
0
31 Oct 2022
SRTNet: Time Domain Speech Enhancement Via Stochastic Refinement
SRTNet: Time Domain Speech Enhancement Via Stochastic RefinementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zhibin Qiu
Mengfan Fu
Yinfeng Yu
Lili Yin
Gang Hua
Hao-Ming Huang
DiffM
219
22
0
30 Oct 2022
Conditioning and Sampling in Variational Diffusion Models for Speech
  Super-Resolution
Conditioning and Sampling in Variational Diffusion Models for Speech Super-ResolutionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Chin-Yun Yu
Sung-Lin Yeh
Gyorgy Fazekas
Hao Tang
DiffM
119
31
0
27 Oct 2022
Source-Filter HiFi-GAN: Fast and Pitch Controllable High-Fidelity Neural
  Vocoder
Source-Filter HiFi-GAN: Fast and Pitch Controllable High-Fidelity Neural VocoderIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Reo Yoneyama
Yi-Chiao Wu
Tomoki Toda
197
35
0
27 Oct 2022
Solving Audio Inverse Problems with a Diffusion Model
Solving Audio Inverse Problems with a Diffusion ModelIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Eloi Moliner
J. Lehtinen
Vesa Valimaki
DiffM
287
73
0
27 Oct 2022
Full-band General Audio Synthesis with Score-based Diffusion
Full-band General Audio Synthesis with Score-based DiffusionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Santiago Pascual
Gautam Bhattacharya
Chunghsin Yeh
Jordi Pons
Joan Serrà
DiffM
181
39
0
26 Oct 2022
Structure-based Drug Design with Equivariant Diffusion Models
Structure-based Drug Design with Equivariant Diffusion ModelsNature Computational Science (Nat. Comput. Sci.), 2022
Arne Schneuing
Yuanqi Du
Charles Harris
Arian R. Jamasb
Ilia Igashov
...
Pietro Lio
Daniel Schwalbe-Koda
Max Welling
Michael M. Bronstein
B. Correia
DiffM
316
325
0
24 Oct 2022
Deep Equilibrium Approaches to Diffusion Models
Deep Equilibrium Approaches to Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2022
Ashwini Pokle
Zhengyang Geng
Zico Kolter
DiffM
254
48
0
23 Oct 2022
Diffusion Motion: Generate Text-Guided 3D Human Motion by Diffusion
  Model
Diffusion Motion: Generate Text-Guided 3D Human Motion by Diffusion ModelIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zhiyuan Ren
Zhihong Pan
Xingfa Zhou
Le Kang
VGenDiffM
256
50
0
22 Oct 2022
Score-based Denoising Diffusion with Non-Isotropic Gaussian Noise Models
Score-based Denoising Diffusion with Non-Isotropic Gaussian Noise Models
Vikram S. Voleti
Christopher Pal
Adam M. Oberman
DiffM
216
23
0
21 Oct 2022
Boomerang: Local sampling on image manifolds using diffusion models
Boomerang: Local sampling on image manifolds using diffusion models
Lorenzo Luzi
P. Mayer
Josue Casco-Rodriguez
Ali Siahkoohi
Richard G. Baraniuk
DiffM
290
21
0
21 Oct 2022
Robust One-Shot Singing Voice Conversion
Robust One-Shot Singing Voice Conversion
Naoya Takahashi
M. Singh
Yuki Mitsufuji
DiffM
237
9
0
20 Oct 2022
Differentially Private Diffusion Models
Differentially Private Diffusion Models
Tim Dockhorn
Tianshi Cao
Arash Vahdat
Karsten Kreis
DiffM
362
125
0
18 Oct 2022
TorchDIVA: An Extensible Computational Model of Speech Production built
  on an Open-Source Machine Learning Library
TorchDIVA: An Extensible Computational Model of Speech Production built on an Open-Source Machine Learning LibraryPLoS ONE (PLoS ONE), 2022
Sean M. Kinahan
J. Liss
Visar Berisha
48
2
0
17 Oct 2022
DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
DiffuSeq: Sequence to Sequence Text Generation with Diffusion ModelsInternational Conference on Learning Representations (ICLR), 2022
Shansan Gong
Mukai Li
Jiangtao Feng
Zhiyong Wu
Lingpeng Kong
334
436
0
17 Oct 2022
Previous
123...181920212223
Next