ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.09761
  4. Cited By
DiffWave: A Versatile Diffusion Model for Audio Synthesis
v1v2v3 (latest)

DiffWave: A Versatile Diffusion Model for Audio Synthesis

International Conference on Learning Representations (ICLR), 2020
21 September 2020
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
    DiffMBDL
ArXiv (abs)PDFHTML

Papers citing "DiffWave: A Versatile Diffusion Model for Audio Synthesis"

50 / 1,134 papers shown
On the Design Fundamentals of Diffusion Models: A Survey
On the Design Fundamentals of Diffusion Models: A SurveyPattern Recognition (Pattern Recogn.), 2023
Ziyi Chang
George Alex Koulieris
Hyung Jin Chang
Hubert P. H. Shum
DiffM
637
80
0
07 Jun 2023
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive
  Bias
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias
Ziyue Jiang
Yi Ren
Zhe Ye
Jinglin Liu
Chen Zhang
...
Rongjie Huang
Chunfeng Wang
Xiang Yin
Zejun Ma
Zhou Zhao
DiffM
262
96
0
06 Jun 2023
LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading
LipVoicer: Generating Speech from Silent Videos Guided by Lip ReadingInternational Conference on Learning Representations (ICLR), 2023
Yochai Yemini
Aviv Shamsian
Lior Bracha
Sharon Gannot
Ethan Fetaya
DiffM
330
22
0
05 Jun 2023
Detector Guidance for Multi-Object Text-to-Image Generation
Detector Guidance for Multi-Object Text-to-Image Generation
Luping Liu
Zijian Zhang
Yi Ren
Rongjie Huang
Xiang Yin
Zhou Zhao
DiffM
180
12
0
04 Jun 2023
Conditional Generation from Unconditional Diffusion Models using
  Denoiser Representations
Conditional Generation from Unconditional Diffusion Models using Denoiser Representations
Alexandros Graikos
Srikar Yellapragada
Dimitris Samaras
DiffMAI4CE
188
7
0
02 Jun 2023
DiffECG: A Versatile Probabilistic Diffusion Model for ECG Signals
  Synthesis
DiffECG: A Versatile Probabilistic Diffusion Model for ECG Signals SynthesisInternational Conference on Software Engineering Research and Applications (ICSERA), 2023
Nour Neifar
A. Ben-Hamadou
Afef Mdhaffar
M. Jmaiel
DiffM
383
18
0
02 Jun 2023
Denoising Diffusion Semantic Segmentation with Mask Prior Modeling
Denoising Diffusion Semantic Segmentation with Mask Prior Modeling
Zeqiang Lai
Yuchen Duan
Jifeng Dai
Ziheng Li
Ying Fu
Jiaming Song
Yu Qiao
Wen Wang
DiffM
197
24
0
02 Jun 2023
UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion
  Model
UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion ModelInterspeech (Interspeech), 2023
A. Iashchenko
Pavel Andreev
Ivan Shchekotov
Nicholas Babaev
Dmitry Vetrov
DiffM
339
6
0
01 Jun 2023
Addressing Negative Transfer in Diffusion Models
Addressing Negative Transfer in Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2023
Hyojun Go
Jinyoung Kim
Yunsung Lee
Seunghyun Lee
Shinhyeok Oh
Hyeongdon Moon
Seungtaek Choi
DiffMVLM
547
31
0
01 Jun 2023
A Geometric Perspective on Diffusion Models
A Geometric Perspective on Diffusion Models
Defang Chen
Zhenyu Zhou
Jianhan Mei
Chunhua Shen
Chun-Yen Chen
C. Wang
DiffM
212
21
0
31 May 2023
Spontaneous Symmetry Breaking in Generative Diffusion Models
Spontaneous Symmetry Breaking in Generative Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2023
G. Raya
Luca Ambrogioni
DiffM
307
52
0
31 May 2023
Unsupervised Statistical Feature-Guided Diffusion Model for Sensor-based
  Human Activity Recognition
Unsupervised Statistical Feature-Guided Diffusion Model for Sensor-based Human Activity Recognition
Si Zuo
Vitor Fortes Rey
Sungho Suh
S. Sigg
P. Lukowicz
DiffM
250
5
0
30 May 2023
Nested Diffusion Processes for Anytime Image Generation
Nested Diffusion Processes for Anytime Image GenerationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Noam Elata
Bahjat Kawar
T. Michaeli
Michael Elad
DiffM
260
7
0
30 May 2023
Diffusion-Stego: Training-free Diffusion Generative Steganography via
  Message Projection
Diffusion-Stego: Training-free Diffusion Generative Steganography via Message ProjectionInformation Sciences (Inf. Sci.), 2023
Daegyu Kim
Chaehun Shin
Jooyoung Choi
Dahuin Jung
Sung-Hoon Yoon
DiffM
282
21
0
30 May 2023
Learning to Jump: Thinning and Thickening Latent Counts for Generative
  Modeling
Learning to Jump: Thinning and Thickening Latent Counts for Generative ModelingInternational Conference on Machine Learning (ICML), 2023
Tianqi Chen
Mingyuan Zhou
DiffM
188
11
0
28 May 2023
Functional Flow Matching
Functional Flow MatchingInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Gavin Kerrigan
Giosue Migliorini
Padhraic Smyth
368
31
0
26 May 2023
An Efficient Membership Inference Attack for the Diffusion Model by
  Proximal Initialization
An Efficient Membership Inference Attack for the Diffusion Model by Proximal InitializationInternational Conference on Learning Representations (ICLR), 2023
Fei Kong
Jinhao Duan
Ruipeng Ma
Hengtao Shen
Xiao-lan Zhu
Xiaoshuang Shi
Kaidi Xu
DiffM
209
47
0
26 May 2023
DiffusionNAG: Predictor-guided Neural Architecture Generation with
  Diffusion Models
DiffusionNAG: Predictor-guided Neural Architecture Generation with Diffusion ModelsInternational Conference on Learning Representations (ICLR), 2023
Sohyun An
Hayeon Lee
Jaehyeong Jo
Seanie Lee
Sung Ju Hwang
DiffM
514
16
0
26 May 2023
Diverse and Expressive Speech Prosody Prediction with Denoising
  Diffusion Probabilistic Model
Diverse and Expressive Speech Prosody Prediction with Denoising Diffusion Probabilistic ModelInterspeech (Interspeech), 2023
Xiang Li
Songxiang Liu
Max W. Y. Lam
Zhiyong Wu
Chao Weng
Helen Meng
DiffM
221
5
0
26 May 2023
DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion
  Models
DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2023
Ying Fan
Olivia Watkins
Yuqing Du
Hao Liu
Moonkyung Ryu
Craig Boutilier
Pieter Abbeel
Mohammad Ghavamzadeh
Kangwook Lee
Kimin Lee
417
287
0
25 May 2023
Non-adversarial training of Neural SDEs with signature kernel scores
Non-adversarial training of Neural SDEs with signature kernel scoresNeural Information Processing Systems (NeurIPS), 2023
Zacharia Issa
Blanka Horvath
M. Lemercier
C. Salvi
AI4TS
321
39
0
25 May 2023
Trans-Dimensional Generative Modeling via Jump Diffusion Models
Trans-Dimensional Generative Modeling via Jump Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2023
Andrew Campbell
William Harvey
Christian D. Weilbach
Valentin De Bortoli
Tom Rainforth
Arnaud Doucet
DiffM
240
25
0
25 May 2023
DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled
  Representation and Prior Mixup for Verified Robust Voice Conversion
DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice ConversionAAAI Conference on Artificial Intelligence (AAAI), 2023
Haram Choi
Sang-Hoon Lee
Seong-Whan Lee
DiffM
147
58
0
25 May 2023
Efficient Neural Music Generation
Efficient Neural Music GenerationNeural Information Processing Systems (NeurIPS), 2023
Max W. Y. Lam
Qiao Tian
Tang-Chun Li
Zongyu Yin
Siyuan Feng
...
Mingbo Ma
Xuchen Song
Jitong Chen
Yuping Wang
Yuxuan Wang
DiffMMGen
251
83
0
25 May 2023
David helps Goliath: Inference-Time Collaboration Between Small
  Specialized and Large General Diffusion LMs
David helps Goliath: Inference-Time Collaboration Between Small Specialized and Large General Diffusion LMsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
Xiaochuang Han
Sachin Kumar
Yulia Tsvetkov
Marjan Ghazvininejad
DiffM
282
5
0
24 May 2023
Revisit and Outstrip Entity Alignment: A Perspective of Generative
  Models
Revisit and Outstrip Entity Alignment: A Perspective of Generative ModelsInternational Conference on Learning Representations (ICLR), 2023
Lingbing Guo
Zhuo Chen
Jiaoyan Chen
Yin Fang
Wen Zhang
Huajun Chen
184
15
0
24 May 2023
Improved Convergence of Score-Based Diffusion Models via
  Prediction-Correction
Improved Convergence of Score-Based Diffusion Models via Prediction-Correction
Francesco Pedrotti
J. Maas
Marco Mondelli
DiffM
316
21
0
23 May 2023
FluentSpeech: Stutter-Oriented Automatic Speech Editing with
  Context-Aware Diffusion Models
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Ziyue Jiang
Qiang Yang
Jia-li Zuo
Zhe Ye
Rongjie Huang
Yixiang Ren
Zhou Zhao
DiffM
161
29
0
23 May 2023
VDT: General-purpose Video Diffusion Transformers via Mask Modeling
VDT: General-purpose Video Diffusion Transformers via Mask ModelingInternational Conference on Learning Representations (ICLR), 2023
Haoyu Lu
Guoxing Yang
Nanyi Fei
Yuqi Huo
Zhiwu Lu
Ping Luo
Mingyu Ding
DiffMVGen
226
100
0
22 May 2023
DiffusionNER: Boundary Diffusion for Named Entity Recognition
DiffusionNER: Boundary Diffusion for Named Entity RecognitionAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Yongliang Shen
Kaitao Song
Xuejiao Tan
Dongsheng Li
Weiming Lu
Yueting Zhuang
DiffM
285
58
0
22 May 2023
GSURE-Based Diffusion Model Training with Corrupted Data
GSURE-Based Diffusion Model Training with Corrupted Data
Bahjat Kawar
Noam Elata
T. Michaeli
Michael Elad
DiffM
466
39
0
22 May 2023
AudioToken: Adaptation of Text-Conditioned Diffusion Models for
  Audio-to-Image Generation
AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation
Guy Yariv
Itai Gat
Lior Wolf
Yossi Adi
Idan Schwartz
DiffM
262
28
0
22 May 2023
DiffAVA: Personalized Text-to-Audio Generation with Visual Alignment
DiffAVA: Personalized Text-to-Audio Generation with Visual Alignment
Shentong Mo
Jing Shi
Yapeng Tian
139
18
0
22 May 2023
NAS-FM: Neural Architecture Search for Tunable and Interpretable Sound
  Synthesis based on Frequency Modulation
NAS-FM: Neural Architecture Search for Tunable and Interpretable Sound Synthesis based on Frequency ModulationInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Zhe Ye
Wei Xue
Xuejiao Tan
Qi-fei Liu
Yi-Ting Guo
181
3
0
22 May 2023
Duplex Diffusion Models Improve Speech-to-Speech Translation
Duplex Diffusion Models Improve Speech-to-Speech TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Xianchao Wu
DiffM
223
5
0
22 May 2023
Guided Motion Diffusion for Controllable Human Motion Synthesis
Guided Motion Diffusion for Controllable Human Motion SynthesisIEEE International Conference on Computer Vision (ICCV), 2023
Korrawe Karunratanakul
Konpat Preechakul
Supasorn Suwajanakorn
Siyu Tang
DiffM
427
205
0
21 May 2023
Towards Consistent Stochastic Human Motion Prediction via Motion
  Diffusion
Towards Consistent Stochastic Human Motion Prediction via Motion DiffusionEuropean Conference on Computer Vision (ECCV), 2023
Jiarui Sun
Girish Chowdhary
DiffM
247
17
0
21 May 2023
Spatio-temporal Diffusion Point Processes
Spatio-temporal Diffusion Point ProcessesKnowledge Discovery and Data Mining (KDD), 2023
Yuan Yuan
Jingtao Ding
Chenyang Shao
Depeng Jin
Yong Li
DiffM
240
65
0
21 May 2023
DiffCap: Exploring Continuous Diffusion on Image Captioning
DiffCap: Exploring Continuous Diffusion on Image Captioning
Yufeng He
Zefan Cai
Xu Gan
Baobao Chang
DiffM
205
11
0
20 May 2023
Incomplete Multi-view Clustering via Diffusion Completion
Incomplete Multi-view Clustering via Diffusion Completion
Sifan Fang
DiffM
172
10
0
19 May 2023
Data Redaction from Conditional Generative Models
Data Redaction from Conditional Generative Models
Zhifeng Kong
Kamalika Chaudhuri
KELM
209
8
0
18 May 2023
Blackout Diffusion: Generative Diffusion Models in Discrete-State Spaces
Blackout Diffusion: Generative Diffusion Models in Discrete-State SpacesInternational Conference on Machine Learning (ICML), 2023
Javier E. Santos
Z. Fox
Nicholas Lubbers
Yen Ting Lin
DiffM
261
23
0
18 May 2023
FastFit: Towards Real-Time Iterative Neural Vocoder by Replacing U-Net
  Encoder With Multiple STFTs
FastFit: Towards Real-Time Iterative Neural Vocoder by Replacing U-Net Encoder With Multiple STFTsInterspeech (Interspeech), 2023
Won Jang
D. Lim
Heayoung Park
198
1
0
18 May 2023
Catch-Up Distillation: You Only Need to Train Once for Accelerating
  Sampling
Catch-Up Distillation: You Only Need to Train Once for Accelerating Sampling
Shitong Shao
Xu Dai
Shouyi Yin
Lujun Li
Huanran Chen
Yang Hu
381
22
0
18 May 2023
Controllable Mind Visual Diffusion Model
Controllable Mind Visual Diffusion ModelAAAI Conference on Artificial Intelligence (AAAI), 2023
Bo-Wen Zeng
Shanglin Li
Xuhui Liu
Sicheng Gao
Xiaolong Jiang
Xu Tang
Feng-Long Xie
Jianzhuang Liu
Baochang Zhang
DiffM
229
38
0
17 May 2023
Discrete Diffusion Probabilistic Models for Symbolic Music Generation
Discrete Diffusion Probabilistic Models for Symbolic Music GenerationInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Matthias Plasser
S. Peter
Gerhard Widmer
DiffMMGen
176
20
0
16 May 2023
TESS: Text-to-Text Self-Conditioned Simplex Diffusion
TESS: Text-to-Text Self-Conditioned Simplex DiffusionConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Rabeeh Karimi Mahabadi
Michal Guerquin
Jaesung Tae
James Henderson
Iz Beltagy
Matthew E. Peters
Arman Cohan
300
45
0
15 May 2023
Learn to Sing by Listening: Building Controllable Virtual Singer by
  Unsupervised Learning from Voice Recordings
Learn to Sing by Listening: Building Controllable Virtual Singer by Unsupervised Learning from Voice Recordings
Wei Xue
Yiwen Wang
Qi-fei Liu
Yi-Ting Guo
183
1
0
09 May 2023
Can Diffusion Model Achieve Better Performance in Text Generation?
  Bridging the Gap between Training and Inference!
Can Diffusion Model Achieve Better Performance in Text Generation? Bridging the Gap between Training and Inference!Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Zecheng Tang
Pinzheng Wang
Keyan Zhou
Juntao Li
Ziqiang Cao
Hao Fei
DiffM
212
15
0
08 May 2023
A Variational Perspective on Solving Inverse Problems with Diffusion
  Models
A Variational Perspective on Solving Inverse Problems with Diffusion ModelsInternational Conference on Learning Representations (ICLR), 2023
Morteza Mardani
Jiaming Song
Jan Kautz
Arash Vahdat
DiffM
323
206
0
07 May 2023
Previous
123...151617...212223
Next
Page 16 of 23
Pageof 23