ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.09761
  4. Cited By
DiffWave: A Versatile Diffusion Model for Audio Synthesis
v1v2v3 (latest)

DiffWave: A Versatile Diffusion Model for Audio Synthesis

International Conference on Learning Representations (ICLR), 2020
21 September 2020
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
    DiffMBDL
ArXiv (abs)PDFHTML

Papers citing "DiffWave: A Versatile Diffusion Model for Audio Synthesis"

50 / 1,135 papers shown
Revisiting Energy Based Models as Policies: Ranking Noise Contrastive
  Estimation and Interpolating Energy Models
Revisiting Energy Based Models as Policies: Ranking Noise Contrastive Estimation and Interpolating Energy Models
Sumeet Singh
Stephen Tu
Vikas Sindhwani
DiffM
285
11
0
11 Sep 2023
Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio
  Representation
Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio RepresentationInternational Conference on Multimodal Interaction (ICMI), 2023
Anna Deichler
Shivam Mehta
Simon Alexanderson
Jonas Beskow
DiffM
227
30
0
11 Sep 2023
Discrete Denoising Diffusion Approach to Integer Factorization
Discrete Denoising Diffusion Approach to Integer FactorizationInternational Conference on Artificial Neural Networks (ICANN), 2023
Kārlis Freivalds
Emīls Ozoliņš
Guntis Barzdins
DiffM
143
1
0
11 Sep 2023
Variations and Relaxations of Normalizing Flows
Variations and Relaxations of Normalizing Flows
Keegan Kelly
Lorena Piedras
Sukrit Rao
David Samuel Roth
BDL
275
2
0
08 Sep 2023
Matcha-TTS: A fast TTS architecture with conditional flow matching
Matcha-TTS: A fast TTS architecture with conditional flow matchingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Shivam Mehta
Ruibo Tu
Jonas Beskow
Éva Székely
G. Henter
312
179
0
06 Sep 2023
BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial
  Network
BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial NetworkIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Takashi Shibuya
Yuhta Takida
Yuki Mitsufuji
272
16
0
06 Sep 2023
sasdim: self-adaptive noise scaling diffusion model for spatial time
  series imputation
sasdim: self-adaptive noise scaling diffusion model for spatial time series imputationInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Shunyang Zhang
Senzhang Wang
Xianzhen Tan
Ruochen Liu
Jian Zhang
Jianxin Wang
177
10
0
05 Sep 2023
DiffHPE: Robust, Coherent 3D Human Pose Lifting with Diffusion
DiffHPE: Robust, Coherent 3D Human Pose Lifting with Diffusion
Cédric Rommel
Eduardo Valle
Mickaël Chen
Souhaiel Khalfaoui
Renaud Marlet
Matthieu Cord
Patrick Pérez
224
18
0
04 Sep 2023
FinDiff: Diffusion Models for Financial Tabular Data Generation
FinDiff: Diffusion Models for Financial Tabular Data GenerationInternational Conference on AI in Finance (ICAF), 2023
Timur Sattarov
Marco Schreyer
Damian Borth
DiffM
199
61
0
04 Sep 2023
NADiffuSE: Noise-aware Diffusion-based Model for Speech Enhancement
NADiffuSE: Noise-aware Diffusion-based Model for Speech EnhancementAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2023
Wen Wang
Dongchao Yang
Qichen Ye
Bowen Cao
Yuexian Zou
DiffM
290
4
0
03 Sep 2023
Diffusion Models with Deterministic Normalizing Flow Priors
Diffusion Models with Deterministic Normalizing Flow Priors
Mohsen Zand
Ali Etemad
Michael A. Greenspan
DiffM
380
4
0
03 Sep 2023
PathLDM: Text conditioned Latent Diffusion Model for Histopathology
PathLDM: Text conditioned Latent Diffusion Model for HistopathologyIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Srikar Yellapragada
Alexandros Graikos
Prateek Prasanna
Tahsin M. Kurc
Joel H. Saltz
Dimitris Samaras
AI4CE
365
56
0
01 Sep 2023
VideoGen: A Reference-Guided Latent Diffusion Approach for High
  Definition Text-to-Video Generation
VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation
Xin Li
Wenqing Chu
Ye Wu
Weihang Yuan
Fanglong Liu
Tao Gui
Fu Li
Haocheng Feng
Errui Ding
Jingdong Wang
VGen
332
70
0
01 Sep 2023
LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-Speech
LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-SpeechIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Jing Chen
Xingcheng Song
Zhendong Peng
Binbin Zhang
Fuping Pan
Zhiyong Wu
DiffM
163
24
0
31 Aug 2023
A Review of Differentiable Digital Signal Processing for Music & Speech
  Synthesis
A Review of Differentiable Digital Signal Processing for Music & Speech SynthesisFrontiers in Signal Processing (FSP), 2023
B. Hayes
Jordie Shier
Gyorgy Fazekas
Andrew Mcpherson
C. Saitis
240
41
0
29 Aug 2023
Elucidating the Exposure Bias in Diffusion Models
Elucidating the Exposure Bias in Diffusion ModelsInternational Conference on Learning Representations (ICLR), 2023
Mang Ning
Mingxiao Li
Jianlin Su
A. A. Salah
Itir Onal Ertugrul
DiffM
521
73
0
29 Aug 2023
C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion
  Model
C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model
Longbin Ji
Pengfei Wei
Yi Ren
Jinglin Liu
Chen Zhang
Xiang Yin
DiffM
149
3
0
29 Aug 2023
Transfusor: Transformer Diffusor for Controllable Human-like Generation
  of Vehicle Lane Changing Trajectories
Transfusor: Transformer Diffusor for Controllable Human-like Generation of Vehicle Lane Changing Trajectories
Jiqian Dong
Sikai Chen
Samuel Labi
155
2
0
28 Aug 2023
Voice Conversion with Denoising Diffusion Probabilistic GAN Models
Voice Conversion with Denoising Diffusion Probabilistic GAN ModelsInternational Conference on Advanced Data Mining and Applications (ADMA), 2023
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
DiffM
150
8
0
28 Aug 2023
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
AI-Generated Content (AIGC) for Various Data Modalities: A SurveyACM Computing Surveys (ACM Comput. Surv.), 2023
Lin Geng Foo
Hossein Rahmani
Jing Liu
770
49
0
27 Aug 2023
DiffI2I: Efficient Diffusion Model for Image-to-Image Translation
DiffI2I: Efficient Diffusion Model for Image-to-Image TranslationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Bin Xia
Yulun Zhang
Shiyin Wang
Yitong Wang
Xing Wu
Yapeng Tian
Wenming Yang
Radu Timotfe
Luc Van Gool
DiffMVLM
242
22
0
26 Aug 2023
Exploiting Time-Frequency Conformers for Music Audio Enhancement
Exploiting Time-Frequency Conformers for Music Audio EnhancementACM Multimedia (ACM MM), 2023
Yunkee Chae
Junghyun Koo
Sungho Lee
Kyogu Lee
181
8
0
24 Aug 2023
Audio Generation with Multiple Conditional Diffusion Model
Audio Generation with Multiple Conditional Diffusion ModelAAAI Conference on Artificial Intelligence (AAAI), 2023
Zhifang Guo
Jianguo Mao
Ruijie Tao
Long Yan
Kazushige Ouchi
Hong Liu
Xiangdong Wang
DiffM
350
31
0
23 Aug 2023
Shape-conditioned 3D Molecule Generation via Equivariant Diffusion
  Models
Shape-conditioned 3D Molecule Generation via Equivariant Diffusion Models
Ziqi Chen
Bo Peng
Srinivas Parthasarathy
Xia Ning
DiffM
318
17
0
23 Aug 2023
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
Jiasheng Ye
Zaixiang Zheng
Yu Bao
Lihua Qian
Quanquan Gu
DiffM
647
34
0
23 Aug 2023
Convergence guarantee for consistency models
Convergence guarantee for consistency models
Junlong Lyu
Zhitang Chen
Shoubo Feng
DiffM
155
5
0
22 Aug 2023
Fast Inference and Update of Probabilistic Density Estimation on
  Trajectory Prediction
Fast Inference and Update of Probabilistic Density Estimation on Trajectory PredictionIEEE International Conference on Computer Vision (ICCV), 2023
Takahiro Maeda
Norimichi Ukita
241
46
0
17 Aug 2023
Enhancing Phrase Representation by Information Bottleneck Guided Text
  Diffusion Process for Keyphrase Extraction
Enhancing Phrase Representation by Information Bottleneck Guided Text Diffusion Process for Keyphrase ExtractionInternational Conference on Language Resources and Evaluation (LREC), 2023
Yuanzhen Luo
Qingyu Zhou
F. Zhou
DiffM
211
3
0
17 Aug 2023
DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided
  Speaker Embedding
DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker EmbeddingIEEE International Conference on Computer Vision (ICCV), 2023
J. Choi
Joanna Hong
Y. Ro
DiffM
192
31
0
15 Aug 2023
iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using
  1D-2D CNN
iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNNInterspeech (Interspeech), 2023
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Shogo Seki
166
8
0
14 Aug 2023
ModelScope Text-to-Video Technical Report
ModelScope Text-to-Video Technical Report
Jiuniu Wang
Hangjie Yuan
Dayou Chen
Yingya Zhang
Xiang Wang
Shiwei Zhang
VGenDiffM
348
615
0
12 Aug 2023
Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model
Fan Zhang
Naye Ji
Fuxing Gao
Siyuan Zhao
Zhaohan Wang
Shunman Li
249
0
0
11 Aug 2023
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised
  Pretraining
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
Haohe Liu
Yiitan Yuan
Xubo Liu
Xinhao Mei
Qiuqiang Kong
Qiao Tian
Yuping Wang
Wenwu Wang
Yuxuan Wang
Mark D. Plumbley
DiffM
356
387
0
10 Aug 2023
On Error Propagation of Diffusion Models
On Error Propagation of Diffusion ModelsInternational Conference on Learning Representations (ICLR), 2023
Yangming Li
M. Schaar
DiffM
259
25
0
09 Aug 2023
JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models
JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion ModelsConference on Algebraic Informatics (CAI), 2023
Peike Li
Bo-Yu Chen
Yao Yao
Yikai Wang
Allen Wang
Alex Jinpeng Wang
MGenVLMDiffM
672
53
0
09 Aug 2023
From Unimodal to Multimodal: improving sEMG-Based Pattern Recognition
  via deep generative models
From Unimodal to Multimodal: improving sEMG-Based Pattern Recognition via deep generative models
Wentao Wei
Linyan Ren
109
2
0
08 Aug 2023
Diffusion Model in Causal Inference with Unmeasured Confounders
Diffusion Model in Causal Inference with Unmeasured ConfoundersIEEE Symposium Series on Computational Intelligence (IEEE-SSCI), 2023
Tatsuhiro Shimizu
DiffM
270
6
0
07 Aug 2023
DiffDance: Cascaded Human Motion Diffusion Model for Dance Generation
DiffDance: Cascaded Human Motion Diffusion Model for Dance GenerationACM Multimedia (ACM MM), 2023
Qiaosong Qi
Le Zhuo
Aixi Zhang
Yue Liao
Fei Fang
Si Liu
Shuicheng Yan
223
38
0
05 Aug 2023
Improved Order Analysis and Design of Exponential Integrator for
  Diffusion Models Sampling
Improved Order Analysis and Design of Exponential Integrator for Diffusion Models Sampling
Qinsheng Zhang
Jiaming Song
Yongxin Chen
DiffM
188
16
0
04 Aug 2023
Synthesizing Long-Term Human Motions with Diffusion Models via Coherent
  Sampling
Synthesizing Long-Term Human Motions with Diffusion Models via Coherent SamplingACM Multimedia (ACM MM), 2023
Zhaohui Yang
Fuchun Sun
Ji-Rong Wen
DiffM
244
20
0
03 Aug 2023
MusicLDM: Enhancing Novelty in Text-to-Music Generation Using
  Beat-Synchronous Mixup Strategies
MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup StrategiesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Kai Chen
Yusong Wu
Haohe Liu
Marianna Nezhurina
Taylor Berg-Kirkpatrick
Shlomo Dubnov
DiffM
253
128
0
03 Aug 2023
From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion
From Discrete Tokens to High-Fidelity Audio Using Multi-Band DiffusionNeural Information Processing Systems (NeurIPS), 2023
Robin San Roman
Yossi Adi
Antoine Deleforge
Romain Serizel
Gabriel Synnaeve
Alexandre Défossez
DiffM
255
37
0
02 Aug 2023
DAVIS: High-Quality Audio-Visual Separation with Generative Diffusion
  Models
DAVIS: High-Quality Audio-Visual Separation with Generative Diffusion ModelsAsian Conference on Computer Vision (ACCV), 2023
Chao Huang
Susan Liang
Yapeng Tian
Anurag Kumar
Chenliang Xu
DiffM
183
4
0
31 Jul 2023
Image Synthesis under Limited Data: A Survey and Taxonomy
Image Synthesis under Limited Data: A Survey and TaxonomyInternational Journal of Computer Vision (IJCV), 2023
Mengping Yang
Zhe Wang
241
16
0
31 Jul 2023
A Novel DDPM-based Ensemble Approach for Energy Theft Detection in Smart
  Grids
A Novel DDPM-based Ensemble Approach for Energy Theft Detection in Smart Grids
Xun Yuan
Yang Yang
Asif Iqbal
P. Gope
Biplab Sikdar
DiffM
147
3
0
30 Jul 2023
RGB-D-Fusion: Image Conditioned Depth Diffusion of Humanoid Subjects
RGB-D-Fusion: Image Conditioned Depth Diffusion of Humanoid SubjectsIEEE Access (IEEE Access), 2023
Sascha Kirch
Valeria Olyunina
Jan Ondřej
Rafael Pagés
Sergio Martín
Clara Pérez-Molina
200
3
0
29 Jul 2023
Minimally-Supervised Speech Synthesis with Conditional Diffusion Model
  and Language Model: A Comparative Study of Semantic Coding
Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic CodingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Chunyu Qiang
Hao Li
Hao Ni
He Qu
Ruibo Fu
Tao Wang
Longbiao Wang
Jianwu Dang
DiffM
199
16
0
28 Jul 2023
TransFusion: Generating Long, High Fidelity Time Series using Diffusion
  Models with Transformers
TransFusion: Generating Long, High Fidelity Time Series using Diffusion Models with TransformersMachine Learning with Applications (MLWA), 2023
Md Fahim Sikder
R. Ramachandranpillai
Fredrik Heintz
DiffM
262
20
0
24 Jul 2023
Predict, Refine, Synthesize: Self-Guiding Diffusion Models for
  Probabilistic Time Series Forecasting
Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time Series ForecastingNeural Information Processing Systems (NeurIPS), 2023
Marcel Kollovieh
Abdul Fatir Ansari
Michael Bohlke-Schneider
Jasper Zschiegner
Hao Wang
Yuyang Wang
DiffMAI4TS
316
90
0
21 Jul 2023
Progressive distillation diffusion for raw music generation
Progressive distillation diffusion for raw music generation
Svetlana Pavlova
DiffM
227
0
0
20 Jul 2023
Previous
123...131415...212223
Next
Page 14 of 23
Pageof 23