ResearchTrend.AI
DiffWave: A Versatile Diffusion Model for Audio Synthesis

21 September 2020
Zhifeng Kong
Wei Ping
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
    DiffM
    BDL

Papers citing "DiffWave: A Versatile Diffusion Model for Audio Synthesis"

50 / 977 papers shown
Diff-MTS: Temporal-Augmented Conditional Diffusion-based AIGC for Industrial Time Series Towards the Large Model Era
Lei Ren
Haiteng Wang
Y. Laili
AI4CE
41
4
0
16 Jul 2024
AIGC for Industrial Time Series: From Deep Generative Models to Large Generative Models
Lei Ren
Haiteng Wang
Yang Tang
Chunhua Yang
AI4TS
AI4CE
49
5
0
16 Jul 2024
R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection
Zheyuan Zhou
Le Wang
N. Fang
Zili Wang
Le-miao Qiu
Shuyou Zhang
42
12
0
15 Jul 2024
GROOT: Generating Robust Watermark for Diffusion-Model-Based Audio Synthesis
Weizhi Liu
Yue Li
Dongdong Lin
Hui Tian
Haizhou Li
WIGM
32
8
0
15 Jul 2024
LiteFocus: Accelerated Diffusion Inference for Long Audio Synthesis
Zhenxiong Tan
Xinyin Ma
Gongfan Fang
Xinchao Wang
36
3
0
15 Jul 2024
Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion
Jian Ma
Wenguan Wang
Yi Yang
Feng Zheng
DiffM
43
0
0
15 Jul 2024
Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling
Noam Elata
T. Michaeli
Michael Elad
DiffM
MedIm
29
9
0
11 Jul 2024
Deep Inverse Design for High-Level Synthesis
Ping Chang
Tosiron Adegbija
Yuchao Liao
Claudio Talarico
Ao Li
Janet Roveda
30
0
0
11 Jul 2024
Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion
Yu Cao
Shaogang Gong
DiffM
32
2
0
09 Jul 2024
TimeLDM: Latent Diffusion Model for Unconditional Time Series Generation
Jian Qian
Miao Sun
Sifan Zhou
Biao Wan
Minhao Li
Patrick Chiang
33
7
0
05 Jul 2024
No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models
Seyedmorteza Sadat
Manuel Kansy
Otmar Hilliges
Romann M. Weber
36
10
0
02 Jul 2024
GVDIFF: Grounded Text-to-Video Generation with Diffusion Models
Huanzhang Dou
Ruixiang Li
Wei Su
Xi Li
DiffM
39
1
0
02 Jul 2024
Pictures Of MIDI: Controlled Music Generation via Graphical Prompts for Image-Based Diffusion Inpainting
Scott H. Hawley
30
2
0
01 Jul 2024
A Comprehensive Survey on Diffusion Models and Their Applications
M. Ahsan
S. Raman
Yingtao Liu
Zahed Siddique
MedIm
DiffM
39
1
0
01 Jul 2024
An Expectation-Maximization Algorithm for Training Clean Diffusion Models from Corrupted Observations
Weimin Bai
Yifei Wang
Wenzheng Chen
He Sun
36
7
0
01 Jul 2024
Diffusion Models and Representation Learning: A Survey
Michael Fuest
Pingchuan Ma
Ming Gui
Johannes S. Fischer
Vincent Tao Hu
Bjorn Ommer
DiffM
30
19
0
30 Jun 2024
Consistency Purification: Effective and Efficient Diffusion Purification towards Certified Robustness
Yiquan Li
Zhongzhu Chen
Kun Jin
Jiongxiao Wang
Bo Li
Chaowei Xiao
DiffM
31
1
0
30 Jun 2024
Open-Source Conversational AI with SpeechBrain 1.0
Mirco Ravanelli
Titouan Parcollet
Adel Moumen
Sylvain de Langen
Cem Subakan
...
Salima Mdhaffar
G. Laperriere
Mickael Rouvier
Renato De Mori
Yannick Esteve
VLM
34
10
0
29 Jun 2024
Latent Diffusion for Neural Spiking Data
J. Kapoor
Auguste Schulz
Julius Vetter
Felix Pei
Richard Gao
Jakob H. Macke
DiffM
38
2
0
27 Jun 2024
DEX-TTS: Diffusion-based EXpressive Text-to-Speech with Style Modeling on Time Variability
Hyun Joon Park
Jin Sob Kim
Wooseok Shin
Sung Won Han
DiffM
33
2
0
27 Jun 2024
DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance
Younghyun Kim
Geunmin Hwang
Junyu Zhang
Eunbyung Park
47
6
0
26 Jun 2024
Molecular Diffusion Models with Virtual Receptors
Matan Halfon
Eyal Rozenberg
Ehud Rivlin
Daniel Freedman
47
0
0
26 Jun 2024
Towards Zero-Shot Text-To-Speech for Arabic Dialects
Khai Duy Doan
Abdul Waheed
Muhammad Abdul-Mageed
38
0
0
24 Jun 2024
Video-Infinity: Distributed Long Video Generation
Zhenxiong Tan
Xingyi Yang
Songhua Liu
Xinchao Wang
VGen
35
19
0
24 Jun 2024
Provable Statistical Rates for Consistency Diffusion Models
Zehao Dou
Minshuo Chen
Mengdi Wang
Zhuoran Yang
DiffM
29
3
0
23 Jun 2024
The Music Maestro or The Musically Challenged, A Massive Music Evaluation Benchmark for Large Language Models
Jiajia Li
Lu Yang
Mingni Tang
Cong Chen
Zuchao Li
Ping Wang
Hai Zhao
LM&MA
38
4
0
22 Jun 2024
LatentExplainer: Explaining Latent Representations in Deep Generative Models with Multimodal Large Language Models
Mengdan Zhu
Raasikh Kanjiani
Jiahui Lu
Andrew Choi
Qirui Ye
Liang Zhao
DiffM
36
1
0
21 Jun 2024
Watch the Watcher! Backdoor Attacks on Security-Enhancing Diffusion Models
Changjiang Li
Ren Pang
Bochuan Cao
Jinghui Chen
Fenglong Ma
Shouling Ji
Ting Wang
DiffM
36
3
0
14 Jun 2024
Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models
Qihao Liu
Zhanpeng Zeng
Ju He
Qihang Yu
Xiaohui Shen
Liang-Chieh Chen
48
18
0
13 Jun 2024
Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos
Changan Chen
Puyuan Peng
Ami Baid
Zihui Xue
Wei-Ning Hsu
David F. Harwath
Kristen Grauman
VGen
39
7
0
13 Jun 2024
Generative Inverse Design of Crystal Structures via Diffusion Models with Transformers
Izumi Takahara
Kiyou Shibata
Teruyasu Mizoguchi
DiffM
AI4CE
34
2
0
13 Jun 2024
CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning
Zeyuan Liu
Kai Yang
Xiu Li
OffRL
42
0
0
11 Jun 2024
AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising
Zigeng Chen
Xinyin Ma
Gongfan Fang
Zhenxiong Tan
Xinchao Wang
46
7
0
11 Jun 2024
Margin-aware Preference Optimization for Aligning Diffusion Models without Reference
Jiwoo Hong
Sayak Paul
Noah Lee
Kashif Rasul
James Thorne
Jongheon Jeong
35
13
0
10 Jun 2024
JenGAN: Stacked Shifted Filters in GAN-Based Speech Synthesis
Hyunjae Cho
Junhyeok Lee
Wonbin Jung
16
0
0
10 Jun 2024
Should you use a probabilistic duration model in TTS? Probably! Especially for spontaneous speech
Shivam Mehta
Harm Lameris
Rajiv Punmiya
Jonas Beskow
Éva Székely
G. Henter
23
1
0
08 Jun 2024
Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-Synthesis
Chin-Yun Yu
Gyorgy Fazekas
21
1
0
07 Jun 2024
Efficient 3D Shape Generation via Diffusion Mamba with Bidirectional SSMs
Shentong Mo
Mamba
21
4
0
07 Jun 2024
Bayesian Power Steering: An Effective Approach for Domain Adaptation of Diffusion Models
Ding Huang
Ting Li
Jian Huang
DiffM
39
1
0
06 Jun 2024
Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
Clement Chadebec
O. Tasar
Eyal Benaroche
Benjamin Aubin
VLM
60
8
0
04 Jun 2024
SimpleSpeech: Towards Simple and Efficient Text-to-Speech with Scalar Latent Transformer Diffusion Models
Dongchao Yang
Dingdong Wang
Haohan Guo
Xueyuan Chen
Xixin Wu
Helen M. Meng
59
25
0
04 Jun 2024
A Survey of Transformer Enabled Time Series Synthesis
Alexander Sommers
Logan Cummins
Sudip Mittal
Shahram Rahimi
Maria Seale
Joseph Jaboure
Thomas Arnold
AI4TS
37
2
0
04 Jun 2024
An Independence-promoting Loss for Music Generation with Language Models
Jean-Marie Lemercier
Simon Rouard
Jade Copet
Yossi Adi
Alexandre Défossez
20
1
0
04 Jun 2024
Convergence of the denoising diffusion probabilistic models for general noise schedules
Yumiharu Nakano
DiffM
49
0
0
03 Jun 2024
Covariance-Adaptive Sequential Black-box Optimization for Diffusion Targeted Generation
Yueming Lyu
Kim yong Tan
Yew-Soon Ong
Ivor W. Tsang
DiffM
28
1
0
02 Jun 2024
Diffusion Tuning: Transferring Diffusion Models via Chain of Forgetting
Jincheng Zhong
Xingzhuo Guo
Jiaxiang Dong
Mingsheng Long
DiffM
40
2
0
02 Jun 2024
AudioLCM: Text-to-Audio Generation with Latent Consistency Models
Huadai Liu
Rongjie Huang
Yang Liu
Hengyuan Cao
Jialei Wang
Xize Cheng
Siqi Zheng
Zhou Zhao
68
8
0
01 Jun 2024
A Survey of Deep Learning Audio Generation Methods
Matej Bozic
Marko Horvat
VLM
MedIm
52
0
0
31 May 2024
Unified Directly Denoising for Both Variance Preserving and Variance Exploding Diffusion Models
Jingjing Wang
Dan Zhang
Feng Luo
DiffM
26
0
0
31 May 2024
Adv-KD: Adversarial Knowledge Distillation for Faster Diffusion Sampling
Kidist Amde Mekonnen
Nicola Dall'Asen
Paolo Rota
21
1
0
31 May 2024