DiffWave: A Versatile Diffusion Model for Audio Synthesis

International Conference on Learning Representations (ICLR), 2020
21 September 2020
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
    DiffM, BDL

Papers citing "DiffWave: A Versatile Diffusion Model for Audio Synthesis"

50 / 1,133 papers shown
T2I-Diff: fMRI Signal Generation via Time-Frequency Image Transform and Classifier-Free Denoising Diffusion Models
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Hwa Hui Tew
Junn Yong Loo
Yee-Fan Tan
Xinyu Tang
H. Ombao
Fuad M. Noman
Raphaël C.-W. Phan
Chee-Ming Ting
DiffM, MedIm
132
0
0
25 Sep 2025
Prompt-aware classifier free guidance for diffusion models
Xuanhao Zhang
Chang Li
DiffM, VLM
104
0
0
25 Sep 2025
Score-based Idempotent Distillation of Diffusion Models
Shehtab Zaman
Chengyan Liu
Kenneth Chiu
DiffM
140
0
0
25 Sep 2025
SPREAD: Sampling-based Pareto front Refinement via Efficient Adaptive Diffusion
Sedjro Salomon Hotegni
Sebastian Peitz
DiffM
123
0
0
25 Sep 2025
SimDiff: Simulator-constrained Diffusion Model for Physically Plausible Motion Generation
Akihisa Watanabe
Jiawei Ren
Li Siyao
Yichen Peng
Erwin Wu
Edgar Simo-Serra
VGen
138
0
0
25 Sep 2025
Shortcut Flow Matching for Speech Enhancement: Step-Invariant flows via single stage training
Naisong Zhou
Saisamarth Rajesh Phaye
Milos Cernak
Tijana Stojkovic
Andy Pearce
Andrea Cavallaro
Andy Harper
84
0
0
25 Sep 2025
PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation
Chen Wang
Chuhao Chen
Yiming Huang
Zhiyang Dou
Yuan Liu
Jiatao Gu
Lingjie Liu
DiffM, VGen, PINN
575
8
0
24 Sep 2025
Learnable Sampler Distillation for Discrete Diffusion Models
Feiyang Fu
Tongxian Guo
Zhaoqiang Liu
DiffM
138
1
0
24 Sep 2025
DynaFlow: Dynamics-embedded Flow Matching for Physically Consistent Motion Generation from State-only Demonstrations
Sowoo Lee
Dongyun Kang
Jaehyun Park
Hae-Won Park
AI4CE
280
0
0
24 Sep 2025
DiffQ: Unified Parameter Initialization for Variational Quantum Algorithms via Diffusion Models
Chi Zhang
Mengxin Zheng
Qian Lou
Fan Chen
DiffM
77
0
0
22 Sep 2025
Audio Super-Resolution with Latent Bridge Models
Chang Li
Zehua Chen
Liyuan Wang
Jun Zhu
300
3
0
22 Sep 2025
Discrete-Time Diffusion-Like Models for Speech Synthesis
Xiaozhou Tan
Minghui Zhao
Mattias Cross
DiffM
130
0
0
22 Sep 2025
FG-Attn: Leveraging Fine-Grained Sparsity In Diffusion Transformers
Sankeerth Durvasula
Kavya Sreedhar
Zain Moustafa
Suraj Kothawade
Ashish Gondimalla
Suvinay Subramanian
Narges Shahidi
Nandita Vijaykumar
VGen
98
0
0
20 Sep 2025
Diffusion-Based Cross-Modal Feature Extraction for Multi-Label Classification
Tian Lan
Yiming Zheng
Jianxin Yin
136
0
0
19 Sep 2025
AdaSTI: Conditional Diffusion Models with Adaptive Dependency Modeling for Spatio-Temporal Imputation
Yubo Yang
Yichen Zhu
Bo Jiang
104
0
0
15 Sep 2025
Scaling to Multimodal and Multichannel Heart Sound Classification with Synthetic and Augmented Biosignals
Milan Marocchi
Matthew Fynn
Kayapanda Mandana
Yue Rong
119
0
0
15 Sep 2025
Flow Straight and Fast in Hilbert Space: Functional Rectified Flow
Jianxin Zhang
Clayton Scott
132
0
0
12 Sep 2025
MoLEx: Mixture of LoRA Experts in Speech Self-Supervised Models for Audio Deepfake Detection
Zihan Pan
Sailor Hardik Bhupendra
Jinyang Wu
MoE
136
1
0
11 Sep 2025
Automated Tuning for Diffusion Inverse Problem Solvers without Generative Prior Retraining
Yasar Utku Alçalar
Junno Yun
Mehmet Akçakaya
DiffM, MedIm
104
2
0
11 Sep 2025
ArtifactGen: Benchmarking WGAN-GP vs Diffusion for Label-Aware EEG Artifact Synthesis
Hritik Arasu
Faisal R Jahangiri
DiffM
140
0
0
09 Sep 2025
DreamAudio: Customized Text-to-Audio Generation with Diffusion Models
Yi Yuan
Xubo Liu
Haohe Liu
Xiyuan Kang
Zhuo Chen
Yuping Wang
Mark D. Plumbley
Wenwu Wang
DiffM
120
0
0
07 Sep 2025
Diffusion Generative Models Meet Compressed Sensing, with Applications to Imaging and Finance
Zhengyi Guo
Jiatu Li
Wenpin Tang
D. Yao
DiffM, MedIm
173
0
0
04 Sep 2025
SwinSRGAN: Swin Transformer-based Generative Adversarial Network for High-Fidelity Speech Super-Resolution
Jiajun Yuan
Xiaochen Wang
Yuhang Xiao
Yulin Wu
Chenhao Hu
Xueyang Lv
168
0
0
04 Sep 2025
FlowECG: Using Flow Matching to Create a More Efficient ECG Signal Generator
Vitalii Bondar
Serhii Semenov
Vira Babenko
Dmytro Holovniak
DiffM, MedIm
84
0
0
31 Aug 2025
Towards High-Fidelity and Controllable Bioacoustic Generation via Enhanced Diffusion Learning
Tianyu Song
Tôn Việt Tạ
DiffM
162
3
0
30 Aug 2025
Partially Functional Dynamic Backdoor Diffusion-based Causal Model
Xinwen Liu
Lei Qian
Song Xi Chen
Niansheng Tang
173
0
0
30 Aug 2025
Visually Grounded Narratives: Reducing Cognitive Burden in Researcher-Participant Interaction
Runtong Wu
Jiayao Song
Fei Teng
Xianhao Ren
Yuyan Gao
Kailun Yang
116
0
0
30 Aug 2025
Stage-Diff: Stage-wise Long-Term Time Series Generation Based on Diffusion Models
Xuan Hou
Shuhan Liu
Zhaohui Peng
Yaohui Chu
Y. Zhang
Yining Wang
DiffM
91
0
0
29 Aug 2025
WaveLLDM: Design and Development of a Lightweight Latent Diffusion Model for Speech Enhancement and Restoration
Kevin Putra Santoso
Rizka Wakhidatus Sholikah
Raden Venantius Hari Ginardi
139
0
0
28 Aug 2025
WildSpoof Challenge Evaluation Plan
Yihan Wu
Jee-weon Jung
Hye-jin Shim
Xin Cheng
Xin Eric Wang
64
2
0
23 Aug 2025
PC-Sampler: Position-Aware Calibration of Decoding Bias in Masked Diffusion Models
Pengcheng Huang
Shuhao Liu
Zhenghao Liu
Shi Yu
Kaiyan Zhang
Zulong Chen
Tong Xiao
184
13
0
18 Aug 2025
FoleySpace: Vision-Aligned Binaural Spatial Audio Generation
Lei Zhao
Rujin Chen
Chi Zhang
Xiao-Lei Zhang
Xuelong Li
112
1
0
18 Aug 2025
Diffusion is a code repair operator and generator
Mukul Singh
Gust Verbruggen
Vu Le
Sumit Gulwani
DiffM
76
0
0
14 Aug 2025
EEGDM: EEG Representation Learning via Generative Diffusion Model
Jia Hong Puah
Sim Kuan Goh
Ziwei Zhang
Zixuan Ye
Chow Khuen Chan
Kheng Seang Lim
Si Lei Fong
Kok Sin Woon
Cuntai Guan
DiffM
188
1
0
13 Aug 2025
Enhancing Spectrogram Realism in Singing Voice Synthesis via Explicit Bandwidth Extension Prior to Vocoder
Runxuan Yang
Kai Li
Guo Chen
Xiaolin Hu
105
0
0
03 Aug 2025
Occlusion-robust Stylization for Drawing-based 3D Animation
Sunjae Yoon
Gwanhyeong Koo
Younghwan Lee
Ji Woo Hong
C. Yoo
3DH
140
1
0
01 Aug 2025
Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction
Giuseppe Cartella
Vittorio Cuculo
Alessandro D’Amelio
Marcella Cornia
Giuseppe Boccignone
Rita Cucchiara
103
1
0
30 Jul 2025
Learning Neural Vocoder from Range-Null Space Decomposition
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Andong Li
Tong Lei
Zhihang Sun
Rilin Chen
Erwei Yin
Xiaodong Li
C. Zheng
125
2
0
28 Jul 2025
Flow Matching Policy Gradients
David McAllister
Songwei Ge
Brent Yi
Chung Min Kim
Ethan Weber
Hongsuk Choi
Haiwen Feng
Angjoo Kanazawa
234
12
0
28 Jul 2025
Efficient Vocal-Conditioned Music Generation via Soft Alignment Attention and Latent Diffusion
Hei Shing Cheung
Boya Zhang
DiffM
118
0
0
26 Jul 2025
SonicGauss: Position-Aware Physical Sound Synthesis for 3D Gaussian Representations
Chunshi Wang
Hongxing Li
Yawei Luo
3DGS
88
0
0
26 Jul 2025
A diffusion-based generative model for financial time series via geometric Brownian motion
Gihun Kim
Sun-Yong Choi
Yeoneung Kim
DiffM, AI4TS
55
0
0
25 Jul 2025
A Comprehensive Review of Diffusion Models in Smart Agriculture: Progress, Applications, and Challenges
Xing Hua
Haodong Chen
Qianqian Duan
Danfeng Hong
Ruijiao Li
Huiliang Shang
MedIm
373
2
0
24 Jul 2025
Diffusion Models for Solving Inverse Problems via Posterior Sampling with Piecewise Guidance
Saeed Mohseni-Sehdeh
Walid Saad
Kei Sakaguchi
Tao Yu
DiffM
130
0
0
22 Jul 2025
CHORDS: Diffusion Sampling Accelerator with Multi-core Hierarchical ODE Solvers
Jiaqi Han
Haotian Ye
Puheng Li
Minkai Xu
James Zou
Stefano Ermon
DiffM
191
0
0
21 Jul 2025
Distilling Parallel Gradients for Fast ODE Solvers of Diffusion Models
B. Zhu
Ruoyu Wang
Tong Zhao
Hanwang Zhang
Chi Zhang
DiffM
123
2
0
20 Jul 2025
Diffusion-based translation between unpaired spontaneous premature neonatal EEG and fetal MEG
Benoît Brebion
Alban Gallard
Katrin Sippel
Amer Zaylaa
Hubert Preissl
Sahar Moghimi
Fabrice Wallois
Yaël Frégier
MedIm
109
0
0
16 Jul 2025
RODS: Robust Optimization Inspired Diffusion Sampling for Detecting and Reducing Hallucination in Generative Models
Yiqi Tian
Pengfei Jin
Mingze Yuan
Na Li
Bo Zeng
Shijie Zhao
DiffM
115
0
0
16 Jul 2025
Knowing When to Quit: Probabilistic Early Exits for Speech Separation
Kenny Falkær Olsen
Mads Østergaard
Karl Ulbæk
S. F. V. Nielsen
Rasmus Malik Høegh Lindrup
Bjørn Sand Jensen
Morten Mørup
UQCV
211
0
0
13 Jul 2025
Warm Starts Accelerate Conditional Diffusion
Jonas Scholz
Richard Turner
DiffM, VLM, AI4CE
111
0
0
12 Jul 2025