
DiffWave: A Versatile Diffusion Model for Audio Synthesis

International Conference on Learning Representations (ICLR), 2020
21 September 2020
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM
BDL

Papers citing "DiffWave: A Versatile Diffusion Model for Audio Synthesis"

50 / 1,134 papers shown
Research on Anomaly Detection Methods Based on Diffusion Models
Yi Chen
DiffM
326
0
0
08 May 2025
Wasserstein Convergence of Score-based Generative Models under Semiconvexity and Discontinuous Gradients
Stefano Bruno
Sotirios Sabanis
DiffM
477
6
0
06 May 2025
T2S: High-resolution Time Series Generation with Text-to-Series Diffusion Models
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Yunfeng Ge
Jiawei Li
Yiji Zhao
Haomin Wen
Zhao Li
M. Qiu
Haoyang Li
Ming Jin
Xiaojun Jia
DiffM
765
6
0
05 May 2025
A Time-Series Data Augmentation Model through Diffusion and Transformer Integration
Yuren Zhang
Zhongnan Pu
Lei Jing
195
1
0
01 May 2025
TriniMark: A Robust Generative Speech Watermarking Method for Trinity-Level Attribution
Yue Li
Wen Liu
Dongdong Lin
284
1
0
29 Apr 2025
Integration Flow Models
Jingjing Wang
Dan Zhang
Joshua Luo
Yin Yang
Feng Luo
967
1
0
28 Apr 2025
SOLIDO: A Robust Watermarking Method for Speech Synthesis via Low-Rank Adaptation
Yue Li
Weizhi Liu
Dongdong Lin
374
0
0
21 Apr 2025
Emergence and Evolution of Interpretable Concepts in Diffusion Models
Berk Tinaz
Zalan Fabian
Mahdi Soltanolkotabi
DiffM
271
7
0
21 Apr 2025
Novel Concept-Oriented Synthetic Data approach for Training Generative AI-Driven Crystal Grain Analysis Using Diffusion Model
Computational Materials Science (Comput. Mater. Sci.), 2025
Ahmed Sobhi Saleh
Kristof Croes
Hajdin Ceric
Ingrid De Wolf
Houman Zahedmanesh
DiffM
194
1
0
21 Apr 2025
Diffusion-Driven Inertial Generated Data for Smartphone Location Classification
Noa Cohen
Rotem Dror
Itzik Klein
DiffM
169
0
0
20 Apr 2025
Image Editing with Diffusion Models: A Survey
Jia Wang
Jie Hu
Xiaoqi Ma
Hanghang Ma
Xiaoming Wei
Enhua Wu
322
5
0
17 Apr 2025
Beyond Words: Augmenting Discriminative Richness via Diffusions in Unsupervised Prompt Learning
Computer Vision and Pattern Recognition (CVPR), 2025
Hairui Ren
Fan Tang
He Zhao
Zixuan Wang
Dandan Guo
Yi Chang
VLM
227
0
0
16 Apr 2025
Generalized Visual Relation Detection with Diffusion Models
Kaifeng Gao
Siqi Chen
Hanwang Zhang
Jun Xiao
Yueting Zhuang
Qianru Sun
285
0
0
16 Apr 2025
Deep Audio Watermarks are Shallow: Limitations of Post-Hoc Watermarking Techniques for Speech
P. O'Reilly
Zeyu Jin
Jiaqi Su
Bryan Pardo
275
6
0
15 Apr 2025
SD-ReID: View-aware Stable Diffusion for Aerial-Ground Person Re-Identification
Xiang Hu
Silong Yong
Yuhao Wang
Bin Yan
Huchuan Lu
328
5
0
13 Apr 2025
Scalable Motion In-betweening via Diffusion and Physics-Based Character Adaptation
Jia Qin
DiffM
VGen
236
0
0
13 Apr 2025
D$^2$iT: Dynamic Diffusion Transformer for Accurate Image Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Weinan Jia
Mengqi Huang
Nan Chen
Lei Zhang
Zhendong Mao
306
6
0
13 Apr 2025
On the Design of Diffusion-based Neural Speech Codecs
Pietro Foti
Andreas Brendel
DiffM
190
0
0
11 Apr 2025
SlimSpeech: Lightweight and Efficient Text-to-Speech with Slim Rectified Flow
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Kaidi Wang
Wenhao Guan
Shenghui Lu
Jianglong Yao
Lin Li
Q. Hong
453
4
0
10 Apr 2025
Probabilistic QoS Metric Forecasting in Delay-Tolerant Networks Using Conditional Diffusion Models on Latent Dynamics
Enming Zhang
Zheng Liu
Yu Xiang
Yanwen Qu
177
2
0
09 Apr 2025
A Hybrid Wavelet-Fourier Method for Next-Generation Conditional Diffusion Models
Andrew Kiruluta
Andreas Lemos
DiffM
244
5
0
04 Apr 2025
A Survey on Music Generation from Single-Modal, Cross-Modal, and Multi-Modal Perspectives
Shuyu Li
Shulei Ji
Zihao Wang
Songruoyao Wu
Jiaxing Yu
Jianchao Tan
MGen
VGen
568
3
0
01 Apr 2025
ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion
Rana Muhammad Shahroz Khan
Dongwen Tang
Pingzhi Li
Xiaojiang Peng
Tianlong Chen
AI4CE
1.1K
1
0
31 Mar 2025
Dual Audio-Centric Modality Coupling for Talking Head Generation
Ao Fu
Ziqi Ni
Yi Zhou
303
2
0
26 Mar 2025
PCM : Picard Consistency Model for Fast Parallel Sampling of Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2025
Junhyuk So
Jiwoong Shin
Chaeyeon Jang
Eunhyeok Park
DiffM
339
0
0
25 Mar 2025
Improving Discriminator Guidance in Diffusion Models
Alexandre Verine
Mehdi Inane
Florian Le Bronnec
Benjamin Négrevergne
Y. Chevaleyre
DiffM
307
0
0
20 Mar 2025
WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Tianze Luo
Xingchen Miao
Wenbo Duan
DiffM
235
6
0
20 Mar 2025
Bezier Distillation
Ling Feng
SK Yang
152
0
0
20 Mar 2025
DiffGAP: A Lightweight Diffusion Module in Contrastive Space for Bridging Cross-Model Gap
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Shentong Mo
Zehua Chen
Fan Bao
Jun-Jie Zhu
DiffM
234
3
0
15 Mar 2025
Cross-Modal Diffusion for Biomechanical Dynamical Systems Through Local Manifold Alignment
S. Dey
Sarath Ravindran Nair
DiffM
296
0
0
15 Mar 2025
StFT: Spatio-temporal Fourier Transformer for Long-term Dynamics Prediction
Da Long
Shandian Zhe
Samuel Williams
L. Oliker
Zhe Bai
AI4TS
AI4CE
344
0
0
14 Mar 2025
MAVFlow: Preserving Paralinguistic Elements with Conditional Flow Matching for Zero-Shot AV2AV Multilingual Translation
Sungwoo Cho
J. Choi
Sungnyun Kim
Se-Young Yun
322
0
0
14 Mar 2025
$^R$FLAV: Rolling Flow matching for infinite Audio Video generation
Alex Ergasti
Giuseppe Tarollo
Filippo Botti
Tomaso Fontanini
Claudio Ferrari
Massimo Bertozzi
Andrea Prati
VGen
210
2
0
13 Mar 2025
Probabilistic Forecasting via Autoregressive Flow Matching
Ahmed El-Gazzar
Marcel van Gerven
AI4TS
286
1
0
13 Mar 2025
Data augmentation using diffusion models to enhance inverse Ising inference
Physical Review E (Phys. Rev. E), 2025
Yechan Lim
Sangwon Lee
Junghyo Jo
DiffM
180
0
0
13 Mar 2025
Studying Classifier(-Free) Guidance From a Classifier-Centric Perspective
Xiaoming Zhao
Alexander Schwing
FaML
368
1
0
13 Mar 2025
Probability-Flow ODE in Infinite-Dimensional Function Spaces
Kunwoo Na
Junghyun Lee
Se-Young Yun
Sungbin Lim
285
1
0
13 Mar 2025
Accelerating Diffusion Sampling via Exploiting Local Transition Coherence
Shangwen Zhu
Han Zhang
Zhantao Yang
Qianyu Peng
Zhao Pu
Jian Shu
Fan Cheng
DiffM
322
0
0
12 Mar 2025
Understanding the Quality-Diversity Trade-off in Diffusion Language Models
Zak Buzzard
DiffM
192
1
0
11 Mar 2025
Boosting Diffusion-Based Text Image Super-Resolution Model Towards Generalized Real-World Scenarios
Chenglu Pan
Xiaohan Li
Ganggui Ding
Yunke Zhang
Wenbo Li
Jiarong Xu
Qingbiao Wu
455
2
0
10 Mar 2025
Backdoor Attacks on Discrete Graph Diffusion Models
Jiawen Wang
Samin Karim
Yuan Hong
Binghui Wang
DiffM
473
1
0
08 Mar 2025
Accelerating db-A* for Kinodynamic Motion Planning Using Diffusion
Julius Franke
A. Moldagalieva
Pia Hanfeld
Wolfgang Hönig
DiffM
303
0
0
07 Mar 2025
Discrete Contrastive Learning for Diffusion Policies in Autonomous Driving
IEEE International Conference on Robotics and Automation (ICRA), 2025
Kalle Kujanpää
Daulet Baimukashev
Farzeen Munir
Shoaib Azam
Tomasz Piotr Kucner
Joni Pajarinen
Ville Kyrki
198
0
0
07 Mar 2025
Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias
International Conference on Learning Representations (ICLR), 2025
Rui Lu
Runzhe Wang
Kaifeng Lyu
Xitai Jiang
Gao Huang
Mengdi Wang
DiffM
286
5
0
05 Mar 2025
Self-attention-based Diffusion Model for Time-series Imputation in Partial Blackout Scenarios
AAAI Conference on Artificial Intelligence (AAAI), 2025
Mohammad Rafid Ul Islam
Prasad Tadepalli
Alan Fern
177
8
0
03 Mar 2025
Enhancing Retinal Vessel Segmentation Generalization via Layout-Aware Generative Modelling
Jonathan Fhima
Jan Van Eijgen
Lennert Beeckmans
Thomas Jacobs
Moti Freiman
Luis Filipe Nakayama
Ingeborg Stalmans
Chaim Baskin
Joachim A. Behar
MedIm
456
1
0
03 Mar 2025
Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models
International Conference on Learning Representations (ICLR), 2025
Xingzhuo Guo
Yu Zhang
Baixu Chen
Haoran Xu
Chao Guo
Mingsheng Long
DiffM
AI4TS
357
6
0
02 Mar 2025
Optimal Stochastic Trace Estimation in Generative Modeling
International Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Xinyang Liu
Hengrong Du
Wei Deng
Ruqi Zhang
AI4TS
262
0
0
26 Feb 2025
DualSpec: Text-to-spatial-audio Generation via Dual-Spectrogram Guided Diffusion Model
Lei Zhao
Sizhou Chen
Linfeng Feng
Ju Liu
Xuelong Li
Fangqiu Yi
DiffM
MDE
417
4
0
26 Feb 2025
Diffusion-based Planning with Learned Viability Filters
Proceedings of the ACM on Computer Graphics and Interactive Techniques (PACMCGIT), 2025
Nicholas Ioannidis
Daniele Reda
S. Cohan
M. van de Panne
258
0
0
26 Feb 2025