ResearchTrend.AI
DiffWave: A Versatile Diffusion Model for Audio Synthesis


21 September 2020
Zhifeng Kong
Wei Ping
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
    DiffM
    BDL
arXiv: 2009.09761

Papers citing "DiffWave: A Versatile Diffusion Model for Audio Synthesis"

50 / 977 papers shown
SingNet: Towards a Large-Scale, Diverse, and In-the-Wild Singing Voice Dataset
Yicheng Gu
Chaoren Wang
J. Zhang
Xueyao Zhang
Zihao Fang
Haorui He
Zhizheng Wu
18
2
0
14 May 2025
ItDPDM: Information-Theoretic Discrete Poisson Diffusion Model
Sagnik Bhattacharya
Abhiram Gorle
Ahmed Mohsin
Ahsan Bilal
Connor Ding
Amit Kumar Singh Yadav
Tsachy Weissman
DiffM
45
0
0
08 May 2025
Research on Anomaly Detection Methods Based on Diffusion Models
Yi Chen
DiffM
56
0
0
08 May 2025
Wasserstein Convergence of Score-based Generative Models under Semiconvexity and Discontinuous Gradients
Stefano Bruno
Sotirios Sabanis
DiffM
48
0
0
06 May 2025
T2S: High-resolution Time Series Generation with Text-to-Series Diffusion Models
Yunfeng Ge
Jiawei Li
Yiji Zhao
Haomin Wen
Zhao Li
M. Qiu
H. Li
Ming Jin
Shirui Pan
DiffM
129
0
0
05 May 2025
A Time-Series Data Augmentation Model through Diffusion and Transformer Integration
Yuren Zhang
Zhongnan Pu
Lei Jing
27
0
0
01 May 2025
TriniMark: A Robust Generative Speech Watermarking Method for Trinity-Level Attribution
Yue Li
W. Liu
Dongdong Lin
42
0
0
29 Apr 2025
Integration Flow Models
Jingjing Wang
Dan Zhang
Joshua Luo
Yin Yang
Feng Luo
122
0
0
28 Apr 2025
Emergence and Evolution of Interpretable Concepts in Diffusion Models
Berk Tinaz
Zalan Fabian
Mahdi Soltanolkotabi
DiffM
19
0
0
21 Apr 2025
SOLIDO: A Robust Watermarking Method for Speech Synthesis via Low-Rank Adaptation
Yue Li
Weizhi Liu
Dongdong Lin
27
0
0
21 Apr 2025
Novel Concept-Oriented Synthetic Data approach for Training Generative AI-Driven Crystal Grain Analysis Using Diffusion Model
Ahmed Sobhi Saleh
Kristof Croes
Hajdin Ceric
Ingrid De Wolf
Houman Zahedmanesh
DiffM
24
0
0
21 Apr 2025
Diffusion-Driven Inertial Generated Data for Smartphone Location Classification
Noa Cohen
Rotem Dror
Itzik Klein
DiffM
40
0
0
20 Apr 2025
Image Editing with Diffusion Models: A Survey
Jia Wang
Jie Hu
Xiaoqi Ma
Hanghang Ma
Xiaoming Wei
Enhua Wu
66
0
0
17 Apr 2025
Generalized Visual Relation Detection with Diffusion Models
Kaifeng Gao
Siqi Chen
Hanwang Zhang
Jun Xiao
Yueting Zhuang
Qianru Sun
34
0
0
16 Apr 2025
Beyond Words: Augmenting Discriminative Richness via Diffusions in Unsupervised Prompt Learning
Hairui Ren
Fan Tang
He Zhao
Zixuan Wang
Dandan Guo
Yi Chang
VLM
36
0
0
16 Apr 2025
Deep Audio Watermarks are Shallow: Limitations of Post-Hoc Watermarking Techniques for Speech
P. O'Reilly
Zeyu Jin
Jiaqi Su
Bryan Pardo
24
0
0
15 Apr 2025
SD-ReID: View-aware Stable Diffusion for Aerial-Ground Person Re-Identification
Xiang Hu
Pingping Zhang
Yuhao Wang
Bin Yan
Huchuan Lu
23
0
0
13 Apr 2025
D$^2$iT: Dynamic Diffusion Transformer for Accurate Image Generation
Weinan Jia
Mengqi Huang
Nan Chen
Lei Zhang
Zhendong Mao
21
0
0
13 Apr 2025
Scalable Motion In-betweening via Diffusion and Physics-Based Character Adaptation
Jia Qin
DiffM
VGen
36
0
0
13 Apr 2025
On the Design of Diffusion-based Neural Speech Codecs
Pietro Foti
Andreas Brendel
DiffM
34
0
0
11 Apr 2025
SlimSpeech: Lightweight and Efficient Text-to-Speech with Slim Rectified Flow
K. Wang
Wenhao Guan
Shenghui Lu
Jianglong Yao
Lin Li
Q. Hong
27
0
0
10 Apr 2025
Probabilistic QoS Metric Forecasting in Delay-Tolerant Networks Using Conditional Diffusion Models on Latent Dynamics
Enming Zhang
Zheng Liu
Yu Xiang
Yanwen Qu
26
0
0
09 Apr 2025
A Hybrid Wavelet-Fourier Method for Next-Generation Conditional Diffusion Models
Andrew Kiruluta
Andreas Lemos
DiffM
30
3
0
04 Apr 2025
A Survey on Music Generation from Single-Modal, Cross-Modal, and Multi-Modal Perspectives
Shuyu Li
Shulei Ji
Zihao W. Wang
Songruoyao Wu
Jiaxing Yu
K. Zhang
MGen
VGen
70
1
0
01 Apr 2025
ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion
Rana Muhammad Shahroz Khan
Dongwen Tang
Pingzhi Li
Kai Wang
Tianlong Chen
AI4CE
130
0
0
31 Mar 2025
Dual Audio-Centric Modality Coupling for Talking Head Generation
Ao Fu
Ziqi Ni
Yi Zhou
37
1
0
26 Mar 2025
PCM : Picard Consistency Model for Fast Parallel Sampling of Diffusion Models
Junhyuk So
Jiwoong Shin
Chaeyeon Jang
Eunhyeok Park
DiffM
48
0
0
25 Mar 2025
WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching
Tianze Luo
Xingchen Miao
Wenbo Duan
DiffM
37
0
0
20 Mar 2025
Bezier Distillation
Ling Feng
SK Yang
39
0
0
20 Mar 2025
Improving Discriminator Guidance in Diffusion Models
Alexandre Verine
Mehdi Inane
Florian Le Bronnec
Benjamin Négrevergne
Y. Chevaleyre
DiffM
48
0
0
20 Mar 2025
DiffGAP: A Lightweight Diffusion Module in Contrastive Space for Bridging Cross-Model Gap
Shentong Mo
Zehua Chen
Fan Bao
Jun-Jie Zhu
DiffM
50
0
0
15 Mar 2025
Cross-Modal Diffusion for Biomechanical Dynamical Systems Through Local Manifold Alignment
S. Dey
Sarath Ravindran Nair
DiffM
75
0
0
15 Mar 2025
Spatio-temporal Fourier Transformer (StFT) for Long-term Dynamics Prediction
Da Long
Shandian Zhe
Samuel Williams
L. Oliker
Zhe Bai
AI4TS
AI4CE
44
0
0
14 Mar 2025
MAVFlow: Preserving Paralinguistic Elements with Conditional Flow Matching for Zero-Shot AV2AV Multilingual Translation
Sungwoo Cho
J. Choi
Sungnyun Kim
Se-Young Yun
63
0
0
14 Mar 2025
$^R$FLAV: Rolling Flow matching for infinite Audio Video generation
Alex Ergasti
Giuseppe Tarollo
Filippo Botti
Tomaso Fontanini
Claudio Ferrari
Massimo Bertozzi
Andrea Prati
VGen
45
0
0
13 Mar 2025
Probability-Flow ODE in Infinite-Dimensional Function Spaces
Kunwoo Na
Junghyun Lee
Se-Young Yun
Sungbin Lim
45
0
0
13 Mar 2025
Probabilistic Forecasting via Autoregressive Flow Matching
Ahmed El-Gazzar
Marcel van Gerven
AI4TS
43
0
0
13 Mar 2025
Data augmentation using diffusion models to enhance inverse Ising inference
Yechan Lim
Sangwon Lee
Junghyo Jo
DiffM
43
0
0
13 Mar 2025
Studying Classifier(-Free) Guidance From a Classifier-Centric Perspective
Xiaoming Zhao
Alexander Schwing
FaML
63
0
0
13 Mar 2025
Accelerating Diffusion Sampling via Exploiting Local Transition Coherence
Shangwen Zhu
Han Zhang
Zhantao Yang
Qianyu Peng
Zhao Pu
H. Wang
Fan Cheng
DiffM
48
0
0
12 Mar 2025
Understanding the Quality-Diversity Trade-off in Diffusion Language Models
Zak Buzzard
DiffM
48
0
0
11 Mar 2025
Boosting Diffusion-Based Text Image Super-Resolution Model Towards Generalized Real-World Scenarios
Chenglu Pan
Xiaogang Xu
Ganggui Ding
Y. Zhang
Wenbo Li
Jiarong Xu
Qingbiao Wu
55
0
0
10 Mar 2025
Backdoor Attacks on Discrete Graph Diffusion Models
Jiawen Wang
Samin Karim
Yuan Hong
Binghui Wang
DiffM
63
0
0
08 Mar 2025
Discrete Contrastive Learning for Diffusion Policies in Autonomous Driving
Kalle Kujanpää
Daulet Baimukashev
Farzeen Munir
Shoaib Azam
Tomasz Piotr Kucner
J. Pajarinen
Ville Kyrki
41
0
0
07 Mar 2025
Accelerating db-A* for Kinodynamic Motion Planning Using Diffusion
Julius Franke
A. Moldagalieva
Pia Hanfeld
Wolfgang Hönig
DiffM
75
0
0
07 Mar 2025
Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias
Rui Lu
Runzhe Wang
Kaifeng Lyu
Xitai Jiang
Gao Huang
Mengdi Wang
DiffM
86
1
0
05 Mar 2025
Enhancing Retinal Vessel Segmentation Generalization via Layout-Aware Generative Modelling
Jonathan Fhima
Jan Van Eijgen
Lennert Beeckmans
Thomas Jacobs
Moti Freiman
Luis Filipe Nakayama
Ingeborg Stalmans
Chaim Baskin
Joachim A. Behar
MedIm
62
0
0
03 Mar 2025
Self-attention-based Diffusion Model for Time-series Imputation in Partial Blackout Scenarios
Mohammad Rafid Ul Islam
Prasad Tadepalli
Alan Fern
33
0
0
03 Mar 2025
Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models
Xingzhuo Guo
Yu Zhang
Baixu Chen
Haoran Xu
J. Z. Wang
Mingsheng Long
DiffM
AI4TS
35
1
0
02 Mar 2025
Diffusion-based Planning with Learned Viability Filters
Nicholas Ioannidis
Daniele Reda
S. Cohan
M. van de Panne
69
0
0
26 Feb 2025