DiffWave: A Versatile Diffusion Model for Audio Synthesis

International Conference on Learning Representations (ICLR), 2020
21 September 2020
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
    DiffM BDL

Papers citing "DiffWave: A Versatile Diffusion Model for Audio Synthesis"

50 / 1,133 papers shown
Title
ITS3D: Inference-Time Scaling for Text-Guided 3D Diffusion Models
Zhenglin Zhou
Fan Ma
Xiaobo Xia
Hehe Fan
Yi Yang
Tat-Seng Chua
DiffM 3DGS
53
0
0
27 Nov 2025
GLA-Grad++: An Improved Griffin-Lim Guided Diffusion Model for Speech Synthesis
Teysir Baoueb
Xiaoyu Bie
Mathieu Fontaine
Gaël Richard
DiffM
53
0
0
27 Nov 2025
Advancing Marine Bioacoustics with Deep Generative Models: A Hybrid Augmentation Strategy for Southern Resident Killer Whale Detection
Bruno Padovese
Fabio Frazao
Michael Dowd
Ruth Joy
8
0
0
26 Nov 2025
Generating Separated Singing Vocals Using a Diffusion Model Conditioned on Music Mixtures
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2025
Genís Plaja-Roglans
Yun-Ning Hung
Xavier Serra
Igor Pereira
DiffM
213
0
0
26 Nov 2025
Diffusion for Fusion: Designing Stellarators with Generative AI
Misha Padidar
Teresa Huang
Andrew Giuliani
Marina Spivak
DiffM
163
0
0
25 Nov 2025
Efficient and Fast Generative-Based Singing Voice Separation using a Latent Diffusion Model
IEEE International Joint Conference on Neural Networks (IJCNN), 2025
Genís Plaja-Roglans
Yun-Ning Hung
Xavier Serra
Igor Pereira
DiffM
165
1
0
25 Nov 2025
Temporal-Visual Semantic Alignment: A Unified Architecture for Transferring Spatial Priors from Vision Models to Zero-Shot Temporal Tasks
Xiangkai Ma
Han Zhang
Wenzhong Li
Sanglu Lu
AI4TS VGen
223
0
0
25 Nov 2025
Demystifying Diffusion Objectives: Reweighted Losses are Better Variational Bounds
Jiaxin Shi
Michalis K. Titsias
DiffM
186
0
0
24 Nov 2025
Pathlet Variational Auto-Encoder for Robust Trajectory Generation
Yuanbo Tang
Yan Tang
Z. Zhang
Zihui Zhao
Yang Li
99
0
0
20 Nov 2025
Towards Stable and Structured Time Series Generation with Perturbation-Aware Flow Matching
Jintao Zhang
Mingyue Cheng
Zirui Liu
Xianquan Wang
Yitong Zhou
Qi Liu
AI4TS
85
0
0
18 Nov 2025
Multi-modal Deepfake Detection and Localization with FPN-Transformer
Chende Zheng
Ruiqi Suo
Zhoulin Ji
Jingyi Deng
Fangbin Yi
Chenhao Lin
Chao Shen
ViT
48
0
0
11 Nov 2025
TimeFlow: Towards Stochastic-Aware and Efficient Time Series Generation via Flow Matching Modeling
He Panjing
Cheng Mingyue
Li Li
Zhang XiaoHan
AI4TS
127
0
0
11 Nov 2025
BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective
Andong Li
Tong Lei
Rilin Chen
Kai Li
Meng Yu
Xiaodong Li
Dong Yu
C. Zheng
DiffM
132
0
0
10 Nov 2025
Diffolio: A Diffusion Model for Multivariate Probabilistic Financial Time-Series Forecasting and Portfolio Construction
So-Yoon Cho
Jin-Young Kim
Kayoung Ban
Hyeng Keun Koo
Hyun-Gyoon Kim
DiffM
68
0
0
10 Nov 2025
Learning to Land Anywhere: Transferable Generative Models for Aircraft Trajectories
Olav Finne Praesteng Larsen
Massimiliano Ruocco
Michail Spitieris
Abdulmajid Murad
Martina Ragosta
128
0
0
06 Nov 2025
HAGI++: Head-Assisted Gaze Imputation and Generation
Chuhan Jiao
Zhiming Hu
Andreas Bulling
152
0
0
04 Nov 2025
Effective Series Decomposition and Components Learning for Time Series Generation
Zixuan Ma
Chenfeng Huang
DiffM AI4TS
179
0
0
02 Nov 2025
Diffusion Models at the Drug Discovery Frontier: A Review on Generating Small Molecules versus Therapeutic Peptides
Yiquan Wang
Yahui Ma
Yuhan Chang
Jiayao Yan
Jialin Zhang
Minnuo Cai
Kai Wei
MedIm
347
0
0
31 Oct 2025
NaturalVoices: A Large-Scale, Spontaneous and Emotional Podcast Dataset for Voice Conversion
Zongyang Du
Shreeram Suresh Chandra
Ismail Rasim Ulgen
Aurosweta Mahapatra
Ali N. Salman
Carlos Busso
Berrak Sisman
122
0
0
31 Oct 2025
Information-Theoretic Discrete Diffusion
Moongyu Jeon
Sangwoo Shin
Dongjae Jeon
Albert No
DiffM FedML
151
0
0
28 Oct 2025
Closing Gaps: An Imputation Analysis of ICU Vital Signs
Alisher Turubayev
Anna Shopova
Fabian Lange
Mahmut Kamalak
Paul Mattes
Victoria Ayvasky
B. Arnrich
Bjarne Pfitzner
Robin Van De Water
120
1
0
28 Oct 2025
Simple Denoising Diffusion Language Models
Huaisheng Zhu
Zhengyu Chen
Shijie Zhou
Zhihui Xie
Yige Yuan
Zhimeng Guo
Siyuan Xu
Hangfan Zhang
V. Honavar
Teng Xiao
DiffM
138
0
0
27 Oct 2025
BadGraph: A Backdoor Attack Against Latent Diffusion Model for Text-Guided Graph Generation
Liang Ye
Shengqin Chen
Jiazhu Dai
DiffM
113
0
0
23 Oct 2025
Towards General Modality Translation with Contrastive and Predictive Latent Diffusion Bridge
Nimrod Berman
O. Joglekar
Eitan Kosman
Dotan Di Castro
Omri Azencot
DiffM
174
2
0
23 Oct 2025
AutoScape: Geometry-Consistent Long-Horizon Scene Generation
Jiacheng Chen
Ziyu Jiang
Mingfu Liang
Bingbing Zhuang
Jong-Chyi Su
Sparsh Garg
Ying Wu
Manmohan Chandraker
VGen
112
0
0
23 Oct 2025
Gradient Variance Reveals Failure Modes in Flow-Based Generative Models
Teodora Reu
Sixtine Dromigny
Michael M Bronstein
Francisco Vargas
192
1
0
20 Oct 2025
MUG-V 10B: High-efficiency Training Pipeline for Large Video Generation Models
Yongshun Zhang
Zhongyi Fan
Yonghang Zhang
Zhangzikang Li
Weifeng Chen
Zhongwei Feng
Chaoyue Wang
Peng Hou
Anxiang Zeng
VGen
251
0
0
20 Oct 2025
Adaptive Discretization for Consistency Models
Jiayu Bai
Zhanbo Feng
Zhijie Deng
Tianqi Hou
Robert C. Qiu
Zenan Ling
120
0
0
20 Oct 2025
Sequence Modeling with Spectral Mean Flows
Jinwoo Kim
Max Beier
Nicolas Hoischen
Nayun Kim
Seunghoon Hong
BDL
138
0
0
17 Oct 2025
Counting Hallucinations in Diffusion Models
Shuai Fu
Jian Zhou
Qi Chen
Huang Jing
Huy Anh Nguyen
Xiaohan Liu
Zhixiong Zeng
Lin Ma
Quanshi Zhang
Qi Wu
DiffM HILM
247
0
0
15 Oct 2025
Diffusion Models for Reinforcement Learning: Foundations, Taxonomy, and Development
Changfu Xu
Jianxiong Guo
Yuzhu Liang
Haiyang Huang
Haodong Zou
Xi Zheng
Shui Yu
Xiaowen Chu
Jiannong Cao
Tian-sheng Wang
OffRL AI4CE
171
0
0
14 Oct 2025
Audio Palette: A Diffusion Transformer with Multi-Signal Conditioning for Controllable Foley Synthesis
Junnuo Wang
DiffM
91
0
0
14 Oct 2025
Unlocking the Potential of Diffusion Language Models through Template Infilling
Junhoo Lee
Seungyeon Kim
Nojun Kwak
DiffM AI4CE
52
0
0
13 Oct 2025
WaveletDiff: Multilevel Wavelet Diffusion For Time Series Generation
Yu-Hsiang Wang
O. Milenkovic
DiffM AI4TS
280
0
0
13 Oct 2025
DiffStyleTS: Diffusion Model for Style Transfer in Time Series
Mayank Nagda
Phil Ostheimer
Justus Arweiler
Indra Jungjohann
Jennifer Werner
...
Michael Bortz
Hans Hasse
Stephan Mandt
Marius Kloft
Sophie Fellenz
DiffM AI4TS
76
0
0
13 Oct 2025
O_O-VC: Synthetic Data-Driven One-to-One Alignment for Any-to-Any Voice Conversion
Huu Tuong Tu
Huan Vu
Cuong Tien Nguyen
Dien Hy Ngo
Nguyen Thi Thu Trang
68
0
0
10 Oct 2025
Traj-Transformer: Diffusion Models with Transformer for GPS Trajectory Generation
Zhiyang Zhang
Ningcong Chen
Xin Zhang
Yanhua Li
Shen Su
Hui Lu
Jun Luo
91
0
0
07 Oct 2025
Demystifying MaskGIT Sampler and Beyond: Adaptive Order Selection in Masked Diffusion
Satoshi Hayakawa
Yuhta Takida
Masaaki Imaizumi
Hiromi Wakaki
Yuki Mitsufuji
DiffM
275
0
0
06 Oct 2025
Pitch-Conditioned Instrument Sound Synthesis From an Interactive Timbre Latent Space
Christian Limberg
Fares Schulz
Zhe Zhang
Stefan Weinzierl
77
0
0
05 Oct 2025
Beyond Static Knowledge Messengers: Towards Adaptive, Fair, and Scalable Federated Learning for Medical AI
Jahidul Arafat
Fariha Tasmin
Sanjaya Poudel
Ahsan Habib Tareq
FedML
179
0
0
05 Oct 2025
GDiffuSE: Diffusion-based speech enhancement with noise model guidance
Efrayim Yanir
David Burshtein
Sharon Gannot
DiffM
116
0
0
05 Oct 2025
SingMOS-Pro: An Comprehensive Benchmark for Singing Quality Assessment
Yuxun Tang
Lan Liu
Wenhao Feng
Yiwen Zhao
Jionghao Han
Yifeng Yu
Jiatong Shi
Qin Jin
136
0
0
02 Oct 2025
Inference-Time Search using Side Information for Diffusion-based Image Reconstruction
Mahdi Farahbakhsh
Vishnu Teja Kunde
D. Kalathil
Krishna R. Narayanan
J. Chamberland
119
0
0
02 Oct 2025
Diffusion Models and the Manifold Hypothesis: Log-Domain Smoothing is Geometry Adaptive
Tyler Farghly
Peter Potaptchik
Samuel Howard
George Deligiannidis
Jakiw Pidstrigach
DiffM
179
2
0
02 Oct 2025
Contrastive Diffusion Guidance for Spatial Inverse Problems
Sattwik Basu
Chaitanya Amballa
Zhongweiyang Xu
Jorge Vančo Sampedro
Srihari Nelakuditi
Romit Roy Choudhury
68
0
0
30 Sep 2025
MARS: Audio Generation via Multi-Channel Autoregression on Spectrograms
Eleonora Ristori
Luca Bindini
Paolo Frasconi
76
0
0
30 Sep 2025
Free Draft-and-Verification: Toward Lossless Parallel Decoding for Diffusion Large Language Models
Shutong Wu
Jiawei Zhang
DiffM
275
1
0
30 Sep 2025
Data-to-Energy Stochastic Dynamics
Kirill Tamogashev
Nikolay Malkin
DiffM
124
0
0
30 Sep 2025
Environment-Aware Satellite Image Generation with Diffusion Models
Nikos Kostagiolas
Pantelis Georgiades
Yannis Panagakis
M. Nicolaou
68
0
0
29 Sep 2025
High-Quality Sound Separation Across Diverse Categories via Visually-Guided Generative Modeling
Chao Huang
Susan Liang
Yapeng Tian
Anurag Kumar
Chenliang Xu
DiffM
107
0
0
26 Sep 2025