Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2009.09761
Cited By
v1
v2
v3 (latest)
DiffWave: A Versatile Diffusion Model for Audio Synthesis
International Conference on Learning Representations (ICLR), 2020
21 September 2020
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM
BDL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"DiffWave: A Versatile Diffusion Model for Audio Synthesis"
50 / 1,135 papers shown
GCDance: Genre-Controlled Music-Driven 3D Full Body Dance Generation
Xinran Liu
Xu Dong
Shenbin Qian
Diptesh Kanojia
Wenwu Wang
Zhenhua Feng
DiffM
391
0
0
25 Feb 2025
Bayesian Computation in Deep Learning
Wenlong Chen
Bolian Li
Ruqi Zhang
Yingzhen Li
BDL
580
1
0
25 Feb 2025
VVRec: Reconstruction Attacks on DL-based Volumetric Video Upstreaming via Latent Diffusion Model with Gamma Distribution
AAAI Conference on Artificial Intelligence (AAAI), 2025
Rui Lu
B. Zhang
Dan Wang
VGen
516
0
0
25 Feb 2025
VCT: Training Consistency Models with Variational Noise Coupling
Gianluigi Silvestri
Luca Ambrogioni
Chieh-Hsin Lai
Yuhta Takida
Yuki Mitsufuji
373
7
0
25 Feb 2025
Sample-Efficient Diffusion-based Control of Complex Physics Systems
Hongyi Chen
Jingtao Ding
Jianhai Shu
Xinchun Yu
Xiaojun Liang
Yong Li
Jinqiang Cui
933
0
0
25 Feb 2025
Diffusion Models for Tabular Data: Challenges, Current Progress, and Future Directions
Zhong Li
Qi Huang
Lincen Yang
Jiayang Shi
Zhao Yang
Niki van Stein
Thomas Bäck
M. Leeuwen
DiffM
311
8
0
24 Feb 2025
RestoreGrad: Signal Restoration Using Conditional Denoising Diffusion Models with Jointly Learned Prior
Ching Hua Lee
Chouchang Yang
Jaejin Cho
Yashas Malur Saidutta
R. S. Srinivasa
Yilin Shen
Hongxia Jin
DiffM
524
1
0
19 Feb 2025
RAPID: Retrieval Augmented Training of Differentially Private Diffusion Models
International Conference on Learning Representations (ICLR), 2025
Tanqiu Jiang
Changjiang Li
Fenglong Ma
Ting Wang
297
1
0
18 Feb 2025
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
Sihyun Yu
Meera Hahn
Dan Kondratyuk
Jinwoo Shin
Agrim Gupta
José Lezama
Irfan Essa
David A. Ross
Jonathan Huang
DiffM
VGen
706
8
0
18 Feb 2025
NaturalL2S: End-to-End High-quality Multispeaker Lip-to-Speech Synthesis with Differential Digital Signal Processing
Neural Networks (NN), 2025
Yifan Liang
Fangkun Liu
Andong Li
Xiaodong Li
C. Zheng
312
2
0
17 Feb 2025
Vision-Enhanced Time Series Forecasting via Latent Diffusion Models
Weilin Ruan
Siru Zhong
Haomin Wen
Yuxuan Liang
AI4TS
381
8
0
16 Feb 2025
TokenSynth: A Token-based Neural Synthesizer for Instrument Cloning and Text-to-Instrument
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Kyungsu Kim
Junghyun Koo
Sungho Lee
Haesun Joung
Kyogu Lee
381
2
0
13 Feb 2025
PoGDiff: Product-of-Gaussians Diffusion Models for Imbalanced Text-to-Image Generation
Ziyan Wang
Sizhe Wei
Xiaoming Huo
Hao Wang
DiffM
565
1
0
12 Feb 2025
Solving Linear-Gaussian Bayesian Inverse Problems with Decoupled Diffusion Sequential Monte Carlo
Filip Ekstrom Kelvinius
Zheng Zhao
Fredrik Lindsten
DiffM
405
4
0
10 Feb 2025
Understanding Representation Dynamics of Diffusion Models via Low-Dimensional Modeling
Xiao Li
Zekai Zhang
Xiang Li
Siyi Chen
Zhihui Zhu
Peng Wang
Qing Qu
DiffM
561
6
0
09 Feb 2025
Generating 3D Binding Molecules Using Shape-Conditioned Diffusion Models with Guidance
Ziqi Chen
Bo Peng
Tianhua Zhai
Daniel Adu-Ampratwum
Xia Ning
325
10
0
09 Feb 2025
Stochastic Forward-Backward Deconvolution: Training Diffusion Models with Finite Noisy Datasets
Haoye Lu
Qifan Wu
Yaoliang Yu
DiffM
409
9
0
08 Feb 2025
Conditional Diffusion Models are Medical Image Classifiers that Provide Explainability and Uncertainty for Free
Gian Mario Favero
Parham Saremi
Emily Kaczmarek
Brennan Nichyporuk
Tal Arbel
DiffM
MedIm
322
5
0
06 Feb 2025
DiffListener: Discrete Diffusion Model for Listener Generation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Siyeol Jung
Taehwan Kim
96
4
0
05 Feb 2025
A Diffusion Model Translator for Efficient Image-to-Image Translation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Mengfei Xia
Yu Zhou
Ran Yi
Wenshu Fan
Wenping Wang
VLM
483
24
0
01 Feb 2025
Spectral Analysis of Diffusion Models with Application to Schedule Design
Roi Benita
Michael Elad
Joseph Keshet
DiffM
444
0
0
31 Jan 2025
EDSep: An Effective Diffusion-Based Method for Speech Source Separation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Jinwei Dong
Xinsheng Wang
Qirong Mao
329
5
0
28 Jan 2025
Exploring Preference-Guided Diffusion Model for Cross-Domain Recommendation
Knowledge Discovery and Data Mining (KDD), 2025
Xiaodong Li
Hengzhu Tang
Shuaiyi Nie
Xinghua Zhang
Li Gao
Suqi Cheng
D. Yin
Tingwen Liu
DiffM
334
6
0
20 Jan 2025
Block Flow: Learning Straight Flow on Data Blocks
Zibin Wang
Zhiyuan Ouyang
Xiangyun Zhang
220
0
0
20 Jan 2025
Generative diffusion model with inverse renormalization group flows
Kanta Masuki
Yuto Ashida
DiffM
285
6
0
17 Jan 2025
Simplified and Generalized Masked Diffusion for Discrete Data
Neural Information Processing Systems (NeurIPS), 2024
Jiaxin Shi
Kehang Han
Zehao Wang
Arnaud Doucet
Michalis K. Titsias
DiffM
617
324
0
17 Jan 2025
Pruning for Sparse Diffusion Models based on Gradient Flow
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Ben Wan
Tianyi Zheng
Zhaoyu Chen
Yuxiao Wang
Jia Wang
132
4
0
17 Jan 2025
Bridge-SR: Schr\"odinger Bridge for Efficient SR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Chong Li
Zehua Chen
Fan Bao
Jun-Jie Zhu
DiffM
181
9
0
14 Jan 2025
S-Diff: An Anisotropic Diffusion Model for Collaborative Filtering in Spectral Domain
Web Search and Data Mining (WSDM), 2024
Rui Xia
Yanhua Cheng
Yongxiang Tang
Xiaocheng Liu
Xialong Liu
Lisong Wang
Peng Jiang
DiffM
390
8
0
03 Jan 2025
Cached Adaptive Token Merging: Dynamic Token Reduction and Redundant Computation Elimination in Diffusion Model
Omid Saghatchian
Atiyeh Gh. Moghadam
Ahmad Nickabadi
MoMe
340
8
0
03 Jan 2025
Text2Data: Low-Resource Data Generation with Textual Control
AAAI Conference on Artificial Intelligence (AAAI), 2024
Shiyu Wang
Yihao Feng
Tian Lan
Ning Yu
Yu Bai
Ran Xu
Han Wang
Caiming Xiong
Siyang Song
DiffM
354
0
0
03 Jan 2025
Melody-Guided Music Generation
Shaopeng Wei
Manzhen Wei
Haoyu Wang
Yu Zhao
Gang Kou
MGen
277
4
0
31 Dec 2024
Simultaneous Music Separation and Generation Using Multi-Track Latent Diffusion Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Tornike Karchkhadze
M. Izadi
Shlomo Dubnov
DiffM
355
12
0
31 Dec 2024
Memory-Centric Computing: Recent Advances in Processing-in-DRAM
O. Mutlu
Ataberk Olgun
Geraldo F. Oliveira
Ismail Emir Yüksel
333
11
0
26 Dec 2024
DiFiC: Your Diffusion Model Holds the Secret to Fine-Grained Clustering
Ruohong Yang
Peng Hu
Xi Peng
Xiting Liu
Yunfan Li
331
3
0
25 Dec 2024
TrojFlow: Flow Models are Natural Targets for Trojan Attacks
Zhengyang Qi
Xiaohua Xu
AAML
349
0
0
21 Dec 2024
Investigating the Effects of Diffusion-based Conditional Generative Speech Models Used for Speech Enhancement on Dysarthric Speech
Joanna Reszka
Parvaneh Janbakhshi
Tilak Purohit
Sadegh Mohammadi
DiffM
239
1
0
18 Dec 2024
Synthetic Time Series Data Generation for Healthcare Applications: A PCG Case Study
Annual Conference on Information Sciences and Systems (CISS), 2024
Ainaz Jamshidi
M. Arif
Sabir Ali Kalhoro
Alexander Gelbukh
MedIm
205
1
0
17 Dec 2024
C2F-TP: A Coarse-to-Fine Denoising Framework for Uncertainty-Aware Trajectory Prediction
AAAI Conference on Artificial Intelligence (AAAI), 2024
Zichen Wang
Hao Miao
Senzhang Wang
Renzhi Wang
Jianxin Wang
Jian Zhang
327
9
0
17 Dec 2024
SILA: Signal-to-Language Augmentation for Enhanced Control in Text-to-Audio Generation
Sonal Kumar
Prem Seetharaman
Justin Salamon
Dinesh Manocha
Oriol Nieto
273
1
0
13 Dec 2024
Interpreting Graphic Notation with MusicLDM: An AI Improvisation of Cornelius Cardew's Treatise
BigData Congress [Services Society] (BSS), 2024
Tornike Karchkhadze
Keren Shao
Shlomo Dubnov
247
0
0
12 Dec 2024
AppGen: Mobility-aware App Usage Behavior Generation for Mobile Users
Zihan Huang
Tong Li
Yong Li
206
2
0
10 Dec 2024
LMDM:Latent Molecular Diffusion Model For 3D Molecule Generation
Xiang Chen
DiffM
270
0
0
05 Dec 2024
DiffStyleTTS: Diffusion-based Hierarchical Prosody Modeling for Text-to-Speech with Diverse and Controllable Styles
International Conference on Computational Linguistics (COLING), 2024
Jiaxuan Liu
Zhaoci Liu
Yihan Hu
Yingying Gao
Shilei Zhang
Zhenhua Ling
DiffM
261
6
0
04 Dec 2024
Schedule On the Fly: Diffusion Time Prediction for Faster and Better Image Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Zilyu Ye
Zhiyang Chen
Tiancheng Li
Zemin Huang
Weijian Luo
Guo-Jun Qi
DiffM
619
17
0
02 Dec 2024
From Audio Deepfake Detection to AI-Generated Music Detection -- A Pathway and Overview
Yupei Li
M. Milling
Lucia Specia
Björn Schuller
387
13
0
30 Nov 2024
CoDiff-VC: A Codec-Assisted Diffusion Model for Zero-shot Voice Conversion
Yuke Li
Xinfa Zhu
Hanzhao Li
Jixun Yao
WenJie Tian
XiPeng Yang
Yunlin Chen
Zhifei Li
Lei Xie
DiffM
487
1
0
28 Nov 2024
Comparison of Generative Learning Methods for Turbulence Surrogates
Claudia Drygala
Edmund Ross
F. Mare
Hanno Gottschalk
Francesca di Mare
Hanno Gottschalk
AI4CE
360
5
0
25 Nov 2024
TKG-DM: Training-free Chroma Key Content Generation Diffusion Model
Computer Vision and Pattern Recognition (CVPR), 2024
Ryugo Morita
Stanislav Frolov
Brian B. Moser
Takahiro Shirakawa
Ko Watanabe
Andreas Dengel
Jinjia Zhou
DiffM
436
4
0
23 Nov 2024
Frequency-Guided Posterior Sampling for Diffusion-Based Image Restoration
D. Thaker
Abhishek Goyal
René Vidal
DiffM
307
3
0
22 Nov 2024
Previous
1
2
3
4
5
6
...
21
22
23
Next
Page 5 of 23
Page
of 23
Go