2009.09761
v3 (latest)
DiffWave: A Versatile Diffusion Model for Audio Synthesis
International Conference on Learning Representations (ICLR), 2020
21 September 2020
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM
BDL
Papers citing
"DiffWave: A Versatile Diffusion Model for Audio Synthesis"
50 / 1,133 papers shown
Title
Robust One-Shot Singing Voice Conversion
Naoya Takahashi
M. Singh
Yuki Mitsufuji
DiffM
265
9
0
20 Oct 2022
Differentially Private Diffusion Models
Tim Dockhorn
Tianshi Cao
Arash Vahdat
Karsten Kreis
DiffM
426
127
0
18 Oct 2022
TorchDIVA: An Extensible Computational Model of Speech Production built on an Open-Source Machine Learning Library
PLoS ONE, 2022
Sean M. Kinahan
J. Liss
Visar Berisha
48
2
0
17 Oct 2022
DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
International Conference on Learning Representations (ICLR), 2022
Shansan Gong
Mukai Li
Jiangtao Feng
Zhiyong Wu
Lingpeng Kong
394
448
0
17 Oct 2022
DiffGAR: Model-Agnostic Restoration from Generative Artifacts Using Image-to-Image Diffusion Models
International Conference on Computer Science and Artificial Intelligence (ICCSAI), 2022
Yueqin Yin
Lianghua Huang
Yu Liu
Kaiqiang Huang
DiffM
138
12
0
16 Oct 2022
TransFusion: Transcribing Speech with Multinomial Diffusion
Matthew Baas
Kevin Eloff
Herman Kamper
DiffM
79
6
0
14 Oct 2022
Hierarchical Diffusion Models for Singing Voice Neural Vocoder
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Naoya Takahashi
Mayank Kumar Singh
Yuki Mitsufuji
DiffM
258
18
0
14 Oct 2022
LION: Latent Point Diffusion Models for 3D Shape Generation
Neural Information Processing Systems (NeurIPS), 2022
Fangyin Wei
Arash Vahdat
Francis Williams
Zan Gojcic
Or Litany
Sanja Fidler
Karsten Kreis
DiffM
350
618
0
12 Oct 2022
Human Joint Kinematics Diffusion-Refinement for Stochastic Motion Prediction
AAAI Conference on Artificial Intelligence (AAAI), 2022
Dong Wei
Huaijiang Sun
Bin Li
Jianfeng Lu
Weiqing Li
Xiaoning Sun
Sheng-liang Hu
DiffM
VGen
197
61
0
12 Oct 2022
Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance
Chen Henry Wu
Fernando de la Torre
DiffM
358
79
0
11 Oct 2022
GENIE: Higher-Order Denoising Diffusion Solvers
Neural Information Processing Systems (NeurIPS), 2022
Tim Dockhorn
Arash Vahdat
Karsten Kreis
DiffM
305
141
0
11 Oct 2022
GAN You Hear Me? Reclaiming Unconditional Speech Synthesis from Diffusion Models
Spoken Language Technology Workshop (SLT), 2022
Matthew Baas
Herman Kamper
DiffM
168
8
0
11 Oct 2022
DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
K. Cheuk
Ryosuke Sawata
Toshimitsu Uesaka
Naoki Murata
Naoya Takahashi
Shusuke Takahashi
Dorien Herremans
Yuki Mitsufuji
DiffM
162
21
0
11 Oct 2022
Sequential Neural Score Estimation: Likelihood-Free Inference with Conditional Score Based Diffusion Models
International Conference on Machine Learning (ICML), 2022
Louis Sharrock
J. Simons
Song Liu
Mark Beaumont
DiffM
259
50
0
10 Oct 2022
FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation
International Conference on Machine Learning (ICML), 2022
Chieh-Hsin Lai
Yuhta Takida
Naoki Murata
Toshimitsu Uesaka
Yuki Mitsufuji
Stefano Ermon
DiffM
226
38
0
09 Oct 2022
On Distillation of Guided Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2022
Chenlin Meng
Robin Rombach
Ruiqi Gao
Diederik P. Kingma
Stefano Ermon
Jonathan Ho
Tim Salimans
VLM
DiffM
224
694
0
06 Oct 2022
PSVRF: Learning to restore Pitch-Shifted Voice without reference
Yangfu Li
Xiaodan Lin
Jiaxin Yang
121
0
0
06 Oct 2022
DALL-E-Bot: Introducing Web-Scale Diffusion Models to Robotics
IEEE Robotics and Automation Letters (RA-L), 2022
Ivan Kapelyukh
Vitalis Vosylius
Edward Johns
LM&Ro
DiffM
492
175
0
05 Oct 2022
Imagen Video: High Definition Video Generation with Diffusion Models
Jonathan Ho
William Chan
Chitwan Saharia
Jay Whang
Ruiqi Gao
...
Diederik P. Kingma
Ben Poole
Mohammad Norouzi
David J. Fleet
Tim Salimans
VGen
408
1,843
0
05 Oct 2022
Progressive Text-to-Image Generation
Zhengcong Fei
Mingyuan Fan
Li Zhu
Junshi Huang
268
4
0
05 Oct 2022
Vision+X: A Survey on Multimodal Learning in the Light of Data
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Ye Zhu
Yuehua Wu
Andrii Zadaianchuk
Yan Yan
342
37
0
05 Oct 2022
WaveFit: An Iterative and Non-autoregressive Neural Vocoder based on Fixed-Point Iteration
Spoken Language Technology Workshop (SLT), 2022
Yuma Koizumi
Kohei Yatabe
Heiga Zen
M. Bacchiani
DiffM
194
33
0
03 Oct 2022
OCD: Learning to Overfit with Conditional Diffusion Models
International Conference on Machine Learning (ICML), 2022
Shahar Lutati
Lior Wolf
DiffM
319
10
0
02 Oct 2022
Protein structure generation via folding diffusion
Nature Communications (Nat Commun), 2022
Kevin E. Wu
Kevin Kaichuang Yang
Rianne van den Berg
James Zou
Alex X. Lu
Ava P. Amini
DiffM
353
253
0
30 Sep 2022
TabDDPM: Modelling Tabular Data with Diffusion Models
International Conference on Machine Learning (ICML), 2022
Akim Kotelnikov
Dmitry Baranchuk
Ivan Rubachev
Artem Babenko
DiffM
241
401
0
30 Sep 2022
Equivariant Energy-Guided SDE for Inverse Molecular Design
International Conference on Learning Representations (ICLR), 2022
Fan Bao
Min Zhao
Zhongkai Hao
Pei‐Yun Li
Chongxuan Li
Jun Zhu
DiffM
1.1K
79
0
30 Sep 2022
DreamFusion: Text-to-3D using 2D Diffusion
International Conference on Learning Representations (ICLR), 2022
Ben Poole
Ajay Jain
Jonathan T. Barron
B. Mildenhall
822
3,124
0
29 Sep 2022
ButterflyFlow: Building Invertible Layers with Butterfly Matrices
International Conference on Machine Learning (ICML), 2022
Chenlin Meng
Linqi Zhou
Kristy Choi
Tri Dao
Stefano Ermon
TPM
286
13
0
28 Sep 2022
On Investigating the Conservative Property of Score-Based Generative Models
International Conference on Machine Learning (ICML), 2022
Chen-Hao Chao
Wei-Fang Sun
Bo Wun Cheng
Chun-Yi Lee
248
15
0
26 Sep 2022
All are Worth Words: A ViT Backbone for Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2022
Fan Bao
Shen Nie
Kaiwen Xue
Yue Cao
Chongxuan Li
Hang Su
Jun Zhu
VLM
541
495
0
25 Sep 2022
Controllable Accented Text-to-Speech Synthesis
Rui Liu
Berrak Sisman
Guanglai Gao
Haizhou Li
177
6
0
22 Sep 2022
Mandarin Singing Voice Synthesis with Denoising Diffusion Probabilistic Wasserstein GAN
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022
Yin-Ping Cho
Yu Tsao
Hsin-Min Wang
Yi-Wen Liu
DiffM
221
9
0
21 Sep 2022
Denoising Diffusion Error Correction Codes
International Conference on Learning Representations (ICLR), 2022
Yoni Choukroun
Lior Wolf
DiffM
184
37
0
16 Sep 2022
MDM: Molecular Diffusion Model for 3D Molecule Generation
AAAI Conference on Artificial Intelligence (AAAI), 2022
Lei Huang
Hengtong Zhang
Qifeng Bai
Ka-Chun Wong
DiffM
254
112
0
13 Sep 2022
Blurring Diffusion Models
International Conference on Learning Representations (ICLR), 2022
Emiel Hoogeboom
Tim Salimans
DiffM
353
92
0
12 Sep 2022
Soft Diffusion: Score Matching for General Corruptions
Giannis Daras
M. Delbracio
Hossein Talebi
A. Dimakis
P. Milanfar
DiffM
254
121
0
12 Sep 2022
AudioLM: a Language Modeling Approach to Audio Generation
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2022
Zalan Borsos
Raphaël Marinier
Damien Vincent
Eugene Kharitonov
Olivier Pietquin
...
Dominik Roblek
O. Teboul
David Grangier
Marco Tagliasacchi
Neil Zeghidour
AuLLM
376
806
0
07 Sep 2022
Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow
International Conference on Learning Representations (ICLR), 2022
Xingchao Liu
Chengyue Gong
Qiang Liu
OOD
1.0K
1,944
0
07 Sep 2022
A Survey on Generative Diffusion Model
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Hanqun Cao
Cheng Tan
Zhangyang Gao
Yilun Xu
Guangyong Chen
Pheng-Ann Heng
Stan Z. Li
MedIm
745
408
0
06 Sep 2022
First Hitting Diffusion Models for Generating Manifold, Graph and Categorical Data
Neural Information Processing Systems (NeurIPS), 2022
Mao Ye
Lemeng Wu
Qiang Liu
DiffM
352
21
0
02 Sep 2022
Diffusion Models: A Comprehensive Survey of Methods and Applications
ACM Computing Surveys (ACM CSUR), 2022
Ling Yang
Zhilong Zhang
Yingxia Shao
Shenda Hong
Runsheng Xu
Yue Zhao
Wentao Zhang
Tengjiao Wang
Ming-Hsuan Yang
DiffM
MedIm
1.4K
1,869
0
02 Sep 2022
Evaluating generative audio systems and their metrics
International Society for Music Information Retrieval Conference (ISMIR), 2022
Ashvala Vinay
Alexander Lerch
251
26
0
31 Aug 2022
Let us Build Bridges: Understanding and Extending Diffusion Generative Models
Xingchao Liu
Lemeng Wu
Mao Ye
Qiang Liu
DiffM
192
97
0
31 Aug 2022
Mel Spectrogram Inversion with Stable Pitch
International Society for Music Information Retrieval Conference (ISMIR), 2022
Bruno Di Giorgi
M. Levy
Richard Sharp
133
8
0
26 Aug 2022
Diffusion-based Time Series Imputation and Forecasting with Structured State Space Models
Juan Miguel Lopez Alcaraz
Nils Strodthoff
DiffM
329
233
0
19 Aug 2022
One-shot Generative Prior in Hankel-k-space for Parallel Imaging Reconstruction
IEEE Transactions on Medical Imaging (IEEE TMI), 2022
Hong Peng
Chenbo Jiang
Jing Cheng
Minghui Zhang
Shanshan Wang
Dong Liang
Qiegen Liu
DiffM
MedIm
247
22
0
15 Aug 2022
Wavelet Score-Based Generative Modeling
Neural Information Processing Systems (NeurIPS), 2022
Florentin Guth
Simon Coste
Valentin De Bortoli
S. Mallat
DiffM
233
77
0
09 Aug 2022
DDSP-based Singing Vocoders: A New Subtractive-based Synthesizer and A Comprehensive Evaluation
International Society for Music Information Retrieval Conference (ISMIR), 2022
Da-Yi Wu
Wen-Yi Hsiao
Fu-Rong Yang
Oscar D. Friedman
Warren Jackson
Scott Bruzenak
Yi-Wen Liu
Yi-Hsuan Yang
DiffM
235
26
0
09 Aug 2022
AdaCat: Adaptive Categorical Discretization for Autoregressive Models
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Qiyang Li
Ajay Jain
Pieter Abbeel
OffRL
184
4
0
03 Aug 2022
DeScoD-ECG: Deep Score-Based Diffusion Model for ECG Baseline Wander and Noise Removal
IEEE Journal of Biomedical and Health Informatics (IEEE JBHI), 2022
Huayu Li
G. Ditzler
Janet Roveda
Ao Li
DiffM
191
76
0
31 Jul 2022