Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2009.09761
Cited By
v1
v2
v3 (latest)
DiffWave: A Versatile Diffusion Model for Audio Synthesis
International Conference on Learning Representations (ICLR), 2020
21 September 2020
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM
BDL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"DiffWave: A Versatile Diffusion Model for Audio Synthesis"
50 / 1,135 papers shown
Revisiting Energy Based Models as Policies: Ranking Noise Contrastive Estimation and Interpolating Energy Models
Sumeet Singh
Stephen Tu
Vikas Sindhwani
DiffM
285
11
0
11 Sep 2023
Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio Representation
International Conference on Multimodal Interaction (ICMI), 2023
Anna Deichler
Shivam Mehta
Simon Alexanderson
Jonas Beskow
DiffM
227
30
0
11 Sep 2023
Discrete Denoising Diffusion Approach to Integer Factorization
International Conference on Artificial Neural Networks (ICANN), 2023
Kārlis Freivalds
Emīls Ozoliņš
Guntis Barzdins
DiffM
143
1
0
11 Sep 2023
Variations and Relaxations of Normalizing Flows
Keegan Kelly
Lorena Piedras
Sukrit Rao
David Samuel Roth
BDL
275
2
0
08 Sep 2023
Matcha-TTS: A fast TTS architecture with conditional flow matching
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Shivam Mehta
Ruibo Tu
Jonas Beskow
Éva Székely
G. Henter
312
179
0
06 Sep 2023
BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Takashi Shibuya
Yuhta Takida
Yuki Mitsufuji
272
16
0
06 Sep 2023
sasdim: self-adaptive noise scaling diffusion model for spatial time series imputation
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Shunyang Zhang
Senzhang Wang
Xianzhen Tan
Ruochen Liu
Jian Zhang
Jianxin Wang
177
10
0
05 Sep 2023
DiffHPE: Robust, Coherent 3D Human Pose Lifting with Diffusion
Cédric Rommel
Eduardo Valle
Mickaël Chen
Souhaiel Khalfaoui
Renaud Marlet
Matthieu Cord
Patrick Pérez
224
18
0
04 Sep 2023
FinDiff: Diffusion Models for Financial Tabular Data Generation
International Conference on AI in Finance (ICAF), 2023
Timur Sattarov
Marco Schreyer
Damian Borth
DiffM
199
61
0
04 Sep 2023
NADiffuSE: Noise-aware Diffusion-based Model for Speech Enhancement
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2023
Wen Wang
Dongchao Yang
Qichen Ye
Bowen Cao
Yuexian Zou
DiffM
290
4
0
03 Sep 2023
Diffusion Models with Deterministic Normalizing Flow Priors
Mohsen Zand
Ali Etemad
Michael A. Greenspan
DiffM
380
4
0
03 Sep 2023
PathLDM: Text conditioned Latent Diffusion Model for Histopathology
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Srikar Yellapragada
Alexandros Graikos
Prateek Prasanna
Tahsin M. Kurc
Joel H. Saltz
Dimitris Samaras
AI4CE
365
56
0
01 Sep 2023
VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation
Xin Li
Wenqing Chu
Ye Wu
Weihang Yuan
Fanglong Liu
Tao Gui
Fu Li
Haocheng Feng
Errui Ding
Jingdong Wang
VGen
332
70
0
01 Sep 2023
LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-Speech
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Jing Chen
Xingcheng Song
Zhendong Peng
Binbin Zhang
Fuping Pan
Zhiyong Wu
DiffM
163
24
0
31 Aug 2023
A Review of Differentiable Digital Signal Processing for Music & Speech Synthesis
Frontiers in Signal Processing (FSP), 2023
B. Hayes
Jordie Shier
Gyorgy Fazekas
Andrew Mcpherson
C. Saitis
240
41
0
29 Aug 2023
Elucidating the Exposure Bias in Diffusion Models
International Conference on Learning Representations (ICLR), 2023
Mang Ning
Mingxiao Li
Jianlin Su
A. A. Salah
Itir Onal Ertugrul
DiffM
521
73
0
29 Aug 2023
C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model
Longbin Ji
Pengfei Wei
Yi Ren
Jinglin Liu
Chen Zhang
Xiang Yin
DiffM
149
3
0
29 Aug 2023
Transfusor: Transformer Diffusor for Controllable Human-like Generation of Vehicle Lane Changing Trajectories
Jiqian Dong
Sikai Chen
Samuel Labi
155
2
0
28 Aug 2023
Voice Conversion with Denoising Diffusion Probabilistic GAN Models
International Conference on Advanced Data Mining and Applications (ADMA), 2023
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
DiffM
150
8
0
28 Aug 2023
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
ACM Computing Surveys (ACM Comput. Surv.), 2023
Lin Geng Foo
Hossein Rahmani
Jing Liu
770
49
0
27 Aug 2023
DiffI2I: Efficient Diffusion Model for Image-to-Image Translation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Bin Xia
Yulun Zhang
Shiyin Wang
Yitong Wang
Xing Wu
Yapeng Tian
Wenming Yang
Radu Timotfe
Luc Van Gool
DiffM
VLM
242
22
0
26 Aug 2023
Exploiting Time-Frequency Conformers for Music Audio Enhancement
ACM Multimedia (ACM MM), 2023
Yunkee Chae
Junghyun Koo
Sungho Lee
Kyogu Lee
181
8
0
24 Aug 2023
Audio Generation with Multiple Conditional Diffusion Model
AAAI Conference on Artificial Intelligence (AAAI), 2023
Zhifang Guo
Jianguo Mao
Ruijie Tao
Long Yan
Kazushige Ouchi
Hong Liu
Xiangdong Wang
DiffM
350
31
0
23 Aug 2023
Shape-conditioned 3D Molecule Generation via Equivariant Diffusion Models
Ziqi Chen
Bo Peng
Srinivas Parthasarathy
Xia Ning
DiffM
318
17
0
23 Aug 2023
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
Jiasheng Ye
Zaixiang Zheng
Yu Bao
Lihua Qian
Quanquan Gu
DiffM
647
34
0
23 Aug 2023
Convergence guarantee for consistency models
Junlong Lyu
Zhitang Chen
Shoubo Feng
DiffM
155
5
0
22 Aug 2023
Fast Inference and Update of Probabilistic Density Estimation on Trajectory Prediction
IEEE International Conference on Computer Vision (ICCV), 2023
Takahiro Maeda
Norimichi Ukita
241
46
0
17 Aug 2023
Enhancing Phrase Representation by Information Bottleneck Guided Text Diffusion Process for Keyphrase Extraction
International Conference on Language Resources and Evaluation (LREC), 2023
Yuanzhen Luo
Qingyu Zhou
F. Zhou
DiffM
211
3
0
17 Aug 2023
DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding
IEEE International Conference on Computer Vision (ICCV), 2023
J. Choi
Joanna Hong
Y. Ro
DiffM
192
31
0
15 Aug 2023
iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN
Interspeech (Interspeech), 2023
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Shogo Seki
166
8
0
14 Aug 2023
ModelScope Text-to-Video Technical Report
Jiuniu Wang
Hangjie Yuan
Dayou Chen
Yingya Zhang
Xiang Wang
Shiwei Zhang
VGen
DiffM
348
615
0
12 Aug 2023
Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model
Fan Zhang
Naye Ji
Fuxing Gao
Siyuan Zhao
Zhaohan Wang
Shunman Li
249
0
0
11 Aug 2023
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
Haohe Liu
Yiitan Yuan
Xubo Liu
Xinhao Mei
Qiuqiang Kong
Qiao Tian
Yuping Wang
Wenwu Wang
Yuxuan Wang
Mark D. Plumbley
DiffM
356
387
0
10 Aug 2023
On Error Propagation of Diffusion Models
International Conference on Learning Representations (ICLR), 2023
Yangming Li
M. Schaar
DiffM
259
25
0
09 Aug 2023
JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models
Conference on Algebraic Informatics (CAI), 2023
Peike Li
Bo-Yu Chen
Yao Yao
Yikai Wang
Allen Wang
Alex Jinpeng Wang
MGen
VLM
DiffM
672
53
0
09 Aug 2023
From Unimodal to Multimodal: improving sEMG-Based Pattern Recognition via deep generative models
Wentao Wei
Linyan Ren
109
2
0
08 Aug 2023
Diffusion Model in Causal Inference with Unmeasured Confounders
IEEE Symposium Series on Computational Intelligence (IEEE-SSCI), 2023
Tatsuhiro Shimizu
DiffM
270
6
0
07 Aug 2023
DiffDance: Cascaded Human Motion Diffusion Model for Dance Generation
ACM Multimedia (ACM MM), 2023
Qiaosong Qi
Le Zhuo
Aixi Zhang
Yue Liao
Fei Fang
Si Liu
Shuicheng Yan
223
38
0
05 Aug 2023
Improved Order Analysis and Design of Exponential Integrator for Diffusion Models Sampling
Qinsheng Zhang
Jiaming Song
Yongxin Chen
DiffM
188
16
0
04 Aug 2023
Synthesizing Long-Term Human Motions with Diffusion Models via Coherent Sampling
ACM Multimedia (ACM MM), 2023
Zhaohui Yang
Fuchun Sun
Ji-Rong Wen
DiffM
244
20
0
03 Aug 2023
MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Kai Chen
Yusong Wu
Haohe Liu
Marianna Nezhurina
Taylor Berg-Kirkpatrick
Shlomo Dubnov
DiffM
253
128
0
03 Aug 2023
From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion
Neural Information Processing Systems (NeurIPS), 2023
Robin San Roman
Yossi Adi
Antoine Deleforge
Romain Serizel
Gabriel Synnaeve
Alexandre Défossez
DiffM
255
37
0
02 Aug 2023
DAVIS: High-Quality Audio-Visual Separation with Generative Diffusion Models
Asian Conference on Computer Vision (ACCV), 2023
Chao Huang
Susan Liang
Yapeng Tian
Anurag Kumar
Chenliang Xu
DiffM
183
4
0
31 Jul 2023
Image Synthesis under Limited Data: A Survey and Taxonomy
International Journal of Computer Vision (IJCV), 2023
Mengping Yang
Zhe Wang
241
16
0
31 Jul 2023
A Novel DDPM-based Ensemble Approach for Energy Theft Detection in Smart Grids
Xun Yuan
Yang Yang
Asif Iqbal
P. Gope
Biplab Sikdar
DiffM
147
3
0
30 Jul 2023
RGB-D-Fusion: Image Conditioned Depth Diffusion of Humanoid Subjects
IEEE Access (IEEE Access), 2023
Sascha Kirch
Valeria Olyunina
Jan Ondřej
Rafael Pagés
Sergio Martín
Clara Pérez-Molina
200
3
0
29 Jul 2023
Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Chunyu Qiang
Hao Li
Hao Ni
He Qu
Ruibo Fu
Tao Wang
Longbiao Wang
Jianwu Dang
DiffM
199
16
0
28 Jul 2023
TransFusion: Generating Long, High Fidelity Time Series using Diffusion Models with Transformers
Machine Learning with Applications (MLWA), 2023
Md Fahim Sikder
R. Ramachandranpillai
Fredrik Heintz
DiffM
262
20
0
24 Jul 2023
Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time Series Forecasting
Neural Information Processing Systems (NeurIPS), 2023
Marcel Kollovieh
Abdul Fatir Ansari
Michael Bohlke-Schneider
Jasper Zschiegner
Hao Wang
Yuyang Wang
DiffM
AI4TS
316
90
0
21 Jul 2023
Progressive distillation diffusion for raw music generation
Svetlana Pavlova
DiffM
227
0
0
20 Jul 2023
Previous
1
2
3
...
13
14
15
...
21
22
23
Next
Page 14 of 23
Page
of 23
Go