2009.09761
DiffWave: A Versatile Diffusion Model for Audio Synthesis
21 September 2020
Zhifeng Kong
Wei Ping
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM
BDL
Papers citing
"DiffWave: A Versatile Diffusion Model for Audio Synthesis"
50 / 977 papers shown
Title
MCGM: Mask Conditional Text-to-Image Generative Model
Rami Skaik
Leonardo Rossi
Tomaso Fontanini
Andrea Prati
DiffM
28
0
0
01 Oct 2024
Recent Advances in Speech Language Models: A Survey
Wenqian Cui
Dianzhi Yu
Xiaoqi Jiao
Ziqiao Meng
Guangyan Zhang
Qichao Wang
Yiwen Guo
Irwin King
AuLLM
59
14
0
01 Oct 2024
Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner
Chenyou Fan
Chenjia Bai
Zhao Shan
Haoran He
Yang Zhang
Zhen Wang
31
3
0
30 Sep 2024
Simple and Fast Distillation of Diffusion Models
Zhenyu Zhou
Defang Chen
Can Wang
Chun Chen
Siwei Lyu
DiffM
35
5
0
29 Sep 2024
Gradient-free Decoder Inversion in Latent Diffusion Models
Seongmin Hong
Suh Yoon Jeon
Kyeonghyun Lee
Ernest K. Ryu
S. Chun
26
0
0
27 Sep 2024
Ordinary Differential Equations for Enhanced 12-Lead ECG Generation
Yakir Yehuda
Kira Radinsky
SyDa
16
0
0
26 Sep 2024
Learning Quantized Adaptive Conditions for Diffusion Models
Yuchen Liang
Yuchuan Tian
Lei Yu
Huao Tang
Jie Hu
Xiangzhong Fang
Hanting Chen
DiffM
32
0
0
26 Sep 2024
A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation
Masato Ishii
Akio Hayakawa
Takashi Shibuya
Yuki Mitsufuji
VGen
DiffM
65
4
0
26 Sep 2024
TFG: Unified Training-Free Guidance for Diffusion Models
Haotian Ye
Haowei Lin
Jiaqi Han
Minkai Xu
Sheng Liu
Yitao Liang
Jianzhu Ma
James Zou
Stefano Ermon
31
13
0
24 Sep 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
Weifeng Lin
Xinyu Wei
Renrui Zhang
Le Zhuo
Shitian Zhao
...
Junlin Xie
Yu Qiao
Peng Gao
Hongsheng Li
MLLM
DiffM
57
10
0
23 Sep 2024
A Comprehensive Survey with Critical Analysis for Deepfake Speech Detection
Lam Pham
Phat Lam
Dat Tran
Hieu Tang
Tin Nguyen
Alexander Schindler
Canh Vu
Alexander Polonsky
46
3
0
23 Sep 2024
A Large Language Model and Denoising Diffusion Framework for Targeted Design of Microstructures with Commands in Natural Language
Nikita Kartashov
Nikolaos N. Vlassis
DiffM
AI4CE
25
1
0
22 Sep 2024
Sketching With Your Voice: "Non-Phonorealistic" Rendering of Sounds via Vocal Imitation
Matthew Caren
Kartik Chandra
J. Tenenbaum
Jonathan Ragan-Kelley
Karima Ma
33
0
0
20 Sep 2024
DNI: Dilutional Noise Initialization for Diffusion Video Editing
Sunjae Yoon
Gwanhyeong Koo
Ji Woo Hong
Chang D. Yoo
DiffM
36
2
0
19 Sep 2024
DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech
Xin Qi
Ruibo Fu
Zhengqi Wen
Tao Wang
Chunyu Qiang
...
Xiaopeng Wang
Yuankun Xie
Yukun Liu
Xuefei Liu
Guanjun Li
DiffM
28
0
0
18 Sep 2024
DifFaiRec: Generative Fair Recommender with Conditional Diffusion Model
Zhenhao Jiang
Jicong Fan
FaML
13
1
0
18 Sep 2024
SpoofCeleb: Speech Deepfake Detection and SASV In The Wild
Jee-weon Jung
Yihan Wu
Xin Wang
Ji-Hoon Kim
Soumi Maiti
...
Joon Son Chung
Wangyou Zhang
Seyun Um
Shinnosuke Takamichi
Shinji Watanabe
62
1
0
18 Sep 2024
Improving Robustness of Diffusion-Based Zero-Shot Speech Synthesis via Stable Formant Generation
C. Han
Seokgi Lee
Gyuhyeon Nam
Gyeongsu Chae
DiffM
118
0
0
14 Sep 2024
Latent Space Score-based Diffusion Model for Probabilistic Multivariate Time Series Imputation
Guojun Liang
N. Abiri
Atiye Sadat Hashemi
Jens Lundstrom
Stefan Byttner
Prayag Tiwari
DiffM
41
0
0
13 Sep 2024
DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset
Jiawei Du
I-Ming Lin
I-Hsiang Chiu
Xuanjun Chen
Haibin Wu
Wenze Ren
Yu Tsao
Hung-yi Lee
Jyh-Shing Roger Jang
DiffM
35
2
0
13 Sep 2024
Text-To-Speech Synthesis In The Wild
Jee-weon Jung
Wangyou Zhang
Soumi Maiti
Yihan Wu
Xin Wang
...
Hye-jin Shim
Nicholas W. D. Evans
Joon Son Chung
Shinnosuke Takamichi
Shinji Watanabe
32
1
0
13 Sep 2024
Think Twice Before You Act: Improving Inverse Problem Solving With MCMC
Y. Zhu
Zehao Dou
Haoxin Zheng
Yasi Zhang
Ying Nian Wu
Ruiqi Gao
DiffM
25
4
0
13 Sep 2024
Sub-graph Based Diffusion Model for Link Prediction
Hang Li
Wei Jin
Geri Skenderi
Harry Shomer
Wenzhuo Tang
Wenqi Fan
Jiliang Tang
DiffM
28
0
0
13 Sep 2024
Alignment of Diffusion Models: Fundamentals, Challenges, and Future
Buhua Liu
Shitong Shao
Bao Li
Lichen Bai
Zhiqiang Xu
Haoyi Xiong
James Kwok
Sumi Helal
Zeke Xie
39
11
0
11 Sep 2024
Table-to-Text Generation with Pretrained Diffusion Models
Aleksei S. Krylov
Oleg D. Somov
40
1
0
10 Sep 2024
FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Yuto Kondo
DiffM
40
0
0
03 Sep 2024
Applications and Advances of Artificial Intelligence in Music Generation: A Review
Yanxu Chen
Linshu Huang
Tian Gou
MGen
31
2
0
03 Sep 2024
DiffEyeSyn: Diffusion-based User-specific Eye Movement Synthesis
Chuhan Jiao
Guanhua Zhang
Zhiming Hu
Andreas Bulling
DiffM
28
1
0
02 Sep 2024
Diffusion Policy Policy Optimization
Allen Z. Ren
Justin Lidard
Lars L. Ankile
Anthony Simeonov
Pulkit Agrawal
Anirudha Majumdar
Benjamin Burchfiel
Hongkai Dai
Max Simchowitz
41
36
0
01 Sep 2024
FLUX that Plays Music
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Junshi Huang
84
7
0
01 Sep 2024
Bridging User Dynamics: Transforming Sequential Recommendations with Schrödinger Bridge and Diffusion Models
Wenjia Xie
Rui Zhou
Hao Wang
Tingjia Shen
Enhong Chen
DiffM
36
9
0
30 Aug 2024
Spiking Diffusion Models
Jiahang Cao
Hanzhong Guo
Ziqing Wang
Deming Zhou
Hao Cheng
Qiang Zhang
Renjing Xu
DiffM
40
2
0
29 Aug 2024
Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas
Fabio Quattrini
Vittorio Pippi
Silvia Cascianelli
Rita Cucchiara
40
3
0
28 Aug 2024
Constrained Diffusion Models via Dual Training
Shervin Khalafi
Dongsheng Ding
Alejandro Ribeiro
35
3
0
27 Aug 2024
Atlas Gaussians Diffusion for 3D Generation
Haitao Yang
Yuan Dong
Hanwen Jiang
Dejia Xu
Georgios Pavlakos
Qixing Huang
3DGS
67
3
0
23 Aug 2024
Efficient Image-to-Image Diffusion Classifier for Adversarial Robustness
Hefei Mei
Minjing Dong
Chang Xu
AAML
51
0
0
16 Aug 2024
PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform Generation
Sang-Hoon Lee
Ha-Yeong Choi
Seong-Whan Lee
OOD
DiffM
AI4TS
43
5
0
14 Aug 2024
Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models
Jean-Marie Lemercier
Eloi Moliner
Simon Welker
Vesa Valimaki
Timo Gerkmann
48
2
0
14 Aug 2024
Leveraging Priors via Diffusion Bridge for Time Series Generation
Jinseong Park
Seungyun Lee
Woojin Jeong
Yujin Choi
Jaewook Lee
DiffM
31
5
0
13 Aug 2024
MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation
Xiaofeng Mao
Zhengkai Jiang
Qilin Wang
Chencan Fu
Jiangning Zhang
Jiafu Wu
Yabiao Wang
Chengjie Wang
Wei Li
Mingmin Chi
72
4
0
06 Aug 2024
A Sharp Convergence Theory for The Probability Flow ODEs of Diffusion Models
Gen Li
Yuting Wei
Yuejie Chi
Yuxin Chen
DiffM
35
22
0
05 Aug 2024
Conditional LoRA Parameter Generation
Aaron Mueller
Millicent Li
Koyena Pal
Wangbo Zhao
Yukun Zhou
Jiuding Sun
Yonatan Belinkov
DiffM
41
3
0
02 Aug 2024
DiffusionCounterfactuals: Inferring High-dimensional Counterfactuals with Guidance of Causal Representations
Jiageng Zhu
Hanchen Xie
Jiazhi Li
Wael Abd-Almageed
DiffM
43
1
0
30 Jul 2024
MambaGesture: Enhancing Co-Speech Gesture Generation with Mamba and Disentangled Multi-Modality Fusion
Chencan Fu
Yabiao Wang
Jiangning Zhang
Zhengkai Jiang
Xiaofeng Mao
Jiafu Wu
Weijian Cao
Chengjie Wang
Yanhao Ge
Yong Liu
Mamba
35
2
0
29 Jul 2024
Piecewise deterministic generative models
Andrea Bertazzi
Alain Durmus
Dario Shariatian
Umut Simsekli
Éric Moulines
DiffM
27
0
0
28 Jul 2024
Self-Supervision Improves Diffusion Models for Tabular Data Imputation
Yixin Liu
Thalaiyasingam Ajanthan
Hisham Husain
Vu-Linh Nguyen
39
9
0
25 Jul 2024
Diffusion Models for Multi-Task Generative Modeling
Changyou Chen
Han Ding
Bunyamin Sisman
Yi Tian Xu
Ouye Xie
Benjamin Z. Yao
Son Dinh Tran
Belinda Zeng
DiffM
37
4
0
24 Jul 2024
WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation
Zirui Shao
Feiyu Gao
Hangdi Xing
Zepeng Zhu
Zhi Yu
Jiajun Bu
Qi Zheng
Cong Yao
21
2
0
22 Jul 2024
DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays
Xuhui Liu
Zhi Qiao
Runkun Liu
Hong Li
Juan Zhang
Xiantong Zhen
Zhen Qian
Baochang Zhang
MedIm
37
2
0
18 Jul 2024
SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow
Yuanzhi Zhu
Xingchao Liu
Qiang Liu
41
9
0
17 Jul 2024