
DiffWave: A Versatile Diffusion Model for Audio Synthesis

International Conference on Learning Representations (ICLR), 2020

21 September 2020

Papers citing "DiffWave: A Versatile Diffusion Model for Audio Synthesis"

50 / 1,133 papers shown

Robust One-Shot Singing Voice Conversion Naoya Takahashi M. Singh Yuki Mitsufuji DiffM 265 9 0 20 Oct 2022
Differentially Private Diffusion Models Tim Dockhorn Tianshi Cao Arash Vahdat Karsten Kreis DiffM 426 127 0 18 Oct 2022
TorchDIVA: An Extensible Computational Model of Speech Production built on an Open-Source Machine Learning Library PLoS ONE (PLoS ONE), 2022 Sean M. Kinahan J. Liss Visar Berisha 48 2 0 17 Oct 2022
DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models International Conference on Learning Representations (ICLR), 2022 Shansan Gong Mukai Li Jiangtao Feng Zhiyong Wu Lingpeng Kong 394 448 0 17 Oct 2022
DiffGAR: Model-Agnostic Restoration from Generative Artifacts Using Image-to-Image Diffusion Models International Conference on Computer Science and Artificial Intelligence (ICCSAI), 2022 Yueqin Yin Lianghua Huang Yu Liu Kaiqiang Huang DiffM 138 12 0 16 Oct 2022
TransFusion: Transcribing Speech with Multinomial Diffusion Matthew Baas Kevin Eloff Herman Kamper DiffM 79 6 0 14 Oct 2022
Hierarchical Diffusion Models for Singing Voice Neural Vocoder IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022 Naoya Takahashi Mayank Kumar Singh Yuki Mitsufuji DiffM 258 18 0 14 Oct 2022
LION: Latent Point Diffusion Models for 3D Shape Generation Neural Information Processing Systems (NeurIPS), 2022 Fangyin Wei Arash Vahdat Francis Williams Zan Gojcic Or Litany Sanja Fidler Karsten Kreis DiffM 350 618 0 12 Oct 2022
Human Joint Kinematics Diffusion-Refinement for Stochastic Motion Prediction AAAI Conference on Artificial Intelligence (AAAI), 2022 Dong Wei Huaijiang Sun Bin Li Jianfeng Lu Weiqing Li Xiaoning Sun Sheng-liang Hu DiffM VGen 197 61 0 12 Oct 2022
Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance Chen Henry Wu Fernando de la Torre DiffM 358 79 0 11 Oct 2022
GENIE: Higher-Order Denoising Diffusion Solvers Neural Information Processing Systems (NeurIPS), 2022 Tim Dockhorn Arash Vahdat Karsten Kreis DiffM 305 141 0 11 Oct 2022
GAN You Hear Me? Reclaiming Unconditional Speech Synthesis from Diffusion Models Spoken Language Technology Workshop (SLT), 2022 Matthew Baas Herman Kamper DiffM 168 8 0 11 Oct 2022
DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022 K. Cheuk Ryosuke Sawata Toshimitsu Uesaka Naoki Murata Naoya Takahashi Shusuke Takahashi Dorien Herremans Yuki Mitsufuji DiffM 162 21 0 11 Oct 2022
Sequential Neural Score Estimation: Likelihood-Free Inference with Conditional Score Based Diffusion Models International Conference on Machine Learning (ICML), 2022 Louis Sharrock J. Simons Song Liu Mark Beaumont DiffM 259 50 0 10 Oct 2022
FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation International Conference on Machine Learning (ICML), 2022 Chieh-Hsin Lai Yuhta Takida Naoki Murata Toshimitsu Uesaka Yuki Mitsufuji Stefano Ermon DiffM 226 38 0 09 Oct 2022
On Distillation of Guided Diffusion Models Computer Vision and Pattern Recognition (CVPR), 2022 Chenlin Meng Robin Rombach Ruiqi Gao Diederik P. Kingma Stefano Ermon Jonathan Ho Tim Salimans VLM DiffM 224 694 0 06 Oct 2022
PSVRF: Learning to restore Pitch-Shifted Voice without reference Yangfu Li Xiaodan Lin Jiaxin Yang 121 0 0 06 Oct 2022
DALL-E-Bot: Introducing Web-Scale Diffusion Models to Robotics IEEE Robotics and Automation Letters (RA-L), 2022 Ivan Kapelyukh Vitalis Vosylius Edward Johns LM&Ro DiffM 492 175 0 05 Oct 2022
Imagen Video: High Definition Video Generation with Diffusion Models Jonathan Ho William Chan Chitwan Saharia Jay Whang Ruiqi Gao ... Diederik P. Kingma Ben Poole Mohammad Norouzi David J. Fleet Tim Salimans VGen 408 1,843 0 05 Oct 2022
Progressive Text-to-Image Generation Zhengcong Fei Mingyuan Fan Li Zhu Junshi Huang 268 4 0 05 Oct 2022
Vision+X: A Survey on Multimodal Learning in the Light of Data IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022 Ye Zhu Yuehua Wu Andrii Zadaianchuk Yan Yan 342 37 0 05 Oct 2022
WaveFit: An Iterative and Non-autoregressive Neural Vocoder based on Fixed-Point Iteration Spoken Language Technology Workshop (SLT), 2022 Yuma Koizumi Kohei Yatabe Heiga Zen M. Bacchiani DiffM 194 33 0 03 Oct 2022
OCD: Learning to Overfit with Conditional Diffusion Models International Conference on Machine Learning (ICML), 2022 Shahar Lutati Lior Wolf DiffM 319 10 0 02 Oct 2022
Protein structure generation via folding diffusion Nature Communications (Nat Commun), 2022 Kevin E. Wu Kevin Kaichuang Yang Rianne van den Berg James Zou Alex X. Lu Ava P. Amini DiffM 353 253 0 30 Sep 2022
TabDDPM: Modelling Tabular Data with Diffusion Models International Conference on Machine Learning (ICML), 2022 Akim Kotelnikov Dmitry Baranchuk Ivan Rubachev Artem Babenko DiffM 241 401 0 30 Sep 2022
Equivariant Energy-Guided SDE for Inverse Molecular Design International Conference on Learning Representations (ICLR), 2022 Fan Bao Min Zhao Zhongkai Hao Pei‐Yun Li Chongxuan Li Jun Zhu DiffM 1.1K 79 0 30 Sep 2022
DreamFusion: Text-to-3D using 2D Diffusion International Conference on Learning Representations (ICLR), 2022 Ben Poole Ajay Jain Jonathan T. Barron B. Mildenhall 822 3,124 0 29 Sep 2022
ButterflyFlow: Building Invertible Layers with Butterfly Matrices International Conference on Machine Learning (ICML), 2022 Chenlin Meng Linqi Zhou Kristy Choi Tri Dao Stefano Ermon TPM 286 13 0 28 Sep 2022
On Investigating the Conservative Property of Score-Based Generative Models International Conference on Machine Learning (ICML), 2022 Chen-Hao Chao Wei-Fang Sun Bo Wun Cheng Chun-Yi Lee 248 15 0 26 Sep 2022
All are Worth Words: A ViT Backbone for Diffusion Models Computer Vision and Pattern Recognition (CVPR), 2022 Fan Bao Shen Nie Kaiwen Xue Yue Cao Chongxuan Li Hang Su Jun Zhu VLM 541 495 0 25 Sep 2022
Controllable Accented Text-to-Speech Synthesis Rui Liu Berrak Sisman Guanglai Gao Haizhou Li 177 6 0 22 Sep 2022
Mandarin Singing Voice Synthesis with Denoising Diffusion Probabilistic Wasserstein GAN Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022 Yin-Ping Cho Yu Tsao Hsin-Min Wang Yi-Wen Liu DiffM 221 9 0 21 Sep 2022
Denoising Diffusion Error Correction Codes International Conference on Learning Representations (ICLR), 2022 Yoni Choukroun Lior Wolf DiffM 184 37 0 16 Sep 2022
MDM: Molecular Diffusion Model for 3D Molecule Generation AAAI Conference on Artificial Intelligence (AAAI), 2022 Lei Huang Hengtong Zhang Qifeng Bai Ka-Chun Wong DiffM 254 112 0 13 Sep 2022
Blurring Diffusion Models International Conference on Learning Representations (ICLR), 2022 Emiel Hoogeboom Tim Salimans DiffM 353 92 0 12 Sep 2022
Soft Diffusion: Score Matching for General Corruptions Giannis Daras M. Delbracio Hossein Talebi A. Dimakis P. Milanfar DiffM 254 121 0 12 Sep 2022
AudioLM: a Language Modeling Approach to Audio Generation IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022 Zalan Borsos Raphaël Marinier Damien Vincent Eugene Kharitonov Olivier Pietquin ... Dominik Roblek O. Teboul David Grangier Marco Tagliasacchi Neil Zeghidour AuLLM 376 806 0 07 Sep 2022
Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow International Conference on Learning Representations (ICLR), 2022 Xingchao Liu Chengyue Gong Qiang Liu OOD 1.0K 1,944 0 07 Sep 2022
A Survey on Generative Diffusion Model IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022 Hanqun Cao Cheng Tan Zhangyang Gao Yilun Xu Guangyong Chen Pheng-Ann Heng Stan Z. Li MedIm 745 408 0 06 Sep 2022
First Hitting Diffusion Models for Generating Manifold, Graph and Categorical Data Neural Information Processing Systems (NeurIPS), 2022 Mao Ye Lemeng Wu Qiang Liu DiffM 352 21 0 02 Sep 2022
Diffusion Models: A Comprehensive Survey of Methods and Applications ACM Computing Surveys (ACM CSUR), 2022 Ling Yang Zhilong Zhang Yingxia Shao Shenda Hong Runsheng Xu Yue Zhao Wentao Zhang Tengjiao Wang Ming-Hsuan Yang DiffM MedIm 1.4K 1,869 0 02 Sep 2022
Evaluating generative audio systems and their metrics International Society for Music Information Retrieval Conference (ISMIR), 2022 Ashvala Vinay Alexander Lerch 251 26 0 31 Aug 2022
Let us Build Bridges: Understanding and Extending Diffusion Generative Models Xingchao Liu Lemeng Wu Mao Ye Qiang Liu DiffM 192 97 0 31 Aug 2022
Mel Spectrogram Inversion with Stable Pitch International Society for Music Information Retrieval Conference (ISMIR), 2022 Bruno Di Giorgi M. Levy Richard Sharp 133 8 0 26 Aug 2022
Diffusion-based Time Series Imputation and Forecasting with Structured State Space Models Juan Miguel Lopez Alcaraz Nils Strodthoff DiffM 329 233 0 19 Aug 2022
One-shot Generative Prior in Hankel-k-space for Parallel Imaging Reconstruction IEEE Transactions on Medical Imaging (IEEE TMI), 2022 Hong Peng Chenbo Jiang Jing Cheng Minghui Zhang Shanshan Wang Dong Liang Qiegen Liu DiffM MedIm 247 22 0 15 Aug 2022
Wavelet Score-Based Generative Modeling Neural Information Processing Systems (NeurIPS), 2022 Florentin Guth Simon Coste Valentin De Bortoli S. Mallat DiffM 233 77 0 09 Aug 2022
DDSP-based Singing Vocoders: A New Subtractive-based Synthesizer and A Comprehensive Evaluation International Society for Music Information Retrieval Conference (ISMIR), 2022 Da-Yi Wu Wen-Yi Hsiao Fu-Rong Yang Oscar D. Friedman Warren Jackson Scott Bruzenak Yi-Wen Liu Yi-Hsuan Yang DiffM 235 26 0 09 Aug 2022
AdaCat: Adaptive Categorical Discretization for Autoregressive Models Conference on Uncertainty in Artificial Intelligence (UAI), 2022 Qiyang Li Ajay Jain Pieter Abbeel OffRL 184 4 0 03 Aug 2022
DeScoD-ECG: Deep Score-Based Diffusion Model for ECG Baseline Wander and Noise Removal IEEE Journal of Biomedical and Health Informatics (IEEE JBHI), 2022 Huayu Li G. Ditzler Janet Roveda Ao Li DiffM 191 76 0 31 Jul 2022
