Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.03499
Cited By
WaveNet: A Generative Model for Raw Audio
12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"WaveNet: A Generative Model for Raw Audio"
50 / 3,038 papers shown
Title
A Survey of Deep Learning Audio Generation Methods
Matej Bozic
Marko Horvat
VLM
MedIm
52
0
0
31 May 2024
RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text
Jiaben Chen
Xin Yan
Yihang Chen
Siyuan Cen
Qinwei Ma
Haoyu Zhen
Kaizhi Qian
Lie Lu
Chuang Gan
38
0
0
30 May 2024
Fill in the Gap! Combining Self-supervised Representation Learning with Neural Audio Synthesis for Speech Inpainting
Ihab Asaad
Maxime Jacquelin
Olivier Perrotin
Laurent Girin
Thomas Hueber
33
0
0
30 May 2024
Auxiliary Knowledge-Induced Learning for Automatic Multi-Label Medical Document Classification
Xindi Wang
Robert E. Mercer
Frank Rudzicz
32
0
0
29 May 2024
Predicting Parking Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach
Huaiwu Zhang
Yutong Xia
Siru Zhong
Kun Wang
Zekun Tong
Qingsong Wen
Roger Zimmermann
Yuxuan Liang
28
0
0
29 May 2024
Proof of Quality: A Costless Paradigm for Trustless Generative AI Model Inference on Blockchains
Zhenjie Zhang
Yuyang Rao
Hao Xiao
Xiaokui Xiao
Yin Yang
26
4
0
28 May 2024
Sok: Comprehensive Security Overview, Challenges, and Future Directions of Voice-Controlled Systems
Haozhe Xu
Cong Wu
Yangyang Gu
Xingcan Shang
Jing Chen
Kun He
Ruiying Du
34
3
0
27 May 2024
CNN-based Compressor Mass Flow Estimator in Industrial Aircraft Vapor Cycle System
Justin Reverdi
Sixin Zhang
Said Aoues
Fabrice Gamboa
Serge Gratton
Thomas Pellegrini
34
0
0
27 May 2024
EEG-DBNet: A Dual-Branch Network for Temporal-Spectral Decoding in Motor-Imagery Brain-Computer Interfaces
Xicheng Lou
Xinwei Li
Hongying Meng
Jun Hu
Meili Xu
Yue Zhao
Jiazhang Yang
Zhangyong Li
27
1
0
25 May 2024
The Rarity of Musical Audio Signals Within the Space of Possible Audio Generation
Nick Collins
MGen
33
0
0
23 May 2024
Fisher Flow Matching for Generative Modeling over Discrete Data
Oscar Davis
Samuel Kessler
Mircea Petrache
.Ismail .Ilkan Ceylan
Michael M. Bronstein
A. Bose
42
16
0
23 May 2024
Leveraging 2D Information for Long-term Time Series Forecasting with Vanilla Transformers
Xin Cheng
Xiuying Chen
Shuqi Li
Di Luo
Xun Wang
Dongyan Zhao
Rui Yan
AI4TS
66
1
0
22 May 2024
A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation
Gwanghyun Kim
Alonso Martinez
Yu-Chuan Su
Brendan Jou
José Lezama
...
Lijun Yu
Lu Jiang
A. Jansen
Jacob Walker
Krishna Somandepalli
28
8
0
22 May 2024
DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation
Weiting Tan
Jingyu Zhang
Lingfeng Shen
Daniel Khashabi
Philipp Koehn
27
0
0
22 May 2024
PT43D: A Probabilistic Transformer for Generating 3D Shapes from Single Highly-Ambiguous RGB Images
Yiheng Xiong
Angela Dai
ViT
27
0
0
20 May 2024
Deep Ensemble Art Style Recognition
Orfeas Menis-Mastromichalakis
Natasa Sofou
Giorgos Stamou
3DPC
18
11
0
19 May 2024
Switched Flow Matching: Eliminating Singularities via Switching ODEs
Qunxi Zhu
Wei Lin
33
1
0
19 May 2024
Generative Artificial Intelligence: A Systematic Review and Applications
S. S. Sengar
Affan Bin Hasan
Sanjay Kumar
Fiona Carroll
MedIm
28
51
0
17 May 2024
FLEXIBLE: Forecasting Cellular Traffic by Leveraging Explicit Inductive Graph-Based Learning
D. Ngo
Kandaraj Piamrat
Ons Aouedi
Thomas Hassan
Philippe Raipin-Parvédy
AI4TS
26
0
0
14 May 2024
FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation
Jianyi Chen
Wei Xue
Xu Tan
Zhen Ye
Qi-fei Liu
Yi-Ting Guo
42
2
0
13 May 2024
Beyond traditional Magnetic Resonance processing with Artificial Intelligence
Amir Jahangiri
Vladislav Orekhov
13
0
0
13 May 2024
Multi-Scale Dilated Convolution Network for Long-Term Time Series Forecasting
Feifei Li
Suhan Guo
Feng Han
Jian Zhao
Shen Furao
36
1
0
09 May 2024
The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio
Yuankun Xie
Yi Lu
Ruibo Fu
Zhengqi Wen
Zhiyong Wang
...
Xiaopeng Wang
Yukun Liu
Haonan Cheng
Long Ye
Yi Sun
47
15
0
08 May 2024
HILCodec: High Fidelity and Lightweight Neural Audio Codec
S. Ahn
Beom Jun Woo
Mingrui Han
Chanyeong Moon
Nam Soo Kim
21
6
0
08 May 2024
VAEneu: A New Avenue for VAE Application on Probabilistic Forecasting
Alireza Koochali
Ensiye Tahaei
Andreas Dengel
Sheraz Ahmed
AI4TS
28
1
0
07 May 2024
Detecting music deepfakes is easy but actually hard
Darius Afchar
Gabriel Meseguer-Brocal
Romain Hennequin
63
6
0
07 May 2024
UniGen: Unified Modeling of Initial Agent States and Trajectories for Generating Autonomous Driving Scenarios
R. Mahjourian
Rongbing Mu
Valerii Likhosherstov
Paul Mougin
Xiukun Huang
Joao Messias
Shimon Whiteson
26
7
0
06 May 2024
Embedded Distributed Inference of Deep Neural Networks: A Systematic Review
Federico Nicolás Peccia
Oliver Bringmann
30
0
0
06 May 2024
Multi-Modality Spatio-Temporal Forecasting via Self-Supervised Learning
Jiewen Deng
Renhe Jiang
Jiaqi Zhang
Xuan Song
AI4TS
25
3
0
06 May 2024
ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers
Yuzhe Gu
Enmao Diao
27
4
0
30 Apr 2024
CONTUNER: Singing Voice Beautifying with Pitch and Expressiveness Condition
Jianzong Wang
Pengcheng Li
Xulong Zhang
Ning Cheng
Jing Xiao
26
0
0
30 Apr 2024
Evaluating the effectiveness of predicting covariates in LSTM Networks for Time Series Forecasting
Gareth Davies
AI4TS
27
1
0
29 Apr 2024
TI-ASU: Toward Robust Automatic Speech Understanding through Text-to-speech Imputation Against Missing Speech Modality
Tiantian Feng
Xuan Shi
Rahul Gupta
Shrikanth S. Narayanan
41
0
0
27 Apr 2024
Any-Quantile Probabilistic Forecasting of Short-Term Electricity Demand
Slawek Smyl
Boris N. Oreshkin
Paweł Pełka
Grzegorz Dudek
AI4TS
37
0
0
26 Apr 2024
LM-IGTD: a 2D image generator for low-dimensional and mixed-type tabular data to leverage the potential of convolutional neural networks
Vanesa Gómez-Martínez
F. J. Lara-Abelenda
Pablo Peiro-Corbacho
David Chushig-Muzo
C. Granja
C. Soguero-Ruíz
LMTD
27
1
0
26 Apr 2024
An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoder
Yicheng Gu
Xueyao Zhang
Liumeng Xue
Haizhou Li
Zhizheng Wu
28
2
0
26 Apr 2024
Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges
Badri N. Patro
Vijay Srinivas Agneeswaran
Mamba
46
38
0
24 Apr 2024
Music Style Transfer With Diffusion Model
Hong Huang
Yuyi Wang
Luyao Li
Jun Lin
DiffM
22
0
0
23 Apr 2024
FlashSpeech: Efficient Zero-Shot Speech Synthesis
Zhen Ye
Zeqian Ju
Haohe Liu
Xu Tan
Jianyi Chen
...
Weizhen Bian
Shulin He
Qi-fei Liu
Yi-Ting Guo
Wei Xue
38
16
0
23 Apr 2024
LVNS-RAVE: Diversified audio generation with RAVE and Latent Vector Novelty Search
Jinyue Guo
Anna-Maria Christodoulou
Balint Laczko
K. Glette
23
0
0
22 Apr 2024
Audio Anti-Spoofing Detection: A Survey
Menglu Li
Yasaman Ahmadiadli
Xiao-Ping Zhang
42
17
0
22 Apr 2024
Large Language Models: From Notes to Musical Form
Lilac Atassi
21
0
0
18 Apr 2024
Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach
Mir Rayat Imtiaz Hossain
Mennatullah Siam
Leonid Sigal
James J. Little
VLM
38
5
0
17 Apr 2024
Decoupled Weight Decay for Any
p
p
p
Norm
N. Outmezguine
Noam Levi
21
2
0
16 Apr 2024
Hardware-aware training of models with synaptic delays for digital event-driven neuromorphic processors
A. Patiño-Saucedo
Roy Meijer
Amirreza Yousefzadeh
M. Gomony
Federico Corradi
Paul Detterer
Laura Garrido-Regife
B. Linares-Barranco
Manolis Sifalakis
11
2
0
16 Apr 2024
Long-form music generation with latent diffusion
Zach Evans
Julian Parker
CJ Carr
Zack Zukowski
Josiah Taylor
Jordi Pons
MGen
DiffM
38
39
0
16 Apr 2024
A Survey on Deep Learning for Theorem Proving
Zhaoyu Li
Jialiang Sun
Logan Murphy
Qidong Su
Zenan Li
Xian Zhang
Kaiyu Yang
Xujie Si
LRM
42
21
0
15 Apr 2024
High Significant Fault Detection in Azure Core Workload Insights
Pranay Lohia
Laurent Boué
Sharath Ranganath
Vijay Srinivas Agneeswaran
AI4CE
16
1
0
14 Apr 2024
Foundational GPT Model for MEG
Richard Csaky
M. Es
Oiwi Parker Jones
M. Woolrich
34
2
0
14 Apr 2024
Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping
Kevin Zhang
Luka Chkhetiani
Francis McCann Ramirez
Yash Khare
Andrea Vanzo
...
Ruben Bousbib
Taufiquzzaman Peyash
Michael Nguyen
Dillon Pulliam
Domenic Donato
27
2
0
10 Apr 2024
Previous
1
2
3
...
5
6
7
...
59
60
61
Next