Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.14356
Cited By
Upsampling artifacts in neural audio synthesis
27 October 2020
Jordi Pons
Santiago Pascual
Giulio Cengarle
Joan Serra
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Upsampling artifacts in neural audio synthesis"
40 / 40 papers shown
Title
QINCODEC: Neural Audio Compression with Implicit Neural Codebooks
Zineb Lahrichi
Gaëtan Hadjeres
Gaël Richard
Geoffroy Peeters
42
0
0
19 Mar 2025
Artifact-free Sound Quality in DNN-based Closed-loop Systems for Audio Processing
chuan Wen
Guy Torfs
Sarah Verhulst
36
0
0
17 Feb 2025
Why disentanglement-based speaker anonymization systems fail at preserving emotions?
Ünal Ege Gaznepoglu
Nils Peters
83
0
0
22 Jan 2025
Wavehax: Aliasing-Free Neural Waveform Synthesis Based on 2D Convolution and Harmonic Prior for Reliable Complex Spectrogram Estimation
Reo Yoneyama
Atsushi Miyashita
Ryuichi Yamamoto
T. Toda
27
1
0
11 Nov 2024
Diff-MST: Differentiable Mixing Style Transfer
Soumya Sai Vanka
Christian Steinmetz
Jean-Baptiste Rolland
Joshua Reiss
George Fazekas
23
4
0
11 Jul 2024
JenGAN: Stacked Shifted Filters in GAN-Based Speech Synthesis
Hyunjae Cho
Junhyeok Lee
Wonbin Jung
16
0
0
10 Jun 2024
Ambisonizer: Neural Upmixing as Spherical Harmonics Generation
Yongyi Zang
Yifan Wang
Minglun Lee
21
1
0
22 May 2024
The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio
Yuankun Xie
Yi Lu
Ruibo Fu
Zhengqi Wen
Zhiyong Wang
...
Xiaopeng Wang
Yukun Liu
Haonan Cheng
Long Ye
Yi Sun
47
15
0
08 May 2024
PromptCodec: High-Fidelity Neural Speech Codec using Disentangled Representation Learning based Adaptive Feature-aware Prompt Encoders
Yu Pan
Lei Ma
Jianjun Zhao
32
4
0
03 Apr 2024
Distinguishing Neural Speech Synthesis Models Through Fingerprints in Speech Waveforms
Chu Yuan Zhang
Jiangyan Yi
Jianhua Tao
Chenglong Wang
Xinrui Yan
8
2
0
13 Sep 2023
A Review of Differentiable Digital Signal Processing for Music & Speech Synthesis
B. Hayes
Jordie Shier
Gyorgy Fazekas
Andrew Mcpherson
C. Saitis
27
21
0
29 Aug 2023
Mono-to-stereo through parametric stereo generation
Joan Serra
D. Scaini
Santiago Pascual
Daniel Arteaga
Jordi Pons
J. Breebaart
Giulio Cengarle
DiffM
13
4
0
26 Jun 2023
High-Fidelity Audio Compression with Improved RVQGAN
Rithesh Kumar
Prem Seetharaman
Alejandro Luebs
I. Kumar
Kundan Kumar
33
282
0
11 Jun 2023
A Survey on Audio Diffusion Models: Text To Speech Synthesis and Enhancement in Generative AI
Chenshuang Zhang
Chaoning Zhang
Sheng Zheng
Mengchun Zhang
Maryam Qamar
Sung-Ho Bae
In So Kweon
DiffM
MedIm
41
64
0
23 Mar 2023
Native Multi-Band Audio Coding within Hyper-Autoencoded Reconstruction Propagation Networks
Darius Petermann
Inseon Jang
Minje Kim
11
1
0
14 Mar 2023
Adversarial Permutation Invariant Training for Universal Sound Separation
Emilian Postolache
Jordi Pons
Santiago Pascual
Joan Serra
VLM
21
6
0
21 Oct 2022
Music Separation Enhancement with Generative Modeling
N. Schaffer
Boaz Cogan
Ethan Manilow
Max Morrison
Prem Seetharaman
Bryan Pardo
20
9
0
26 Aug 2022
Automatic music mixing with deep learning and out-of-domain data
Marco A. Martínez Ramírez
Wei-Hsiang Liao
Giorgio Fabbro
Stefan Uhlich
Chihiro Nagashima
Yuki Mitsufuji
29
24
0
24 Aug 2022
PodcastMix: A dataset for separating music and speech in podcasts
Nico M. Schmidt
Jordi Pons
M. Miron
11
2
0
15 Jul 2022
DrumGAN VST: A Plugin for Drum Sound Analysis/Synthesis With Autoencoding Generative Adversarial Networks
J. Nistal
Cyran Aouameur
Ithan Velarde
Stefan Lattner
GAN
37
4
0
29 Jun 2022
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
Taejun Bak
Junmo Lee
Hanbin Bae
Jinhyeok Yang
Jaesung Bae
Young-Sun Joo
23
27
0
27 Jun 2022
Streaming non-autoregressive model for any-to-many voice conversion
Ziyi Chen
Haoran Miao
Pengyuan Zhang
19
8
0
15 Jun 2022
BigVGAN: A Universal Neural Vocoder with Large-Scale Training
Sang-gil Lee
Wei Ping
Boris Ginsburg
Bryan Catanzaro
Sung-Hoon Yoon
17
224
0
09 Jun 2022
Universal Speech Enhancement with Score-based Diffusion
Joan Serra
Santiago Pascual
Jordi Pons
R. O. Araz
D. Scaini
DiffM
17
95
0
07 Jun 2022
Fully Convolutional Fractional Scaling
Michael Soloveitchik
M. Werman
18
0
0
20 Mar 2022
On loss functions and evaluation metrics for music source separation
Enric Gusó
Jordi Pons
Santiago Pascual
Joan Serra
11
19
0
16 Feb 2022
EcoFlow: Efficient Convolutional Dataflows for Low-Power Neural Network Accelerators
Lois Orosa
Skanda Koppula
Yaman Umuroglu
Konstantinos Kanellopoulos
Juan Gómez Luna
Michaela Blott
K. Vissers
O. Mutlu
38
4
0
04 Feb 2022
PostGAN: A GAN-Based Post-Processor to Enhance the Quality of Coded Speech
Srikanth Korse
N. Pia
Kishan Gupta
Guillaume Fuchs
39
14
0
31 Jan 2022
The HCCL-DKU system for fake audio generation task of the 2022 ICASSP ADD Challenge
Ziyi Chen
Hua Hua
Yuxiang Zhang
Ming Li
Pengyuan Zhang
19
0
0
29 Jan 2022
Upsampling layers for music source separation
Jordi Pons
Joan Serra
Santiago Pascual
Giulio Cengarle
Daniel Arteaga
D. Scaini
17
2
0
23 Nov 2021
Hybrid Spectrogram and Waveform Source Separation
Alexandre Défossez
13
160
0
05 Nov 2021
An investigation of pre-upsampling generative modelling and Generative Adversarial Networks in audio super resolution
James King
Ramón Vinas Torné
Alexander Campbell
Pietro Lio'
DiffM
14
1
0
30 Sep 2021
Adversarial Auto-Encoding for Packet Loss Concealment
Santiago Pascual
Joan Serra
Jordi Pons
24
27
0
07 Jul 2021
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Ji-Hoon Kim
Sang-Hoon Lee
Ji-Hyun Lee
Seong-Whan Lee
16
53
0
04 Jun 2021
Generalized Spoofing Detection Inspired from Audio Generation Artifacts
Yang Gao
Tyler Vuong
Mahsa Elyasi
Gaurav Bharaj
Rita Singh
15
19
0
08 Apr 2021
GAN Vocoder: Multi-Resolution Discriminator Is All You Need
J. You
Dalhyun Kim
Gyuhyeon Nam
Geumbyeol Hwang
Gyeongsu Chae
13
27
0
09 Mar 2021
Self-Supervised VQ-VAE for One-Shot Music Style Transfer
Ondřej Cífka
A. Ozerov
Umut Simsekli
G. Richard
19
27
0
10 Feb 2021
High Fidelity Speech Synthesis with Adversarial Networks
Mikolaj Binkowski
Jeff Donahue
Sander Dieleman
Aidan Clark
Erich Elsen
Norman Casagrande
Luis C. Cobo
Karen Simonyan
223
239
0
25 Sep 2019
Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation
Daniel Stoller
Sebastian Ewert
S. Dixon
AI4TS
104
588
0
08 Jun 2018
Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network
Wenzhe Shi
Jose Caballero
Ferenc Huszár
J. Totz
Andrew P. Aitken
Rob Bishop
Daniel Rueckert
Zehan Wang
SupR
195
5,175
0
16 Sep 2016
1