Upsampling artifacts in neural audio synthesis

27 October 2020

Papers citing "Upsampling artifacts in neural audio synthesis"


Title
QINCODEC: Neural Audio Compression with Implicit Neural Codebooks Zineb Lahrichi Gaëtan Hadjeres Gaël Richard Geoffroy Peeters 42 0 0 19 Mar 2025
Artifact-free Sound Quality in DNN-based Closed-loop Systems for Audio Processing Chuan Wen Guy Torfs Sarah Verhulst 36 0 0 17 Feb 2025
Why disentanglement-based speaker anonymization systems fail at preserving emotions? Ünal Ege Gaznepoglu Nils Peters 83 0 0 22 Jan 2025
Wavehax: Aliasing-Free Neural Waveform Synthesis Based on 2D Convolution and Harmonic Prior for Reliable Complex Spectrogram Estimation Reo Yoneyama Atsushi Miyashita Ryuichi Yamamoto T. Toda 27 1 0 11 Nov 2024
Diff-MST: Differentiable Mixing Style Transfer Soumya Sai Vanka Christian Steinmetz Jean-Baptiste Rolland Joshua Reiss George Fazekas 23 4 0 11 Jul 2024
JenGAN: Stacked Shifted Filters in GAN-Based Speech Synthesis Hyunjae Cho Junhyeok Lee Wonbin Jung 16 0 0 10 Jun 2024
Ambisonizer: Neural Upmixing as Spherical Harmonics Generation Yongyi Zang Yifan Wang Minglun Lee 21 1 0 22 May 2024
The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio Yuankun Xie Yi Lu Ruibo Fu Zhengqi Wen Zhiyong Wang ... Xiaopeng Wang Yukun Liu Haonan Cheng Long Ye Yi Sun 47 15 0 08 May 2024
PromptCodec: High-Fidelity Neural Speech Codec using Disentangled Representation Learning based Adaptive Feature-aware Prompt Encoders Yu Pan Lei Ma Jianjun Zhao 32 4 0 03 Apr 2024
Distinguishing Neural Speech Synthesis Models Through Fingerprints in Speech Waveforms Chu Yuan Zhang Jiangyan Yi Jianhua Tao Chenglong Wang Xinrui Yan 8 2 0 13 Sep 2023
A Review of Differentiable Digital Signal Processing for Music & Speech Synthesis B. Hayes Jordie Shier Gyorgy Fazekas Andrew McPherson C. Saitis 27 21 0 29 Aug 2023
Mono-to-stereo through parametric stereo generation Joan Serra D. Scaini Santiago Pascual Daniel Arteaga Jordi Pons J. Breebaart Giulio Cengarle DiffM 13 4 0 26 Jun 2023
High-Fidelity Audio Compression with Improved RVQGAN Rithesh Kumar Prem Seetharaman Alejandro Luebs I. Kumar Kundan Kumar 33 282 0 11 Jun 2023
A Survey on Audio Diffusion Models: Text To Speech Synthesis and Enhancement in Generative AI Chenshuang Zhang Chaoning Zhang Sheng Zheng Mengchun Zhang Maryam Qamar Sung-Ho Bae In So Kweon DiffM MedIm 41 64 0 23 Mar 2023
Native Multi-Band Audio Coding within Hyper-Autoencoded Reconstruction Propagation Networks Darius Petermann Inseon Jang Minje Kim 11 1 0 14 Mar 2023
Adversarial Permutation Invariant Training for Universal Sound Separation Emilian Postolache Jordi Pons Santiago Pascual Joan Serra VLM 21 6 0 21 Oct 2022
Music Separation Enhancement with Generative Modeling N. Schaffer Boaz Cogan Ethan Manilow Max Morrison Prem Seetharaman Bryan Pardo 20 9 0 26 Aug 2022
Automatic music mixing with deep learning and out-of-domain data Marco A. Martínez Ramírez Wei-Hsiang Liao Giorgio Fabbro Stefan Uhlich Chihiro Nagashima Yuki Mitsufuji 29 24 0 24 Aug 2022
PodcastMix: A dataset for separating music and speech in podcasts Nico M. Schmidt Jordi Pons M. Miron 11 2 0 15 Jul 2022
DrumGAN VST: A Plugin for Drum Sound Analysis/Synthesis With Autoencoding Generative Adversarial Networks J. Nistal Cyran Aouameur Ithan Velarde Stefan Lattner GAN 37 4 0 29 Jun 2022
Avocodo: Generative Adversarial Network for Artifact-free Vocoder Taejun Bak Junmo Lee Hanbin Bae Jinhyeok Yang Jaesung Bae Young-Sun Joo 23 27 0 27 Jun 2022
Streaming non-autoregressive model for any-to-many voice conversion Ziyi Chen Haoran Miao Pengyuan Zhang 19 8 0 15 Jun 2022
BigVGAN: A Universal Neural Vocoder with Large-Scale Training Sang-gil Lee Wei Ping Boris Ginsburg Bryan Catanzaro Sung-Hoon Yoon 17 224 0 09 Jun 2022
Universal Speech Enhancement with Score-based Diffusion Joan Serra Santiago Pascual Jordi Pons R. O. Araz D. Scaini DiffM 17 95 0 07 Jun 2022
Fully Convolutional Fractional Scaling Michael Soloveitchik M. Werman 18 0 0 20 Mar 2022
On loss functions and evaluation metrics for music source separation Enric Gusó Jordi Pons Santiago Pascual Joan Serra 11 19 0 16 Feb 2022
EcoFlow: Efficient Convolutional Dataflows for Low-Power Neural Network Accelerators Lois Orosa Skanda Koppula Yaman Umuroglu Konstantinos Kanellopoulos Juan Gómez Luna Michaela Blott K. Vissers O. Mutlu 38 4 0 04 Feb 2022
PostGAN: A GAN-Based Post-Processor to Enhance the Quality of Coded Speech Srikanth Korse N. Pia Kishan Gupta Guillaume Fuchs 39 14 0 31 Jan 2022
The HCCL-DKU system for fake audio generation task of the 2022 ICASSP ADD Challenge Ziyi Chen Hua Hua Yuxiang Zhang Ming Li Pengyuan Zhang 19 0 0 29 Jan 2022
Upsampling layers for music source separation Jordi Pons Joan Serra Santiago Pascual Giulio Cengarle Daniel Arteaga D. Scaini 17 2 0 23 Nov 2021
Hybrid Spectrogram and Waveform Source Separation Alexandre Défossez 13 160 0 05 Nov 2021
An investigation of pre-upsampling generative modelling and Generative Adversarial Networks in audio super resolution James King Ramón Vinas Torné Alexander Campbell Pietro Liò DiffM 14 1 0 30 Sep 2021
Adversarial Auto-Encoding for Packet Loss Concealment Santiago Pascual Joan Serra Jordi Pons 24 27 0 07 Jul 2021
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis Ji-Hoon Kim Sang-Hoon Lee Ji-Hyun Lee Seong-Whan Lee 16 53 0 04 Jun 2021
Generalized Spoofing Detection Inspired from Audio Generation Artifacts Yang Gao Tyler Vuong Mahsa Elyasi Gaurav Bharaj Rita Singh 15 19 0 08 Apr 2021
GAN Vocoder: Multi-Resolution Discriminator Is All You Need J. You Dalhyun Kim Gyuhyeon Nam Geumbyeol Hwang Gyeongsu Chae 13 27 0 09 Mar 2021
Self-Supervised VQ-VAE for One-Shot Music Style Transfer Ondřej Cífka A. Ozerov Umut Simsekli G. Richard 19 27 0 10 Feb 2021
High Fidelity Speech Synthesis with Adversarial Networks Mikolaj Binkowski Jeff Donahue Sander Dieleman Aidan Clark Erich Elsen Norman Casagrande Luis C. Cobo Karen Simonyan 223 239 0 25 Sep 2019
Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation Daniel Stoller Sebastian Ewert S. Dixon AI4TS 104 588 0 08 Jun 2018
Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network Wenzhe Shi Jose Caballero Ferenc Huszár J. Totz Andrew P. Aitken Rob Bishop Daniel Rueckert Zehan Wang SupR 195 5,175 0 16 Sep 2016