Probability density distillation with generative adversarial networks for high-quality parallel waveform generation

9 April 2019

Papers citing "Probability density distillation with generative adversarial networks for high-quality parallel waveform generation"

Wavehax: Aliasing-Free Neural Waveform Synthesis Based on 2D Convolution and Harmonic Prior for Reliable Complex Spectrogram Estimation

IEEE Transactions on Audio, Speech, and Language Processing (IEEE TASLP), 2024

333

11 Nov 2024

Evaluating Neural Networks Architectures for Spring Reverb Modelling

Francesco Papaleo

Xavier Lizarraga-Seijas

Frederic Font

187

08 Sep 2024

A Survey of Deep Learning Audio Generation Methods

Matej Bozic

Marko Horvat

VLM MedIm

344

31 May 2024

NeRAF: 3D Scene Infused Neural Radiance and Acoustic Fields

Amandine Brunetto

Sascha Hornauer

Fabien Moutarde

623

28 May 2024

Building a Luganda Text-to-Speech Model From Crowdsourced Data

202

16 May 2024

Collaborative Watermarking for Adversarial Speech Synthesis

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Lauri Juvela

Xin Wang

275

26 Sep 2023

Multi-Loss Convolutional Network with Time-Frequency Attention for Speech Enhancement

208

15 Jun 2023

Learning Noise-independent Speech Representation for High-quality Voice Conversion for Noisy Target Speakers

Interspeech, 2022

Shan Yang

200

02 Jul 2022

A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement

Interspeech, 2022

Yossi Adi

307

22 Jun 2022

A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction

Interspeech, 2022

Zexu Pan

Meng Ge

Haizhou Li

307

31 Mar 2022

Audio representations for deep learning in sound synthesis: A review

ACS/IEEE International Conference on Computer Systems and Applications (AICCSA), 2021

Anastasia Natsiou

Seán O'Leary

AI4TS

187

07 Jan 2022

CaloFlow II: Even Faster and Still Accurate Generation of Calorimeter Showers with Normalizing Flows

Claudius Krause

David Shih

209

21 Oct 2021

FlowVocoder: A small Footprint Neural Vocoder based Normalizing flow for Speech Synthesis

Interspeech, 2021

Manh Luong

Viet-Anh Tran

137

27 Sep 2021

A Survey on Neural Speech Synthesis

Xu Tan

439

442

29 Jun 2021

Distilling the Knowledge from Conditional Normalizing Flows

342

24 Jun 2021

Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis

Interspeech, 2021

Jian Cong

Shan Yang

Lei Xie

Jane Polak Scowcroft

DRL

218

21 Jun 2021

Unified Source-Filter GAN: Unified Source-filter Network Based On Factorization of Quasi-Periodic Parallel WaveGAN

Interspeech, 2021

Reo Yoneyama

Yi-Chiao Wu

Tomoki Toda

273

10 Apr 2021

AudioVisual Speech Synthesis: A brief literature review

Efthymios Georgiou

Athanasios Katsamanis

101

18 Feb 2021

Efficient neural networks for real-time modeling of analog dynamic range compression

C. Steinmetz

Joshua D. Reiss

272

11 Feb 2021

Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss

Spoken Language Technology Workshop (SLT), 2021

164

19 Jan 2021

I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at Pitch

Joseph P. Turian

Max Henry

274

08 Dec 2020

Single channel voice separation for unknown number of speakers under reverberant and noisy settings

Shlomo E. Chazan

Lior Wolf

Eliya Nachmani

Yossi Adi

281

04 Nov 2020

Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

240

27 Oct 2020

Audio Dequantization for High Fidelity Audio Generation in Flow-based Neural Vocoder

236

16 Aug 2020

Real Time Speech Enhancement in the Waveform Domain

Interspeech, 2020

Alexandre Défossez

Gabriel Synnaeve

Yossi Adi

580

606

23 Jun 2020

GAN Memory with No Forgetting

Neural Information Processing Systems (NeurIPS), 2020

Lawrence Carin

389

147

13 Jun 2020

End-to-End Adversarial Text-to-Speech

435

192

05 Jun 2020

FeatherWave: An efficient high-fidelity neural vocoder with multi-band linear prediction

Heng Lu

148

12 May 2020

Multi-band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech

Shan Yang

Lei Xie

262

232

11 May 2020

On Leveraging Pretrained GANs for Generation with Limited Data

International Conference on Machine Learning (ICML), 2020

Miaoyun Zhao

Yulai Cong

Lawrence Carin

299

26 Feb 2020

WaveFlow: A Compact Flow-based Model for Raw Audio

International Conference on Machine Learning (ICML), 2019

336

132

03 Dec 2019

Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Ryuichi Yamamoto

Eunwoo Song

Jae-Min Kim

572

963

25 Oct 2019

MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis

Neural Information Processing Systems (NeurIPS), 2019

Aaron Courville

545

1,105

08 Oct 2019

High Fidelity Speech Synthesis with Adversarial Networks

International Conference on Learning Representations (ICLR), 2019

775

263

25 Sep 2019