v1v2 (latest)

High Fidelity Speech Synthesis with Adversarial Networks

International Conference on Learning Representations (ICLR), 2019

25 September 2019

Papers citing "High Fidelity Speech Synthesis with Adversarial Networks"

50 / 153 papers shown

A Survey on Neural Speech Synthesis

Xu Tan

349

435

29 Jun 2021

AI based Presentation Creator With Customized Audio Content Delivery

Muvazima Mansoor

Srikanth Chandar

Ramamoorthy Srinath

176

27 Jun 2021

Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech SynthesisInterspeech (Interspeech), 2021

Jian Cong

Shan Yang

Lei Xie

Jane Polak Scowcroft

DRL

166

21 Jun 2021

WaveGrad 2: Iterative Refinement for Text-to-Speech SynthesisInterspeech (Interspeech), 2021

Najim Dehak

213

17 Jun 2021

Non Gaussian Denoising Diffusion Models

Eliya Nachmani

Robin San Roman

Lior Wolf

VLM DiffM

163

14 Jun 2021

Catch-A-Waveform: Learning to Generate Audio from a Single Short ExampleNeural Information Processing Systems (NeurIPS), 2021

Gal Greshler

Tamar Rott Shaham

T. Michaeli

197

11 Jun 2021

Sprachsynthese -- State-of-the-Art in englischer und deutscher Sprache

René Peinl

153

11 Jun 2021

Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-SpeechInternational Conference on Machine Learning (ICML), 2021

298

1,151

11 Jun 2021

Fre-GAN: Adversarial Frequency-consistent Audio SynthesisInterspeech (Interspeech), 2021

198

04 Jun 2021

NVC-Net: End-to-End Adversarial Voice ConversionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Bac Nguyen Cong

Fabien Cardinaux

AAML

195

02 Jun 2021

ItôTTS and ItôWave: Linear Stochastic Differential Equation Is All You Need For Audio Generation

Shoule Wu

Ziqiang Shi

DiffM

238

17 May 2021

Grad-TTS: A Diffusion Probabilistic Model for Text-to-SpeechInternational Conference on Machine Learning (ICML), 2021

395

660

13 May 2021

VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive CodingIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2021

235

04 May 2021

VideoGPT: Video Generation using VQ-VAE and Transformers

Pieter Abbeel

632

643

20 Apr 2021

Noise Estimation for Generative Diffusion Models

Robin San-Roman

Eliya Nachmani

Lior Wolf

DiffM

288

117

06 Apr 2021

Deepfakes Generation and Detection: State-of-the-art, open challenges, countermeasures, and way forward

504

410

25 Feb 2021

MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in FramesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

127

25 Feb 2021

AudioVisual Speech Synthesis: A brief literature review

Efthymios Georgiou

Athanasios Katsamanis

18 Feb 2021

High Fidelity Speech Regeneration with Application to Speech EnhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Yossi Adi

154

31 Jan 2021

Fully Non-autoregressive Neural Machine Translation: Tricks of the TradeFindings (Findings), 2020

Jiatao Gu

X. Kong

246

144

31 Dec 2020

MelGlow: Efficient Waveform Generative Network Based on Location-Variable ConvolutionSpoken Language Technology Workshop (SLT), 2020

153

03 Dec 2020

A Comprehensive Survey on Deep Music Generation: Multi-level Representations, Algorithms, Evaluations, and Future Directions

268

143

13 Nov 2020

Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis

196

106

06 Nov 2020

StyleMelGAN: An Efficient High-Fidelity Adversarial Vocoder with Temporal Adaptive Normalization

Ahmed Mustafa

N. Pia

Guillaume Fuchs

180

03 Nov 2020

Speech Synthesis and Control Using Differentiable DSP

181

28 Oct 2020

Upsampling artifacts in neural audio synthesisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

213

27 Oct 2020

Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminatorsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

175

27 Oct 2020

CLAR: Contrastive Learning of Auditory RepresentationsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020

Haider Al-Tahan

Y. Mohsenzadeh

SSL

380

19 Oct 2020

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Jungil Kong

Jaehyeon Kim

Jaekyoung Bae

493

2,433

12 Oct 2020

The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders

Wen-Chin Huang

Patrick Lumban Tobing

Yi-Chiao Wu

Kazuhiro Kobayashi

Tomoki Toda

172

09 Oct 2020

DiffWave: A Versatile Diffusion Model for Audio SynthesisInternational Conference on Learning Representations (ICLR), 2020

672

1,759

21 Sep 2020

HiFiSinger: Towards High-Fidelity Neural Singing Voice Synthesis

Xu Tan

218

106

03 Sep 2020

WaveGrad: Estimating Gradients for Waveform GenerationInternational Conference on Learning Representations (ICLR), 2020

410

886

02 Sep 2020

Prosody Learning Mechanism for Speech Synthesis System Without Text Length LimitInterspeech (Interspeech), 2020

138

13 Aug 2020

An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep LearningIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020

Haizhou Li

440

389

09 Aug 2020

A Spectral Energy Distance for Parallel Speech Synthesis

236

03 Aug 2020

Adversarially Trained Multi-Singer Sequence-To-Sequence Singing Synthesizer

Jie Wu

Jian Luan

158

18 Jun 2020

FastPitch: Parallel Text-to-speech with Pitch PredictionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Adrian Lañcucki

265

388

11 Jun 2020

HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial NetworksInterspeech (Interspeech), 2020

Jiaqi Su

Zeyu Jin

Adam Finkelstein

171

155

10 Jun 2020

End-to-End Adversarial Text-to-Speech

317

192

05 Jun 2020

Speech-to-Singing Conversion based on Boundary Equilibrium GANInterspeech (Interspeech), 2020

Da-Yi Wu

Yi-Hsuan Yang

GAN

205

28 May 2020

Quasi-Periodic Parallel WaveGAN Vocoder: A Non-autoregressive Pitch-dependent Dilated Convolution Model for Parametric Speech Generation

Hisashi Kawai

152

18 May 2020

Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis

271

131

12 May 2020

Multi-band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech

Shan Yang

Lei Xie

225

11 May 2020

GACELA -- A generative adversarial context encoder for long audio inpainting

266

11 May 2020

Cotatron: Transcription-Guided Speech Encoder for Any-to-Many Voice Conversion without Parallel Data

Seung-won Park

Doo-young Kim

Myun-chul Joe

155

07 May 2020

Conditional Spoken Digit Generation with StyleGANInterspeech (Interspeech), 2020

224

28 Apr 2020

Transformation-based Adversarial Video Prediction on Large-Scale Data

1.0K

09 Mar 2020

A Limited-Capacity Minimax Theorem for Non-Convex Games or: How I Learned to Stop Worrying about Mixed-Nash and Love Neural Nets

Gauthier Gidel

David Balduzzi

Wojciech M. Czarnecki

M. Garnelo

Yoram Bachrach

255

14 Feb 2020

Score and Lyrics-Free Singing Voice GenerationInternational Conference on Innovative Computing and Cloud Computing (ICCC), 2019

170

26 Dec 2019