WaveGlow: A Flow-based Generative Network for Speech Synthesis

31 October 2018

Papers citing "WaveGlow: A Flow-based Generative Network for Speech Synthesis"

50 / 525 papers shown

Title
Generative Speech Coding with Predictive Variance Regularization W. Kleijn Andrew Storus Michael Chinen Tom Denton Felicia S. C. Lim Alejandro Luebs Jan Skoglund Hengchin Yeh 29 67 0 18 Feb 2021
AudioVisual Speech Synthesis: A brief literature review Efthymios Georgiou Athanasios Katsamanis 21 0 0 18 Feb 2021
Context-Aware Prosody Correction for Text-Based Speech Editing Max Morrison Lucas Rencker Zeyu Jin Nicholas J. Bryan Juan-Pablo Caceres Bryan Pardo 30 28 0 16 Feb 2021
Axial Residual Networks for CycleGAN-based Voice Conversion J. You Gyuhyeon Nam Dalhyun Kim Gyeongsu Chae 16 3 0 16 Feb 2021
PeriodNet: A non-autoregressive waveform generation model with a structure separating periodic and aperiodic components Yukiya Hono Shinji Takaki Kei Hashimoto Keiichiro Oura Yoshihiko Nankaku K. Tokuda 22 16 0 15 Feb 2021
VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention Peng Liu Yuewen Cao Songxiang Liu Na Hu Guangzhi Li Chao Weng Dan Su 42 22 0 12 Feb 2021
Low Bit-Rate Wideband Speech Coding: A Deep Generative Model based Approach Gang Min Xiongwei Zhang Xia Zou Xiangyang Liu 6 0 0 04 Feb 2021
Generacion de voces artificiales infantiles en castellano con acento costarricense A. Alvarez-Blanco Eugenia Córdoba-Warner Marvin Coto-Jiménez Vivian Fallas-Lopez Maribel Morales Rodríguez 6 0 0 02 Feb 2021
Generative Spoken Language Modeling from Raw Audio Kushal Lakhotia Evgeny Kharitonov Wei-Ning Hsu Yossi Adi Adam Polyak ... Tu Nguyen Jade Copet Alexei Baevski A. Mohamed Emmanuel Dupoux AuLLM 199 345 0 01 Feb 2021
Universal Neural Vocoding with Parallel WaveNet Yunlong Jiao Adam Gabry's Georgi Tinchev Bartosz Putrycz Daniel Korzekwa V. Klimkov 36 42 0 01 Feb 2021
Expressive Neural Voice Cloning Paarth Neekhara Shehzeen Samarah Hussain Shlomo Dubnov F. Koushanfar Julian McAuley DiffM 24 30 0 30 Jan 2021
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units Wei-Ning Hsu David Harwath Christopher Song James R. Glass CLIP 37 66 0 31 Dec 2020
Speech Synthesis as Augmentation for Low-Resource ASR Deblin Bagchi Shannon Wotherspoon Zhuolin Jiang P. Muthukumar 12 2 0 23 Dec 2020
Incremental Text-to-Speech Synthesis Using Pseudo Lookahead with Large Pretrained Language Model Takaaki Saeki Shinnosuke Takamichi Hiroshi Saruwatari 8 16 0 23 Dec 2020
Parallel WaveNet conditioned on VAE latent vectors Jonas Rohnke Thomas Merritt Jaime Lorenzo-Trueba Adam Gabry's Vatsal Aggarwal Alexis Moinet Roberto Barra-Chicote 28 3 0 17 Dec 2020
Few Shot Adaptive Normalization Driven Multi-Speaker Speech Synthesis Neeraj Kumar Srishti Goel Ankur Narang Brejesh Lall 29 5 0 14 Dec 2020
Full-Glow: Fully conditional Glow for more realistic image generation Moein Sorkhei G. Henter Hedvig Kjellström 25 6 0 10 Dec 2020
Using previous acoustic context to improve Text-to-Speech synthesis Pilar Oplustil Gallegos Simon King 29 11 0 07 Dec 2020
EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture Chenfeng Miao Shuang Liang Zhencheng Liu Minchuan Chen Jun Ma Shaojun Wang Jing Xiao 22 38 0 07 Dec 2020
Text-to-speech for the hearing impaired Josef Schlittenlacher T. Baer 14 0 0 03 Dec 2020
Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training Haohan Guo Heng Lu Na Hu Chunlei Zhang Shan Yang Lei Xie Dan Su Dong Yu AAML 27 12 0 03 Dec 2020
MelGlow: Efficient Waveform Generative Network Based on Location-Variable Convolution Zhen Zeng Jianzong Wang Ning Cheng Jing Xiao 14 8 0 03 Dec 2020
FBWave: Efficient and Scalable Neural Vocoders for Streaming Text-To-Speech on the Edge Bichen Wu Qing He Peizhao Zhang T. Koehler Kurt Keutzer Peter Vajda 31 6 0 25 Nov 2020
Empirical Evaluation of Deep Learning Model Compression Techniques on the WaveNet Vocoder Sam Davis Giuseppe Coccia Sam Gooch Julian Mack 14 0 0 20 Nov 2020
Universal MelGAN: A Robust Neural Vocoder for High-Fidelity Waveform Generation in Multiple Domains Won Jang D. Lim Jaesam Yoon 25 31 0 19 Nov 2020
Towards transformation-resilient provenance detection of digital media Jamie Hayes Krishnamurthy Dvijotham Dvijotham Yutian Chen Sander Dieleman Pushmeet Kohli Norman Casagrande 18 3 0 14 Nov 2020
Denoising-and-Dereverberation Hierarchical Neural Vocoder for Robust Waveform Generation Yang Ai Haoyu Li Xin Wang Junichi Yamagishi Zhenhua Ling 17 4 0 08 Nov 2020
Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement Daxin Tan Tan Lee 31 21 0 08 Nov 2020
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis Ron J. Weiss RJ Skerry-Ryan Eric Battenberg Soroosh Mariooryad Diederik P. Kingma 24 98 0 06 Nov 2020
Can We Trust Deep Speech Prior? Ying Shi Haolin Chen Zhiyuan Tang Lantian Li Dong Wang Jiqing Han 27 1 0 04 Nov 2020
StyleMelGAN: An Efficient High-Fidelity Adversarial Vocoder with Temporal Adaptive Normalization Ahmed Mustafa N. Pia Guillaume Fuchs 22 72 0 03 Nov 2020
Speech Synthesis and Control Using Differentiable DSP Giorgio Fabbro Vladimir Golkov Thomas Kemp Daniel Cremers 28 12 0 28 Oct 2020
Upsampling artifacts in neural audio synthesis Jordi Pons Santiago Pascual Giulio Cengarle Joan Serrà 35 62 0 27 Oct 2020
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators Ryuichi Yamamoto Eunwoo Song Min-Jae Hwang Jae-Min Kim 29 18 0 27 Oct 2020
AISHELL-3: A Multi-speaker Mandarin TTS Corpus and the Baselines Yao Shi Hui Bu Xin Xu Shaojing Zhang Ming Li 35 219 0 22 Oct 2020
NU-GAN: High resolution neural upsampling with GAN Rithesh Kumar Kundan Kumar Vicki Anand Yoshua Bengio Aaron Courville 27 25 0 22 Oct 2020
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis Jungil Kong Jaehyeon Kim Jaekyoung Bae 54 1,869 0 12 Oct 2020
The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders Wen-Chin Huang Patrick Lumban Tobing Yi-Chiao Wu Kazuhiro Kobayashi T. Toda 21 8 0 09 Oct 2020
Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling Jonathan Shen Ye Jia Mike Chrzanowski Yu Zhang Isaac Elias Heiga Zen Yonghui Wu 27 112 0 08 Oct 2020
Improving Sequential Latent Variable Models with Autoregressive Flows Joseph Marino Lei Chen Jiawei He Stephan Mandt BDL AI4TS 30 12 0 07 Oct 2020
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics Hirokazu Kameoka Takuhiro Kaneko Kou Tanaka Nobukatsu Hojo Shogo Seki DiffM 28 21 0 06 Oct 2020
Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion Che-Jui Chang 17 5 0 30 Sep 2020
FluentNet: End-to-End Detection of Speech Disfluency with Deep Learning Tedd Kourkounakis Amirhossein Hajavi Ali Etemad 24 22 0 23 Sep 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis Zhifeng Kong Ming-Yu Liu Jiaji Huang Kexin Zhao Bryan Catanzaro DiffM BDL 36 1,397 0 21 Sep 2020
Exploration of End-to-end Synthesisers forZero Resource Speech Challenge 2020 Karthik Pandia D.S. Anusha Prakash M. M. H. Murthy 10 4 0 10 Sep 2020
What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS Brooke Stephenson Laurent Besacier Laurent Girin Thomas Hueber 12 13 0 04 Sep 2020
WaveGrad: Estimating Gradients for Waveform Generation Nanxin Chen Yu Zhang Heiga Zen Ron J. Weiss Mohammad Norouzi William Chan DiffM BDL 16 773 0 02 Sep 2020
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion Yi Zhao Wen-Chin Huang Xiaohai Tian Junichi Yamagishi Rohan Kumar Das Tomi Kinnunen Zhenhua Ling T. Toda 27 206 0 28 Aug 2020
Nonparallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks Hirokazu Kameoka Takuhiro Kaneko Kou Tanaka Nobukatsu Hojo 18 20 0 27 Aug 2020
Efficient neural speech synthesis for low-resource languages through multilingual modeling M. D. Korte Jaebok Kim E. Klabbers 10 19 0 20 Aug 2020