1811.00002
Cited By
WaveGlow: A Flow-based Generative Network for Speech Synthesis
31 October 2018
R. Prenger
Rafael Valle
Bryan Catanzaro
Papers citing
"WaveGlow: A Flow-based Generative Network for Speech Synthesis"
50 / 525 papers shown
Title
Generative Speech Coding with Predictive Variance Regularization
W. Kleijn
Andrew Storus
Michael Chinen
Tom Denton
Felicia S. C. Lim
Alejandro Luebs
Jan Skoglund
Hengchin Yeh
29
67
0
18 Feb 2021
AudioVisual Speech Synthesis: A brief literature review
Efthymios Georgiou
Athanasios Katsamanis
21
0
0
18 Feb 2021
Context-Aware Prosody Correction for Text-Based Speech Editing
Max Morrison
Lucas Rencker
Zeyu Jin
Nicholas J. Bryan
Juan-Pablo Caceres
Bryan Pardo
30
28
0
16 Feb 2021
Axial Residual Networks for CycleGAN-based Voice Conversion
J. You
Gyuhyeon Nam
Dalhyun Kim
Gyeongsu Chae
16
3
0
16 Feb 2021
PeriodNet: A non-autoregressive waveform generation model with a structure separating periodic and aperiodic components
Yukiya Hono
Shinji Takaki
Kei Hashimoto
Keiichiro Oura
Yoshihiko Nankaku
K. Tokuda
22
16
0
15 Feb 2021
VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention
Peng Liu
Yuewen Cao
Songxiang Liu
Na Hu
Guangzhi Li
Chao Weng
Dan Su
42
22
0
12 Feb 2021
Low Bit-Rate Wideband Speech Coding: A Deep Generative Model based Approach
Gang Min
Xiongwei Zhang
Xia Zou
Xiangyang Liu
6
0
0
04 Feb 2021
Generacion de voces artificiales infantiles en castellano con acento costarricense
A. Alvarez-Blanco
Eugenia Córdoba-Warner
Marvin Coto-Jiménez
Vivian Fallas-Lopez
Maribel Morales Rodríguez
6
0
0
02 Feb 2021
Generative Spoken Language Modeling from Raw Audio
Kushal Lakhotia
Evgeny Kharitonov
Wei-Ning Hsu
Yossi Adi
Adam Polyak
...
Tu Nguyen
Jade Copet
Alexei Baevski
A. Mohamed
Emmanuel Dupoux
AuLLM
199
345
0
01 Feb 2021
Universal Neural Vocoding with Parallel WaveNet
Yunlong Jiao
Adam Gabryś
Georgi Tinchev
Bartosz Putrycz
Daniel Korzekwa
V. Klimkov
36
42
0
01 Feb 2021
Expressive Neural Voice Cloning
Paarth Neekhara
Shehzeen Samarah Hussain
Shlomo Dubnov
F. Koushanfar
Julian McAuley
DiffM
24
30
0
30 Jan 2021
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units
Wei-Ning Hsu
David Harwath
Christopher Song
James R. Glass
CLIP
37
66
0
31 Dec 2020
Speech Synthesis as Augmentation for Low-Resource ASR
Deblin Bagchi
Shannon Wotherspoon
Zhuolin Jiang
P. Muthukumar
12
2
0
23 Dec 2020
Incremental Text-to-Speech Synthesis Using Pseudo Lookahead with Large Pretrained Language Model
Takaaki Saeki
Shinnosuke Takamichi
Hiroshi Saruwatari
8
16
0
23 Dec 2020
Parallel WaveNet conditioned on VAE latent vectors
Jonas Rohnke
Thomas Merritt
Jaime Lorenzo-Trueba
Adam Gabryś
Vatsal Aggarwal
Alexis Moinet
Roberto Barra-Chicote
28
3
0
17 Dec 2020
Few Shot Adaptive Normalization Driven Multi-Speaker Speech Synthesis
Neeraj Kumar
Srishti Goel
Ankur Narang
Brejesh Lall
29
5
0
14 Dec 2020
Full-Glow: Fully conditional Glow for more realistic image generation
Moein Sorkhei
G. Henter
Hedvig Kjellström
25
6
0
10 Dec 2020
Using previous acoustic context to improve Text-to-Speech synthesis
Pilar Oplustil Gallegos
Simon King
29
11
0
07 Dec 2020
EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture
Chenfeng Miao
Shuang Liang
Zhencheng Liu
Minchuan Chen
Jun Ma
Shaojun Wang
Jing Xiao
22
38
0
07 Dec 2020
Text-to-speech for the hearing impaired
Josef Schlittenlacher
T. Baer
14
0
0
03 Dec 2020
Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training
Haohan Guo
Heng Lu
Na Hu
Chunlei Zhang
Shan Yang
Lei Xie
Dan Su
Dong Yu
AAML
27
12
0
03 Dec 2020
MelGlow: Efficient Waveform Generative Network Based on Location-Variable Convolution
Zhen Zeng
Jianzong Wang
Ning Cheng
Jing Xiao
14
8
0
03 Dec 2020
FBWave: Efficient and Scalable Neural Vocoders for Streaming Text-To-Speech on the Edge
Bichen Wu
Qing He
Peizhao Zhang
T. Koehler
Kurt Keutzer
Peter Vajda
31
6
0
25 Nov 2020
Empirical Evaluation of Deep Learning Model Compression Techniques on the WaveNet Vocoder
Sam Davis
Giuseppe Coccia
Sam Gooch
Julian Mack
14
0
0
20 Nov 2020
Universal MelGAN: A Robust Neural Vocoder for High-Fidelity Waveform Generation in Multiple Domains
Won Jang
D. Lim
Jaesam Yoon
25
31
0
19 Nov 2020
Towards transformation-resilient provenance detection of digital media
Jamie Hayes
Krishnamurthy Dvijotham
Yutian Chen
Sander Dieleman
Pushmeet Kohli
Norman Casagrande
18
3
0
14 Nov 2020
Denoising-and-Dereverberation Hierarchical Neural Vocoder for Robust Waveform Generation
Yang Ai
Haoyu Li
Xin Wang
Junichi Yamagishi
Zhenhua Ling
17
4
0
08 Nov 2020
Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement
Daxin Tan
Tan Lee
31
21
0
08 Nov 2020
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis
Ron J. Weiss
RJ Skerry-Ryan
Eric Battenberg
Soroosh Mariooryad
Diederik P. Kingma
24
98
0
06 Nov 2020
Can We Trust Deep Speech Prior?
Ying Shi
Haolin Chen
Zhiyuan Tang
Lantian Li
Dong Wang
Jiqing Han
27
1
0
04 Nov 2020
StyleMelGAN: An Efficient High-Fidelity Adversarial Vocoder with Temporal Adaptive Normalization
Ahmed Mustafa
N. Pia
Guillaume Fuchs
22
72
0
03 Nov 2020
Speech Synthesis and Control Using Differentiable DSP
Giorgio Fabbro
Vladimir Golkov
Thomas Kemp
Daniel Cremers
28
12
0
28 Oct 2020
Upsampling artifacts in neural audio synthesis
Jordi Pons
Santiago Pascual
Giulio Cengarle
Joan Serrà
35
62
0
27 Oct 2020
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators
Ryuichi Yamamoto
Eunwoo Song
Min-Jae Hwang
Jae-Min Kim
29
18
0
27 Oct 2020
AISHELL-3: A Multi-speaker Mandarin TTS Corpus and the Baselines
Yao Shi
Hui Bu
Xin Xu
Shaojing Zhang
Ming Li
35
219
0
22 Oct 2020
NU-GAN: High resolution neural upsampling with GAN
Rithesh Kumar
Kundan Kumar
Vicki Anand
Yoshua Bengio
Aaron Courville
27
25
0
22 Oct 2020
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Jungil Kong
Jaehyeon Kim
Jaekyoung Bae
54
1,869
0
12 Oct 2020
The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders
Wen-Chin Huang
Patrick Lumban Tobing
Yi-Chiao Wu
Kazuhiro Kobayashi
T. Toda
21
8
0
09 Oct 2020
Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling
Jonathan Shen
Ye Jia
Mike Chrzanowski
Yu Zhang
Isaac Elias
Heiga Zen
Yonghui Wu
27
112
0
08 Oct 2020
Improving Sequential Latent Variable Models with Autoregressive Flows
Joseph Marino
Lei Chen
Jiawei He
Stephan Mandt
BDL
AI4TS
30
12
0
07 Oct 2020
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
Shogo Seki
DiffM
28
21
0
06 Oct 2020
Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion
Che-Jui Chang
17
5
0
30 Sep 2020
FluentNet: End-to-End Detection of Speech Disfluency with Deep Learning
Tedd Kourkounakis
Amirhossein Hajavi
Ali Etemad
24
22
0
23 Sep 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM
BDL
36
1,397
0
21 Sep 2020
Exploration of End-to-end Synthesisers for Zero Resource Speech Challenge 2020
Karthik Pandia D.S.
Anusha Prakash
M. M.
H. Murthy
10
4
0
10 Sep 2020
What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS
Brooke Stephenson
Laurent Besacier
Laurent Girin
Thomas Hueber
12
13
0
04 Sep 2020
WaveGrad: Estimating Gradients for Waveform Generation
Nanxin Chen
Yu Zhang
Heiga Zen
Ron J. Weiss
Mohammad Norouzi
William Chan
DiffM
BDL
16
773
0
02 Sep 2020
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion
Yi Zhao
Wen-Chin Huang
Xiaohai Tian
Junichi Yamagishi
Rohan Kumar Das
Tomi Kinnunen
Zhenhua Ling
T. Toda
27
206
0
28 Aug 2020
Nonparallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
18
20
0
27 Aug 2020
Efficient neural speech synthesis for low-resource languages through multilingual modeling
M. D. Korte
Jaebok Kim
E. Klabbers
10
19
0
20 Aug 2020