Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1904.04472
Cited By
v1
v2 (latest)
Probability density distillation with generative adversarial networks for high-quality parallel waveform generation
9 April 2019
Ryuichi Yamamoto
Eunwoo Song
Jae-Min Kim
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Probability density distillation with generative adversarial networks for high-quality parallel waveform generation"
34 / 34 papers shown
Wavehax: Aliasing-Free Neural Waveform Synthesis Based on 2D Convolution and Harmonic Prior for Reliable Complex Spectrogram Estimation
IEEE Transactions on Audio, Speech, and Language Processing (IEEE TASLP), 2024
Reo Yoneyama
Atsushi Miyashita
Ryuichi Yamamoto
Tomoki Toda
333
5
0
11 Nov 2024
Evaluating Neural Networks Architectures for Spring Reverb Modelling
Francesco Papaleo
Xavier Lizarraga-Seijas
Frederic Font
187
0
0
08 Sep 2024
A Survey of Deep Learning Audio Generation Methods
Matej Bozic
Marko Horvat
VLM
MedIm
344
9
0
31 May 2024
NeRAF: 3D Scene Infused Neural Radiance and Acoustic Fields
Amandine Brunetto
Sascha Hornauer
Fabien Moutarde
623
11
0
28 May 2024
Building a Luganda Text-to-Speech Model From Crowdsourced Data
Sulaiman Kagumire
Andrew Katumba
J. Nakatumba‐Nabende
John Quinn
202
2
0
16 May 2024
Collaborative Watermarking for Adversarial Speech Synthesis
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Lauri Juvela
Xin Wang
275
22
0
26 Sep 2023
Multi-Loss Convolutional Network with Time-Frequency Attention for Speech Enhancement
Liang Wan
Hongqing Liu
Yi Zhou
Jie Ji
208
3
0
15 Jun 2023
Learning Noise-independent Speech Representation for High-quality Voice Conversion for Noisy Target Speakers
Interspeech (Interspeech), 2022
Liumeng Xue
Shan Yang
Na Hu
Jane Polak Scowcroft
Linfu Xie
200
4
0
02 Jul 2022
A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement
Interspeech (Interspeech), 2022
Or Tal
Moshe Mandel
Felix Kreuk
Yossi Adi
AAML
307
11
0
22 Jun 2022
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction
Interspeech (Interspeech), 2022
Zexu Pan
Meng Ge
Haizhou Li
307
26
0
31 Mar 2022
Audio representations for deep learning in sound synthesis: A review
ACS/IEEE International Conference on Computer Systems and Applications (AICCSA), 2021
Anastasia Natsiou
Seán O'Leary
AI4TS
187
28
0
07 Jan 2022
CaloFlow II: Even Faster and Still Accurate Generation of Calorimeter Showers with Normalizing Flows
Claudius Krause
David Shih
209
71
0
21 Oct 2021
FlowVocoder: A small Footprint Neural Vocoder based Normalizing flow for Speech Synthesis
Interspeech (Interspeech), 2021
Manh Luong
Viet-Anh Tran
137
3
0
27 Sep 2021
A Survey on Neural Speech Synthesis
Xu Tan
Tao Qin
Frank Soong
Tie-Yan Liu
AI4TS
439
442
0
29 Jun 2021
Distilling the Knowledge from Conditional Normalizing Flows
Dmitry Baranchuk
Vladimir Aliev
Artem Babenko
BDL
342
5
0
24 Jun 2021
Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis
Interspeech (Interspeech), 2021
Jian Cong
Shan Yang
Lei Xie
Jane Polak Scowcroft
DRL
218
29
0
21 Jun 2021
Unified Source-Filter GAN: Unified Source-filter Network Based On Factorization of Quasi-Periodic Parallel WaveGAN
Interspeech (Interspeech), 2021
Reo Yoneyama
Yi-Chiao Wu
Tomoki Toda
273
12
0
10 Apr 2021
AudioVisual Speech Synthesis: A brief literature review
Efthymios Georgiou
Athanasios Katsamanis
101
0
0
18 Feb 2021
Efficient neural networks for real-time modeling of analog dynamic range compression
C. Steinmetz
Joshua D. Reiss
272
40
0
11 Feb 2021
Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss
Spoken Language Technology Workshop (SLT), 2021
Eunwoo Song
Ryuichi Yamamoto
Min-Jae Hwang
Jin-Seob Kim
Ohsung Kwon
Jae-Min Kim
164
18
0
19 Jan 2021
I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at Pitch
Joseph P. Turian
Max Henry
274
36
0
08 Dec 2020
Single channel voice separation for unknown number of speakers under reverberant and noisy settings
Shlomo E. Chazan
Lior Wolf
Eliya Nachmani
Yossi Adi
281
32
0
04 Nov 2020
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Ryuichi Yamamoto
Eunwoo Song
Min-Jae Hwang
Jae-Min Kim
240
19
0
27 Oct 2020
Audio Dequantization for High Fidelity Audio Generation in Flow-based Neural Vocoder
Hyun-Wook Yoon
Sang-Hoon Lee
Hyeong-Rae Noh
Seong-Whan Lee
236
12
0
16 Aug 2020
Real Time Speech Enhancement in the Waveform Domain
Interspeech (Interspeech), 2020
Alexandre Défossez
Gabriel Synnaeve
Yossi Adi
580
606
0
23 Jun 2020
GAN Memory with No Forgetting
Neural Information Processing Systems (NeurIPS), 2020
Yulai Cong
Miaoyun Zhao
Jianqiao Li
Sijia Wang
Lawrence Carin
CLL
389
147
0
13 Jun 2020
End-to-End Adversarial Text-to-Speech
Jeff Donahue
Sander Dieleman
Mikolaj Binkowski
Erich Elsen
Karen Simonyan
435
192
0
05 Jun 2020
FeatherWave: An efficient high-fidelity neural vocoder with multi-band linear prediction
Qiao Tian
Zewang Zhang
Heng Lu
Linghui Chen
Shan Liu
148
22
0
12 May 2020
Multi-band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech
Geng Yang
Shan Yang
Kai-Chun Liu
Peng Fang
Wei Chen
Lei Xie
262
232
0
11 May 2020
On Leveraging Pretrained GANs for Generation with Limited Data
International Conference on Machine Learning (ICML), 2020
Miaoyun Zhao
Yulai Cong
Lawrence Carin
299
22
0
26 Feb 2020
WaveFlow: A Compact Flow-based Model for Raw Audio
International Conference on Machine Learning (ICML), 2019
Ming-Yu Liu
Kainan Peng
Kexin Zhao
Z. Song
336
132
0
03 Dec 2019
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Ryuichi Yamamoto
Eunwoo Song
Jae-Min Kim
572
963
0
25 Oct 2019
MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis
Neural Information Processing Systems (NeurIPS), 2019
Kundan Kumar
Rithesh Kumar
T. Boissière
L. Gestin
Wei Zhen Teoh
Jose M. R. Sotelo
A. D. Brébisson
Yoshua Bengio
Aaron Courville
GAN
545
1,105
0
08 Oct 2019
High Fidelity Speech Synthesis with Adversarial Networks
International Conference on Learning Representations (ICLR), 2019
Mikolaj Binkowski
Jeff Donahue
Sander Dieleman
Aidan Clark
Erich Elsen
Norman Casagrande
Luis C. Cobo
Karen Simonyan
775
263
0
25 Sep 2019
1
Page 1 of 1