1904.04472
Probability density distillation with generative adversarial networks for high-quality parallel waveform generation
9 April 2019
Ryuichi Yamamoto
Eunwoo Song
Jae-Min Kim
Papers citing
"Probability density distillation with generative adversarial networks for high-quality parallel waveform generation"
29 / 29 papers shown
Title
Wavehax: Aliasing-Free Neural Waveform Synthesis Based on 2D Convolution and Harmonic Prior for Reliable Complex Spectrogram Estimation
Reo Yoneyama
Atsushi Miyashita
Ryuichi Yamamoto
Tomoki Toda
150
3
0
11 Nov 2024
NeRAF: 3D Scene Infused Neural Radiance and Acoustic Fields
Amandine Brunetto
Sascha Hornauer
Fabien Moutarde
321
8
0
28 May 2024
Multi-Loss Convolutional Network with Time-Frequency Attention for Speech Enhancement
Liang Wan
Hongqing Liu
Yi Zhou
Jie Ji
116
3
0
15 Jun 2023
Learning Noise-independent Speech Representation for High-quality Voice Conversion for Noisy Target Speakers
Interspeech (Interspeech), 2022
Liumeng Xue
Shan Yang
Na Hu
Dan Su
Lei Xie
89
3
0
02 Jul 2022
A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement
Interspeech (Interspeech), 2022
Or Tal
Moshe Mandel
Felix Kreuk
Yossi Adi
AAML
150
10
0
22 Jun 2022
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction
Interspeech (Interspeech), 2022
Zexu Pan
Meng Ge
Haizhou Li
164
22
0
31 Mar 2022
Audio representations for deep learning in sound synthesis: A review
ACS/IEEE International Conference on Computer Systems and Applications (AICCSA), 2021
Anastasia Natsiou
Seán O'Leary
AI4TS
102
23
0
07 Jan 2022
CaloFlow II: Even Faster and Still Accurate Generation of Calorimeter Showers with Normalizing Flows
Claudius Krause
David Shih
141
69
0
21 Oct 2021
FlowVocoder: A small Footprint Neural Vocoder based Normalizing flow for Speech Synthesis
Interspeech (Interspeech), 2021
Manh Luong
Viet-Anh Tran
83
3
0
27 Sep 2021
A Survey on Neural Speech Synthesis
Xu Tan
Tao Qin
Frank Soong
Tie-Yan Liu
AI4TS
239
420
0
29 Jun 2021
Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis
Interspeech (Interspeech), 2021
Jian Cong
Shan Yang
Lei Xie
Dan Su
DRL
140
29
0
21 Jun 2021
Unified Source-Filter GAN: Unified Source-filter Network Based On Factorization of Quasi-Periodic Parallel WaveGAN
Interspeech (Interspeech), 2021
Reo Yoneyama
Yi-Chiao Wu
Tomoki Toda
142
12
0
10 Apr 2021
AudioVisual Speech Synthesis: A brief literature review
Efthymios Georgiou
Athanasios Katsamanis
49
0
0
18 Feb 2021
Efficient neural networks for real-time modeling of analog dynamic range compression
C. Steinmetz
Joshua D. Reiss
139
37
0
11 Feb 2021
Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss
Spoken Language Technology Workshop (SLT), 2021
Eunwoo Song
Ryuichi Yamamoto
Min-Jae Hwang
Jin-Seob Kim
Ohsung Kwon
Jae-Min Kim
94
17
0
19 Jan 2021
I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at Pitch
Joseph P. Turian
Max Henry
151
35
0
08 Dec 2020
Single channel voice separation for unknown number of speakers under reverberant and noisy settings
Shlomo E. Chazan
Lior Wolf
Eliya Nachmani
Yossi Adi
150
32
0
04 Nov 2020
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Ryuichi Yamamoto
Eunwoo Song
Min-Jae Hwang
Jae-Min Kim
125
19
0
27 Oct 2020
Audio Dequantization for High Fidelity Audio Generation in Flow-based Neural Vocoder
Hyun-Wook Yoon
Sang-Hoon Lee
Hyeong-Rae Noh
Seong-Whan Lee
133
12
0
16 Aug 2020
Real Time Speech Enhancement in the Waveform Domain
Interspeech (Interspeech), 2020
Alexandre Défossez
Gabriel Synnaeve
Yossi Adi
240
550
0
23 Jun 2020
GAN Memory with No Forgetting
Neural Information Processing Systems (NeurIPS), 2020
Yulai Cong
Miaoyun Zhao
Jianqiao Li
Sijia Wang
Lawrence Carin
CLL
172
139
0
13 Jun 2020
End-to-End Adversarial Text-to-Speech
Jeff Donahue
Sander Dieleman
Mikolaj Binkowski
Erich Elsen
Karen Simonyan
224
190
0
05 Jun 2020
FeatherWave: An efficient high-fidelity neural vocoder with multi-band linear prediction
Qiao Tian
Zewang Zhang
Heng Lu
Linghui Chen
Shan Liu
100
22
0
12 May 2020
Multi-band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech
Geng Yang
Shan Yang
Kai-Chun Liu
Peng Fang
Wei Chen
Lei Xie
192
220
0
11 May 2020
On Leveraging Pretrained GANs for Generation with Limited Data
International Conference on Machine Learning (ICML), 2020
Miaoyun Zhao
Yulai Cong
Lawrence Carin
179
22
0
26 Feb 2020
WaveFlow: A Compact Flow-based Model for Raw Audio
International Conference on Machine Learning (ICML), 2019
Wei Ping
Kainan Peng
Kexin Zhao
Z. Song
229
128
0
03 Dec 2019
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Ryuichi Yamamoto
Eunwoo Song
Jae-Min Kim
298
917
0
25 Oct 2019
MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis
Neural Information Processing Systems (NeurIPS), 2019
Kundan Kumar
Rithesh Kumar
T. Boissière
L. Gestin
Wei Zhen Teoh
Jose M. R. Sotelo
A. D. Brébisson
Yoshua Bengio
Aaron Courville
GAN
348
1,051
0
08 Oct 2019
High Fidelity Speech Synthesis with Adversarial Networks
International Conference on Learning Representations (ICLR), 2019
Mikolaj Binkowski
Jeff Donahue
Sander Dieleman
Aidan Clark
Erich Elsen
Norman Casagrande
Luis C. Cobo
Karen Simonyan
439
255
0
25 Sep 2019