Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1902.01605
Cited By
A variance modeling framework based on variational autoencoders for speech enhancement
5 February 2019
Simon Leglaive
Laurent Girin
Radu Horaud
DRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"A variance modeling framework based on variational autoencoders for speech enhancement"
48 / 48 papers shown
Title
SEED: Speaker Embedding Enhancement Diffusion Model
KiHyun Nam
Jungwoo Heo
Jee-weon Jung
Gangin Park
Chaeyoung Jung
Ha-Jin Yu
Joon Son Chung
DiffM
69
0
0
22 May 2025
Combining Deterministic Enhanced Conditions with Dual-Streaming Encoding for Diffusion-Based Speech Enhancement
Hao Shi
Xugang Lu
Kazuki Shimada
Tatsuya Kawahara
DiffM
55
0
0
20 May 2025
Bayesian Cox model with graph-structured variable selection priors for multi-omics biomarker identification
Tobias Østmo Hermansen
M. Zucknick
Zhi Zhao
107
0
0
17 Mar 2025
Deep Generative Modeling for Identification of Noisy, Non-Stationary Dynamical Systems
Doris Voina
Steven Brunton
J. Nathan Kutz
DiffM
79
2
0
02 Oct 2024
Sound Source Separation Using Latent Variational Block-Wise Disentanglement
Karim Helwani
M. Togami
Paris Smaragdis
Michael M. Goodwin
BDL
DRL
87
1
0
08 Feb 2024
Self-supervised learning with diffusion-based multichannel speech enhancement for speaker verification under noisy conditions
Sandipana Dowerah
Ajinkya Kulkarni
Romain Serizel
D. Jouvet
DiffM
129
1
0
05 Jul 2023
Unsupervised speech enhancement with deep dynamical generative speech and noise models
Xiaoyu Lin
Simon Leglaive
Laurent Girin
Xavier Alameda-Pineda
56
3
0
13 Jun 2023
SE-Bridge: Speech Enhancement with Consistent Brownian Bridge
Zhibin Qiu
Mengfan Fu
Gang Hua
G. Altenbek
Hao Huang
DiffM
82
5
0
23 May 2023
Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders
Hao Shi
Kazuki Shimada
M. Hirano
Takashi Shibuya
Yuichiro Koyama
Zhi-Wei Zhong
Shusuke Takahashi
Tatsuya Kawahara
Yuki Mitsufuji
DiffM
94
16
0
18 May 2023
Integrating Uncertainty into Neural Network-based Speech Enhancement
Hu Fang
Dennis Becker
S. Wermter
Timo Gerkmann
UQCV
67
3
0
15 May 2023
Partially Adaptive Multichannel Joint Reduction of Ego-noise and Environmental Noise
Hu Fang
Niklas Wittmer
Johannes Twiefel
S. Wermter
Timo Gerkmann
35
3
0
27 Mar 2023
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
Jean-Marie Lemercier
Julius Richter
Simon Welker
Timo Gerkmann
DiffM
257
92
0
22 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
119
22
0
01 Dec 2022
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial Training
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
69
6
0
16 Nov 2022
Fast and efficient speech enhancement with variational autoencoders
M. Sadeghi
Romain Serizel
DRL
BDL
60
3
0
02 Nov 2022
A weighted-variance variational autoencoder model for speech enhancement
A. Golmakani
M. Sadeghi
Xavier Alameda-Pineda
Romain Serizel
80
1
0
02 Nov 2022
Audio-visual speech enhancement with a deep Kalman filter generative model
A. Golmakani
M. Sadeghi
Romain Serizel
DiffM
57
7
0
02 Nov 2022
SRTNet: Time Domain Speech Enhancement Via Stochastic Refinement
Zhibin Qiu
Mengfan Fu
Yinfeng Yu
Lili Yin
Gang Hua
Hao-Ming Huang
DiffM
145
19
0
30 Oct 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative Models
Julius Richter
Simon Welker
Jean-Marie Lemercier
Bunlong Lay
Timo Gerkmann
DiffM
93
207
0
11 Aug 2022
A deep representation learning speech enhancement method using
β
β
β
-VAE
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
60
2
0
11 May 2022
Learning and controlling the source-filter representation of speech with a variational autoencoder
Samir Sadok
Simon Leglaive
Laurent Girin
Xavier Alameda-Pineda
Renaud Séguier
SSL
DRL
BDL
115
14
0
14 Apr 2022
Expression-preserving face frontalization improves visually assisted speech processing
Zhiqi Kang
M. Sadeghi
Radu Horaud
Xavier Alameda-Pineda
CVBM
114
8
0
06 Apr 2022
Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain
Simon Welker
Julius Richter
Timo Gerkmann
DiffM
114
117
0
31 Mar 2022
A Sparsity-promoting Dictionary Model for Variational Autoencoders
M. Sadeghi
P. Magron
53
3
0
29 Mar 2022
Integrating Statistical Uncertainty into Neural Network-Based Speech Enhancement
Hu Fang
Tal Peer
S. Wermter
Timo Gerkmann
71
6
0
04 Mar 2022
The impact of removing head movements on audio-visual speech enhancement
Zhiqi Kang
M. Sadeghi
Radu Horaud
Xavier Alameda-Pineda
Jacob Donley
Anurag Kumar
CVBM
53
5
0
01 Feb 2022
A Bayesian Permutation training deep representation learning method for speech enhancement with variational autoencoder
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
BDL
DRL
51
4
0
24 Jan 2022
FastMVAE2: On improving and accelerating the fast variational autoencoder-based source separation algorithm for determined mixtures
Li Li
Hirokazu Kameoka
S. Makino
DRL
77
8
0
28 Sep 2021
Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders
Xiaoyu Bie
Simon Leglaive
Xavier Alameda-Pineda
Laurent Girin
DiffM
101
55
0
23 Jun 2021
A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling
Xiaoyu Bie
Laurent Girin
Simon Leglaive
Thomas Hueber
Xavier Alameda-Pineda
55
12
0
11 Jun 2021
Disentanglement Learning for Variational Autoencoders Applied to Audio-Visual Speech Enhancement
Guillaume Carbajal
Julius Richter
Timo Gerkmann
DRL
87
15
0
19 May 2021
Variational Autoencoder for Speech Enhancement with a Noise-Aware Encoder
Hu Fang
Guillaume Carbajal
S. Wermter
Timo Gerkmann
110
59
0
17 Feb 2021
Guided Variational Autoencoder for Speech Enhancement With a Supervised Classifier
Guillaume Carbajal
Julius Richter
Timo Gerkmann
DRL
SSL
50
17
0
12 Feb 2021
Switching Variational Auto-Encoders for Noise-Agnostic Audio-visual Speech Enhancement
M. Sadeghi
Xavier Alameda-Pineda
31
10
0
08 Feb 2021
Can We Trust Deep Speech Prior?
Ying Shi
Haolin Chen
Zhiyuan Tang
Lantian Li
Dong Wang
Jiqing Han
58
1
0
04 Nov 2020
Dynamical Variational Autoencoders: A Comprehensive Review
Laurent Girin
Simon Leglaive
Xiaoyu Bie
Julien Diard
Thomas Hueber
Xavier Alameda-Pineda
BDL
141
222
0
28 Aug 2020
Deep Variational Generative Models for Audio-visual Speech Separation
V. Nguyen
M. Sadeghi
Elisa Ricci
Xavier Alameda-Pineda
SSL
DRL
40
9
0
17 Aug 2020
Mixture of Inference Networks for VAE-based Audio-visual Speech Enhancement
M. Sadeghi
Xavier Alameda-Pineda
79
21
0
23 Dec 2019
Robust Unsupervised Audio-visual Speech Enhancement Using a Mixture of Variational Autoencoders
M. Sadeghi
Xavier Alameda-Pineda
46
19
0
10 Nov 2019
A Recurrent Variational Autoencoder for Speech Enhancement
Simon Leglaive
Xavier Alameda-Pineda
Laurent Girin
Radu Horaud
DRL
140
79
0
24 Oct 2019
Audio-visual Speech Enhancement Using Conditional Variational Auto-Encoders
M. Sadeghi
Simon Leglaive
Xavier Alameda-Pineda
Laurent Girin
Radu Horaud
DiffM
108
66
0
07 Aug 2019
A Statistically Principled and Computationally Efficient Approach to Speech Enhancement using Variational Autoencoders
Manuel Pariente
Antoine Deleforge
Emmanuel Vincent
72
21
0
03 May 2019
Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
Kazuki Shimada
Yoshiaki Bando
Masato Mimura
Katsutoshi Itoyama
Kazuyoshi Yoshii
Tatsuya Kawahara
63
54
0
22 Mar 2019
A Deep Generative Model of Speech Complex Spectrograms
Aditya Arie Nugraha
Kouhei Sekiguchi
Kazuyoshi Yoshii
47
19
0
08 Mar 2019
Fast Multichannel Source Separation Based on Jointly Diagonalizable Spatial Covariance Matrices
Kouhei Sekiguchi
A. Nugraha
Yoshiaki Bando
Kazuyoshi Yoshii
18
37
0
08 Mar 2019
Speech enhancement with variational autoencoders and alpha-stable distributions
Simon Leglaive
Umut Simsekli
Antoine Liutkus
Laurent Girin
Radu Horaud
DRL
63
36
0
08 Feb 2019
Fast MVAE: Joint separation and classification of mixed sources based on multichannel variational autoencoder with auxiliary classifier
Li Li
Hirokazu Kameoka
S. Makino
DRL
50
28
0
16 Dec 2018
Semi-supervised multichannel speech enhancement with variational autoencoders and non-negative matrix factorization
Simon Leglaive
Laurent Girin
Radu Horaud
BDL
106
60
0
16 Nov 2018
1