ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.01605
  4. Cited By
A variance modeling framework based on variational autoencoders for
  speech enhancement

A variance modeling framework based on variational autoencoders for speech enhancement

5 February 2019
Simon Leglaive
Laurent Girin
Radu Horaud
    DRL
ArXiv (abs)PDFHTML

Papers citing "A variance modeling framework based on variational autoencoders for speech enhancement"

48 / 48 papers shown
Title
SEED: Speaker Embedding Enhancement Diffusion Model
SEED: Speaker Embedding Enhancement Diffusion Model
KiHyun Nam
Jungwoo Heo
Jee-weon Jung
Gangin Park
Chaeyoung Jung
Ha-Jin Yu
Joon Son Chung
DiffM
69
0
0
22 May 2025
Combining Deterministic Enhanced Conditions with Dual-Streaming Encoding for Diffusion-Based Speech Enhancement
Combining Deterministic Enhanced Conditions with Dual-Streaming Encoding for Diffusion-Based Speech Enhancement
Hao Shi
Xugang Lu
Kazuki Shimada
Tatsuya Kawahara
DiffM
55
0
0
20 May 2025
Bayesian Cox model with graph-structured variable selection priors for multi-omics biomarker identification
Bayesian Cox model with graph-structured variable selection priors for multi-omics biomarker identification
Tobias Østmo Hermansen
M. Zucknick
Zhi Zhao
107
0
0
17 Mar 2025
Deep Generative Modeling for Identification of Noisy, Non-Stationary
  Dynamical Systems
Deep Generative Modeling for Identification of Noisy, Non-Stationary Dynamical Systems
Doris Voina
Steven Brunton
J. Nathan Kutz
DiffM
79
2
0
02 Oct 2024
Sound Source Separation Using Latent Variational Block-Wise
  Disentanglement
Sound Source Separation Using Latent Variational Block-Wise Disentanglement
Karim Helwani
M. Togami
Paris Smaragdis
Michael M. Goodwin
BDLDRL
87
1
0
08 Feb 2024
Self-supervised learning with diffusion-based multichannel speech
  enhancement for speaker verification under noisy conditions
Self-supervised learning with diffusion-based multichannel speech enhancement for speaker verification under noisy conditions
Sandipana Dowerah
Ajinkya Kulkarni
Romain Serizel
D. Jouvet
DiffM
129
1
0
05 Jul 2023
Unsupervised speech enhancement with deep dynamical generative speech
  and noise models
Unsupervised speech enhancement with deep dynamical generative speech and noise models
Xiaoyu Lin
Simon Leglaive
Laurent Girin
Xavier Alameda-Pineda
56
3
0
13 Jun 2023
SE-Bridge: Speech Enhancement with Consistent Brownian Bridge
SE-Bridge: Speech Enhancement with Consistent Brownian Bridge
Zhibin Qiu
Mengfan Fu
Gang Hua
G. Altenbek
Hao Huang
DiffM
82
5
0
23 May 2023
Diffusion-Based Speech Enhancement with Joint Generative and Predictive
  Decoders
Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders
Hao Shi
Kazuki Shimada
M. Hirano
Takashi Shibuya
Yuichiro Koyama
Zhi-Wei Zhong
Shusuke Takahashi
Tatsuya Kawahara
Yuki Mitsufuji
DiffM
94
16
0
18 May 2023
Integrating Uncertainty into Neural Network-based Speech Enhancement
Integrating Uncertainty into Neural Network-based Speech Enhancement
Hu Fang
Dennis Becker
S. Wermter
Timo Gerkmann
UQCV
67
3
0
15 May 2023
Partially Adaptive Multichannel Joint Reduction of Ego-noise and
  Environmental Noise
Partially Adaptive Multichannel Joint Reduction of Ego-noise and Environmental Noise
Hu Fang
Niklas Wittmer
Johannes Twiefel
S. Wermter
Timo Gerkmann
35
3
0
27 Mar 2023
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech
  Enhancement and Dereverberation
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
Jean-Marie Lemercier
Julius Richter
Simon Welker
Timo Gerkmann
DiffM
257
92
0
22 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of
  the art analysis
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
119
22
0
01 Dec 2022
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method
  Using Variational Autoencoder and Adversarial Training
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial Training
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
69
6
0
16 Nov 2022
Fast and efficient speech enhancement with variational autoencoders
Fast and efficient speech enhancement with variational autoencoders
M. Sadeghi
Romain Serizel
DRLBDL
60
3
0
02 Nov 2022
A weighted-variance variational autoencoder model for speech enhancement
A weighted-variance variational autoencoder model for speech enhancement
A. Golmakani
M. Sadeghi
Xavier Alameda-Pineda
Romain Serizel
80
1
0
02 Nov 2022
Audio-visual speech enhancement with a deep Kalman filter generative
  model
Audio-visual speech enhancement with a deep Kalman filter generative model
A. Golmakani
M. Sadeghi
Romain Serizel
DiffM
57
7
0
02 Nov 2022
SRTNet: Time Domain Speech Enhancement Via Stochastic Refinement
SRTNet: Time Domain Speech Enhancement Via Stochastic Refinement
Zhibin Qiu
Mengfan Fu
Yinfeng Yu
Lili Yin
Gang Hua
Hao-Ming Huang
DiffM
145
19
0
30 Oct 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative
  Models
Speech Enhancement and Dereverberation with Diffusion-based Generative Models
Julius Richter
Simon Welker
Jean-Marie Lemercier
Bunlong Lay
Timo Gerkmann
DiffM
93
207
0
11 Aug 2022
A deep representation learning speech enhancement method using
  $β$-VAE
A deep representation learning speech enhancement method using βββ-VAE
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
60
2
0
11 May 2022
Learning and controlling the source-filter representation of speech with
  a variational autoencoder
Learning and controlling the source-filter representation of speech with a variational autoencoder
Samir Sadok
Simon Leglaive
Laurent Girin
Xavier Alameda-Pineda
Renaud Séguier
SSLDRLBDL
115
14
0
14 Apr 2022
Expression-preserving face frontalization improves visually assisted
  speech processing
Expression-preserving face frontalization improves visually assisted speech processing
Zhiqi Kang
M. Sadeghi
Radu Horaud
Xavier Alameda-Pineda
CVBM
114
8
0
06 Apr 2022
Speech Enhancement with Score-Based Generative Models in the Complex
  STFT Domain
Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain
Simon Welker
Julius Richter
Timo Gerkmann
DiffM
114
117
0
31 Mar 2022
A Sparsity-promoting Dictionary Model for Variational Autoencoders
A Sparsity-promoting Dictionary Model for Variational Autoencoders
M. Sadeghi
P. Magron
53
3
0
29 Mar 2022
Integrating Statistical Uncertainty into Neural Network-Based Speech
  Enhancement
Integrating Statistical Uncertainty into Neural Network-Based Speech Enhancement
Hu Fang
Tal Peer
S. Wermter
Timo Gerkmann
71
6
0
04 Mar 2022
The impact of removing head movements on audio-visual speech enhancement
The impact of removing head movements on audio-visual speech enhancement
Zhiqi Kang
M. Sadeghi
Radu Horaud
Xavier Alameda-Pineda
Jacob Donley
Anurag Kumar
CVBM
53
5
0
01 Feb 2022
A Bayesian Permutation training deep representation learning method for
  speech enhancement with variational autoencoder
A Bayesian Permutation training deep representation learning method for speech enhancement with variational autoencoder
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
BDLDRL
51
4
0
24 Jan 2022
FastMVAE2: On improving and accelerating the fast variational
  autoencoder-based source separation algorithm for determined mixtures
FastMVAE2: On improving and accelerating the fast variational autoencoder-based source separation algorithm for determined mixtures
Li Li
Hirokazu Kameoka
S. Makino
DRL
77
8
0
28 Sep 2021
Unsupervised Speech Enhancement using Dynamical Variational
  Auto-Encoders
Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders
Xiaoyu Bie
Simon Leglaive
Xavier Alameda-Pineda
Laurent Girin
DiffM
101
55
0
23 Jun 2021
A Benchmark of Dynamical Variational Autoencoders applied to Speech
  Spectrogram Modeling
A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling
Xiaoyu Bie
Laurent Girin
Simon Leglaive
Thomas Hueber
Xavier Alameda-Pineda
55
12
0
11 Jun 2021
Disentanglement Learning for Variational Autoencoders Applied to
  Audio-Visual Speech Enhancement
Disentanglement Learning for Variational Autoencoders Applied to Audio-Visual Speech Enhancement
Guillaume Carbajal
Julius Richter
Timo Gerkmann
DRL
87
15
0
19 May 2021
Variational Autoencoder for Speech Enhancement with a Noise-Aware
  Encoder
Variational Autoencoder for Speech Enhancement with a Noise-Aware Encoder
Hu Fang
Guillaume Carbajal
S. Wermter
Timo Gerkmann
110
59
0
17 Feb 2021
Guided Variational Autoencoder for Speech Enhancement With a Supervised
  Classifier
Guided Variational Autoencoder for Speech Enhancement With a Supervised Classifier
Guillaume Carbajal
Julius Richter
Timo Gerkmann
DRLSSL
50
17
0
12 Feb 2021
Switching Variational Auto-Encoders for Noise-Agnostic Audio-visual
  Speech Enhancement
Switching Variational Auto-Encoders for Noise-Agnostic Audio-visual Speech Enhancement
M. Sadeghi
Xavier Alameda-Pineda
31
10
0
08 Feb 2021
Can We Trust Deep Speech Prior?
Can We Trust Deep Speech Prior?
Ying Shi
Haolin Chen
Zhiyuan Tang
Lantian Li
Dong Wang
Jiqing Han
58
1
0
04 Nov 2020
Dynamical Variational Autoencoders: A Comprehensive Review
Dynamical Variational Autoencoders: A Comprehensive Review
Laurent Girin
Simon Leglaive
Xiaoyu Bie
Julien Diard
Thomas Hueber
Xavier Alameda-Pineda
BDL
141
222
0
28 Aug 2020
Deep Variational Generative Models for Audio-visual Speech Separation
Deep Variational Generative Models for Audio-visual Speech Separation
V. Nguyen
M. Sadeghi
Elisa Ricci
Xavier Alameda-Pineda
SSLDRL
40
9
0
17 Aug 2020
Mixture of Inference Networks for VAE-based Audio-visual Speech
  Enhancement
Mixture of Inference Networks for VAE-based Audio-visual Speech Enhancement
M. Sadeghi
Xavier Alameda-Pineda
79
21
0
23 Dec 2019
Robust Unsupervised Audio-visual Speech Enhancement Using a Mixture of
  Variational Autoencoders
Robust Unsupervised Audio-visual Speech Enhancement Using a Mixture of Variational Autoencoders
M. Sadeghi
Xavier Alameda-Pineda
46
19
0
10 Nov 2019
A Recurrent Variational Autoencoder for Speech Enhancement
A Recurrent Variational Autoencoder for Speech Enhancement
Simon Leglaive
Xavier Alameda-Pineda
Laurent Girin
Radu Horaud
DRL
140
79
0
24 Oct 2019
Audio-visual Speech Enhancement Using Conditional Variational
  Auto-Encoders
Audio-visual Speech Enhancement Using Conditional Variational Auto-Encoders
M. Sadeghi
Simon Leglaive
Xavier Alameda-Pineda
Laurent Girin
Radu Horaud
DiffM
108
66
0
07 Aug 2019
A Statistically Principled and Computationally Efficient Approach to
  Speech Enhancement using Variational Autoencoders
A Statistically Principled and Computationally Efficient Approach to Speech Enhancement using Variational Autoencoders
Manuel Pariente
Antoine Deleforge
Emmanuel Vincent
72
21
0
03 May 2019
Unsupervised Speech Enhancement Based on Multichannel NMF-Informed
  Beamforming for Noise-Robust Automatic Speech Recognition
Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
Kazuki Shimada
Yoshiaki Bando
Masato Mimura
Katsutoshi Itoyama
Kazuyoshi Yoshii
Tatsuya Kawahara
63
54
0
22 Mar 2019
A Deep Generative Model of Speech Complex Spectrograms
A Deep Generative Model of Speech Complex Spectrograms
Aditya Arie Nugraha
Kouhei Sekiguchi
Kazuyoshi Yoshii
47
19
0
08 Mar 2019
Fast Multichannel Source Separation Based on Jointly Diagonalizable
  Spatial Covariance Matrices
Fast Multichannel Source Separation Based on Jointly Diagonalizable Spatial Covariance Matrices
Kouhei Sekiguchi
A. Nugraha
Yoshiaki Bando
Kazuyoshi Yoshii
18
37
0
08 Mar 2019
Speech enhancement with variational autoencoders and alpha-stable
  distributions
Speech enhancement with variational autoencoders and alpha-stable distributions
Simon Leglaive
Umut Simsekli
Antoine Liutkus
Laurent Girin
Radu Horaud
DRL
63
36
0
08 Feb 2019
Fast MVAE: Joint separation and classification of mixed sources based on
  multichannel variational autoencoder with auxiliary classifier
Fast MVAE: Joint separation and classification of mixed sources based on multichannel variational autoencoder with auxiliary classifier
Li Li
Hirokazu Kameoka
S. Makino
DRL
50
28
0
16 Dec 2018
Semi-supervised multichannel speech enhancement with variational
  autoencoders and non-negative matrix factorization
Semi-supervised multichannel speech enhancement with variational autoencoders and non-negative matrix factorization
Simon Leglaive
Laurent Girin
Radu Horaud
BDL
106
60
0
16 Nov 2018
1