ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.12271
  4. Cited By
Unsupervised Speech Enhancement using Dynamical Variational
  Auto-Encoders
v1v2v3 (latest)

Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders

23 June 2021
Xiaoyu Bie
Simon Leglaive
Xavier Alameda-Pineda
Laurent Girin
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders"

28 / 28 papers shown
Title
Capacity-Net-Based RIS Precoding Design without Channel Estimation for mmWave MIMO System
Capacity-Net-Based RIS Precoding Design without Channel Estimation for mmWave MIMO SystemIEEE International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC), 2024
Chun-Yuan Huang
Po-Heng Chou
Wan-Jen Huang
Ying-Ren Chien
Yu Tsao
92
3
0
30 Sep 2025
LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models
LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models
Beilong Tang
Bang Zeng
Ming Li
AI4TS
256
5
0
10 Apr 2025
AudioMiXR: Spatial Audio Object Manipulation with 6DoF for Sound Design in Augmented Reality
AudioMiXR: Spatial Audio Object Manipulation with 6DoF for Sound Design in Augmented RealityProceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2025
Brandon Woodard
Margarita Geleta
Joseph J. LaViola Jr.
Andrea Fanelli
Rhonda Wilson
822
24
0
05 Feb 2025
Diffusion-based Unsupervised Audio-visual Speech Enhancement
Diffusion-based Unsupervised Audio-visual Speech EnhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Jean-Eudes Ayilo
Mostafa Sadeghi
Romain Serizel
Xavier Alameda-Pineda
DiffM
289
6
0
04 Oct 2024
Restorative Speech Enhancement: A Progressive Approach Using SE and
  Codec Modules
Restorative Speech Enhancement: A Progressive Approach Using SE and Codec Modules
Hsin-Tien Chiang
Hao Zhang
Yong Xu
Meng Yu
Dong Yu
217
1
0
02 Oct 2024
The PESQetarian: On the Relevance of Goodhart's Law for Speech
  Enhancement
The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement
Danilo de Oliveira
Simon Welker
Julius Richter
Timo Gerkmann
146
19
0
05 Jun 2024
Objective and subjective evaluation of speech enhancement methods in the
  UDASE task of the 7th CHiME challenge
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge
Simon Leglaive
Matthieu Fraticelli
Hend ElGhazaly
Léonie Borne
Mostafa Sadeghi
Scott Wisdom
Manuel Pariente
J. Hershey
Daniel Pressnitzer
Jon P. Barker
268
16
0
02 Feb 2024
Mixture of Dynamical Variational Autoencoders for Multi-Source
  Trajectory Modeling and Separation
Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation
Xiaoyu Lin
Laurent Girin
Xavier Alameda-Pineda
217
3
0
07 Dec 2023
Unsupervised speech enhancement with diffusion-based generative models
Unsupervised speech enhancement with diffusion-based generative modelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Berné Nortier
Mostafa Sadeghi
Romain Serizel
DiffM
184
17
0
19 Sep 2023
Posterior sampling algorithms for unsupervised speech enhancement with
  recurrent variational autoencoder
Posterior sampling algorithms for unsupervised speech enhancement with recurrent variational autoencoderIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Mostafa Sadeghi
Romain Serizel
BDL
121
3
0
19 Sep 2023
RVAE-EM: Generative speech dereverberation based on recurrent
  variational auto-encoder and convolutive transfer function
RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer functionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Pengyu Wang
Xiaofei Li
DRLDiffM
205
8
0
15 Sep 2023
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Noise-aware Speech Enhancement using Diffusion Probabilistic ModelInterspeech (Interspeech), 2023
Yuchen Hu
Cheng Chen
Ruizhe Li
Qiu-shi Zhu
Eng Siong Chng
DiffM
278
14
0
16 Jul 2023
The Ethical Implications of Generative Audio Models: A Systematic
  Literature Review
The Ethical Implications of Generative Audio Models: A Systematic Literature ReviewAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023
J. Barnett
239
47
0
07 Jul 2023
Unsupervised speech enhancement with deep dynamical generative speech
  and noise models
Unsupervised speech enhancement with deep dynamical generative speech and noise modelsInterspeech (Interspeech), 2023
Xiaoyu Lin
Simon Leglaive
Laurent Girin
Xavier Alameda-Pineda
140
4
0
13 Jun 2023
SE-Bridge: Speech Enhancement with Consistent Brownian Bridge
SE-Bridge: Speech Enhancement with Consistent Brownian Bridge
Zhibin Qiu
Mengfan Fu
Gang Hua
G. Altenbek
Hao Huang
DiffM
151
8
0
23 May 2023
Integrating Uncertainty into Neural Network-based Speech Enhancement
Integrating Uncertainty into Neural Network-based Speech EnhancementIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Hu Fang
Dennis Becker
S. Wermter
Timo Gerkmann
UQCV
179
4
0
15 May 2023
Speech Modeling with a Hierarchical Transformer Dynamical VAE
Speech Modeling with a Hierarchical Transformer Dynamical VAEIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Xiaoyu Lin
Xiaoyu Bie
Simon Leglaive
Laurent Girin
Xavier Alameda-Pineda
BDL
178
3
0
07 Mar 2023
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech
  Enhancement and Dereverberation
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and DereverberationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Jean-Marie Lemercier
Julius Richter
Simon Welker
Timo Gerkmann
DiffM
431
159
0
22 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of
  the art analysis
Deep neural network techniques for monaural speech enhancement: state of the art analysisArtificial Intelligence Review (Artif Intell Rev), 2022
P. Ochieng
237
34
0
01 Dec 2022
Cold Diffusion for Speech Enhancement
Cold Diffusion for Speech EnhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Hao Yen
François Germain
Gordon Wichern
Jonathan Le Roux
DiffM
329
54
0
04 Nov 2022
A weighted-variance variational autoencoder model for speech enhancement
A weighted-variance variational autoencoder model for speech enhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
A. Golmakani
M. Sadeghi
Xavier Alameda-Pineda
Romain Serizel
229
2
0
02 Nov 2022
Audio-visual speech enhancement with a deep Kalman filter generative
  model
Audio-visual speech enhancement with a deep Kalman filter generative modelIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
A. Golmakani
M. Sadeghi
Romain Serizel
DiffM
85
8
0
02 Nov 2022
A Training and Inference Strategy Using Noisy and Enhanced Speech as
  Target for Speech Enhancement without Clean Speech
A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean SpeechInterspeech (Interspeech), 2022
Li-Wei Chen
Yao-Fei Cheng
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
173
4
0
27 Oct 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative Models
Speech Enhancement and Dereverberation with Diffusion-based Generative ModelsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Julius Richter
Simon Welker
Jean-Marie Lemercier
Bunlong Lay
Timo Gerkmann
DiffM
349
315
0
11 Aug 2022
Learning and controlling the source-filter representation of speech with
  a variational autoencoder
Learning and controlling the source-filter representation of speech with a variational autoencoderSpeech Communication (Speech Commun.), 2022
Samir Sadok
Simon Leglaive
Laurent Girin
Xavier Alameda-Pineda
Renaud Séguier
SSLDRLBDL
273
14
0
14 Apr 2022
Speech Enhancement with Score-Based Generative Models in the Complex
  STFT Domain
Speech Enhancement with Score-Based Generative Models in the Complex STFT DomainInterspeech (Interspeech), 2022
Simon Welker
Julius Richter
Timo Gerkmann
DiffM
307
147
0
31 Mar 2022
Speech Enhancement Based on Cyclegan with Noise-informed Training
Speech Enhancement Based on Cyclegan with Noise-informed Training
Wen-Yuan Ting
Syu-Siang Wang
Hsin-Li Chang
B. Su
Yu Tsao
GAN
226
7
0
19 Oct 2021
MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only
  on noisy/ reverberated speech
MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speechIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Szu-Wei Fu
Cheng Yu
Kuo-Hsuan Hung
Mirco Ravanelli
Yu Tsao
217
59
0
12 Oct 2021
1