Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2106.12271
Cited By
v1
v2
v3 (latest)
Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders
23 June 2021
Xiaoyu Bie
Simon Leglaive
Xavier Alameda-Pineda
Laurent Girin
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders"
28 / 28 papers shown
Title
Capacity-Net-Based RIS Precoding Design without Channel Estimation for mmWave MIMO System
IEEE International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC), 2024
Chun-Yuan Huang
Po-Heng Chou
Wan-Jen Huang
Ying-Ren Chien
Yu Tsao
76
3
0
30 Sep 2025
LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models
Beilong Tang
Bang Zeng
Ming Li
AI4TS
240
3
0
10 Apr 2025
AudioMiXR: Spatial Audio Object Manipulation with 6DoF for Sound Design in Augmented Reality
Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2025
Brandon Woodard
Margarita Geleta
Joseph J. LaViola Jr.
Andrea Fanelli
Rhonda Wilson
778
22
0
05 Feb 2025
Diffusion-based Unsupervised Audio-visual Speech Enhancement
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Jean-Eudes Ayilo
Mostafa Sadeghi
Romain Serizel
Xavier Alameda-Pineda
DiffM
265
6
0
04 Oct 2024
Restorative Speech Enhancement: A Progressive Approach Using SE and Codec Modules
Hsin-Tien Chiang
Hao Zhang
Yong Xu
Meng Yu
Dong Yu
177
1
0
02 Oct 2024
The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement
Danilo de Oliveira
Simon Welker
Julius Richter
Timo Gerkmann
146
19
0
05 Jun 2024
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge
Simon Leglaive
Matthieu Fraticelli
Hend ElGhazaly
Léonie Borne
Mostafa Sadeghi
Scott Wisdom
Manuel Pariente
J. Hershey
Daniel Pressnitzer
Jon P. Barker
244
16
0
02 Feb 2024
Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation
Xiaoyu Lin
Laurent Girin
Xavier Alameda-Pineda
189
3
0
07 Dec 2023
Unsupervised speech enhancement with diffusion-based generative models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Berné Nortier
Mostafa Sadeghi
Romain Serizel
DiffM
172
16
0
19 Sep 2023
Posterior sampling algorithms for unsupervised speech enhancement with recurrent variational autoencoder
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Mostafa Sadeghi
Romain Serizel
BDL
113
3
0
19 Sep 2023
RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Pengyu Wang
Xiaofei Li
DRL
DiffM
173
8
0
15 Sep 2023
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Interspeech (Interspeech), 2023
Yuchen Hu
Cheng Chen
Ruizhe Li
Qiu-shi Zhu
Eng Siong Chng
DiffM
262
14
0
16 Jul 2023
The Ethical Implications of Generative Audio Models: A Systematic Literature Review
AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023
J. Barnett
223
46
0
07 Jul 2023
Unsupervised speech enhancement with deep dynamical generative speech and noise models
Interspeech (Interspeech), 2023
Xiaoyu Lin
Simon Leglaive
Laurent Girin
Xavier Alameda-Pineda
132
4
0
13 Jun 2023
SE-Bridge: Speech Enhancement with Consistent Brownian Bridge
Zhibin Qiu
Mengfan Fu
Gang Hua
G. Altenbek
Hao Huang
DiffM
151
8
0
23 May 2023
Integrating Uncertainty into Neural Network-based Speech Enhancement
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Hu Fang
Dennis Becker
S. Wermter
Timo Gerkmann
UQCV
167
4
0
15 May 2023
Speech Modeling with a Hierarchical Transformer Dynamical VAE
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Xiaoyu Lin
Xiaoyu Bie
Simon Leglaive
Laurent Girin
Xavier Alameda-Pineda
BDL
174
3
0
07 Mar 2023
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Jean-Marie Lemercier
Julius Richter
Simon Welker
Timo Gerkmann
DiffM
403
156
0
22 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of the art analysis
Artificial Intelligence Review (Artif Intell Rev), 2022
P. Ochieng
225
33
0
01 Dec 2022
Cold Diffusion for Speech Enhancement
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Hao Yen
François Germain
Gordon Wichern
Jonathan Le Roux
DiffM
293
54
0
04 Nov 2022
A weighted-variance variational autoencoder model for speech enhancement
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
A. Golmakani
M. Sadeghi
Xavier Alameda-Pineda
Romain Serizel
225
2
0
02 Nov 2022
Audio-visual speech enhancement with a deep Kalman filter generative model
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
A. Golmakani
M. Sadeghi
Romain Serizel
DiffM
81
8
0
02 Nov 2022
A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech
Interspeech (Interspeech), 2022
Li-Wei Chen
Yao-Fei Cheng
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
169
4
0
27 Oct 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative Models
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Julius Richter
Simon Welker
Jean-Marie Lemercier
Bunlong Lay
Timo Gerkmann
DiffM
325
311
0
11 Aug 2022
Learning and controlling the source-filter representation of speech with a variational autoencoder
Speech Communication (Speech Commun.), 2022
Samir Sadok
Simon Leglaive
Laurent Girin
Xavier Alameda-Pineda
Renaud Séguier
SSL
DRL
BDL
229
14
0
14 Apr 2022
Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain
Interspeech (Interspeech), 2022
Simon Welker
Julius Richter
Timo Gerkmann
DiffM
279
146
0
31 Mar 2022
Speech Enhancement Based on Cyclegan with Noise-informed Training
Wen-Yuan Ting
Syu-Siang Wang
Hsin-Li Chang
B. Su
Yu Tsao
GAN
218
7
0
19 Oct 2021
MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Szu-Wei Fu
Cheng Yu
Kuo-Hsuan Hung
Mirco Ravanelli
Yu Tsao
217
59
0
12 Oct 2021
1