v1v2v3 (latest)

Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders

23 June 2021

Xiaoyu Bie

Simon Leglaive

Xavier Alameda-Pineda

Laurent Girin

DiffM

ArXiv (abs)PDF HTML

Papers citing "Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders"

28 / 28 papers shown

Capacity-Net-Based RIS Precoding Design without Channel Estimation for mmWave MIMO SystemIEEE International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC), 2024

116

30 Sep 2025

LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models

268

10 Apr 2025

AudioMiXR: Spatial Audio Object Manipulation with 6DoF for Sound Design in Augmented RealityProceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2025

Brandon Woodard

Margarita Geleta

Joseph J. LaViola Jr.

Andrea Fanelli

Rhonda Wilson

862

05 Feb 2025

Diffusion-based Unsupervised Audio-visual Speech EnhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Jean-Eudes Ayilo

Mostafa Sadeghi

Romain Serizel

Xavier Alameda-Pineda

DiffM

309

04 Oct 2024

Restorative Speech Enhancement: A Progressive Approach Using SE and Codec Modules

Hsin-Tien Chiang

Hao Zhang

Yong Xu

Meng Yu

Dong Yu

229

02 Oct 2024

The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement

146

05 Jun 2024

Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge

300

02 Feb 2024

Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation

Xiaoyu Lin

Laurent Girin

Xavier Alameda-Pineda

221

07 Dec 2023

Unsupervised speech enhancement with diffusion-based generative modelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

192

19 Sep 2023

Posterior sampling algorithms for unsupervised speech enhancement with recurrent variational autoencoderIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Mostafa Sadeghi

Romain Serizel

BDL

157

19 Sep 2023

RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer functionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Pengyu Wang

Xiaofei Li

DRL DiffM

221

15 Sep 2023

Noise-aware Speech Enhancement using Diffusion Probabilistic ModelInterspeech (Interspeech), 2023

Yuchen Hu

294

16 Jul 2023

The Ethical Implications of Generative Audio Models: A Systematic Literature ReviewAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023

J. Barnett

251

07 Jul 2023

Unsupervised speech enhancement with deep dynamical generative speech and noise modelsInterspeech (Interspeech), 2023

Xiaoyu Lin

Simon Leglaive

Laurent Girin

Xavier Alameda-Pineda

144

13 Jun 2023

SE-Bridge: Speech Enhancement with Consistent Brownian Bridge

163

23 May 2023

Integrating Uncertainty into Neural Network-based Speech EnhancementIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

179

15 May 2023

Speech Modeling with a Hierarchical Transformer Dynamical VAEIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Xavier Alameda-Pineda

BDL

178

07 Mar 2023

StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and DereverberationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

439

161

22 Dec 2022

Deep neural network techniques for monaural speech enhancement: state of the art analysisArtificial Intelligence Review (Artif Intell Rev), 2022

P. Ochieng

263

01 Dec 2022

Cold Diffusion for Speech EnhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

361

04 Nov 2022

A weighted-variance variational autoencoder model for speech enhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

A. Golmakani

M. Sadeghi

Xavier Alameda-Pineda

Romain Serizel

229

02 Nov 2022

Audio-visual speech enhancement with a deep Kalman filter generative modelIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

02 Nov 2022

A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean SpeechInterspeech (Interspeech), 2022

Li-Wei Chen

Yao-Fei Cheng

Hung-Shin Lee

Yu Tsao

Hsin-Min Wang

197

27 Oct 2022

Speech Enhancement and Dereverberation with Diffusion-based Generative ModelsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

353

318

11 Aug 2022

Learning and controlling the source-filter representation of speech with a variational autoencoderSpeech Communication (Speech Commun.), 2022

Samir Sadok

Simon Leglaive

Laurent Girin

Xavier Alameda-Pineda

Renaud Séguier

SSL DRL BDL

285

14 Apr 2022

Speech Enhancement with Score-Based Generative Models in the Complex STFT DomainInterspeech (Interspeech), 2022

331

147

31 Mar 2022

Speech Enhancement Based on Cyclegan with Noise-informed Training

246

19 Oct 2021

MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speechIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Mirco Ravanelli

225

12 Oct 2021