ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1912.10647
  4. Cited By
Mixture of Inference Networks for VAE-based Audio-visual Speech
  Enhancement
v1v2v3v4 (latest)

Mixture of Inference Networks for VAE-based Audio-visual Speech Enhancement

IEEE Transactions on Signal Processing (IEEE Trans. Signal Process.), 2019
23 December 2019
M. Sadeghi
Xavier Alameda-Pineda
ArXiv (abs)PDFHTML

Papers citing "Mixture of Inference Networks for VAE-based Audio-visual Speech Enhancement"

12 / 12 papers shown
Title
AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech
  Separation By Leveraging Narrow- and Cross-Band Modeling
AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling
Vahid Ahmadi Kalkhorani
Cheng Yu
Anurag Kumar
Ke Tan
Buye Xu
DeLiang Wang
289
5
0
17 Jun 2024
Cortex Inspired Learning to Recover Damaged Signal Modality with ReD-SOM
  Model
Cortex Inspired Learning to Recover Damaged Signal Modality with ReD-SOM ModelIEEE International Joint Conference on Neural Network (IJCNN), 2023
Artem R. Muliukov
Laurent Rodriguez
Benoit Miramond
105
1
0
27 Jul 2023
SE-Bridge: Speech Enhancement with Consistent Brownian Bridge
SE-Bridge: Speech Enhancement with Consistent Brownian Bridge
Zhibin Qiu
Mengfan Fu
Gang Hua
G. Altenbek
Hao Huang
DiffM
151
8
0
23 May 2023
Audio-visual speech enhancement with a deep Kalman filter generative
  model
Audio-visual speech enhancement with a deep Kalman filter generative modelIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
A. Golmakani
M. Sadeghi
Romain Serizel
DiffM
85
8
0
02 Nov 2022
SRTNet: Time Domain Speech Enhancement Via Stochastic Refinement
SRTNet: Time Domain Speech Enhancement Via Stochastic RefinementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zhibin Qiu
Mengfan Fu
Yinfeng Yu
Lili Yin
Gang Hua
Hao-Ming Huang
DiffM
231
22
0
30 Oct 2022
Expression-preserving face frontalization improves visually assisted
  speech processing
Expression-preserving face frontalization improves visually assisted speech processingInternational Journal of Computer Vision (IJCV), 2022
Zhiqi Kang
M. Sadeghi
Radu Horaud
Xavier Alameda-Pineda
CVBM
355
8
0
06 Apr 2022
VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer
VoViT: Low Latency Graph-based Audio-Visual Voice Separation TransformerEuropean Conference on Computer Vision (ECCV), 2022
Juan F. Montesinos
V. S. Kadandale
G. Haro
ViT
222
25
0
08 Mar 2022
The impact of removing head movements on audio-visual speech enhancement
The impact of removing head movements on audio-visual speech enhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zhiqi Kang
M. Sadeghi
Radu Horaud
Xavier Alameda-Pineda
Jacob Donley
Anurag Kumar
CVBM
153
5
0
01 Feb 2022
Unsupervised Speech Enhancement using Dynamical Variational
  Auto-Encoders
Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders
Xiaoyu Bie
Simon Leglaive
Xavier Alameda-Pineda
Laurent Girin
DiffM
264
60
0
23 Jun 2021
Variational Structured Attention Networks for Deep Visual Representation
  Learning
Variational Structured Attention Networks for Deep Visual Representation LearningIEEE Transactions on Image Processing (TIP), 2021
Guanglei Yang
Paolo Rota
Xavier Alameda-Pineda
Dan Xu
M. Ding
Elisa Ricci
3DPC
144
5
0
05 Mar 2021
Switching Variational Auto-Encoders for Noise-Agnostic Audio-visual
  Speech Enhancement
Switching Variational Auto-Encoders for Noise-Agnostic Audio-visual Speech EnhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
M. Sadeghi
Xavier Alameda-Pineda
76
12
0
08 Feb 2021
Deep Variational Generative Models for Audio-visual Speech Separation
Deep Variational Generative Models for Audio-visual Speech Separation
V. Nguyen
M. Sadeghi
Elisa Ricci
Xavier Alameda-Pineda
SSLDRL
186
10
0
17 Aug 2020
1