ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1908.02590
  4. Cited By
Audio-visual Speech Enhancement Using Conditional Variational
  Auto-Encoders

Audio-visual Speech Enhancement Using Conditional Variational Auto-Encoders

7 August 2019
M. Sadeghi
Simon Leglaive
Xavier Alameda-Pineda
Laurent Girin
Radu Horaud
    DiffM
ArXivPDFHTML

Papers citing "Audio-visual Speech Enhancement Using Conditional Variational Auto-Encoders"

14 / 14 papers shown
Title
Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation
Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation
Akam Rahimi
Triantafyllos Afouras
Andrew Zisserman
40
28
0
02 Jan 2025
Diffusion-based Unsupervised Audio-visual Speech Enhancement
Diffusion-based Unsupervised Audio-visual Speech Enhancement
Jean-Eudes Ayilo
Mostafa Sadeghi
Romain Serizel
Xavier Alameda-Pineda
DiffM
25
0
0
04 Oct 2024
FlowAVSE: Efficient Audio-Visual Speech Enhancement with Conditional
  Flow Matching
FlowAVSE: Efficient Audio-Visual Speech Enhancement with Conditional Flow Matching
Chaeyoung Jung
Suyeon Lee
Ji-Hoon Kim
Joon Son Chung
DiffM
47
4
0
13 Jun 2024
Missingness-resilient Video-enhanced Multimodal Disfluency Detection
Missingness-resilient Video-enhanced Multimodal Disfluency Detection
Payal Mohapatra
Shamika Likhite
Subrata Biswas
Bashima Islam
Qi Zhu
49
2
0
11 Jun 2024
Audio-Visual Speech Enhancement in Noisy Environments via Emotion-Based
  Contextual Cues
Audio-Visual Speech Enhancement in Noisy Environments via Emotion-Based Contextual Cues
Tassadaq Hussain
K. Dashtipour
Yu Tsao
Amir Hussain
29
2
0
26 Feb 2024
Audio-visual video-to-speech synthesis with synthesized input audio
Audio-visual video-to-speech synthesis with synthesized input audio
Triantafyllos Kefalas
Yannis Panagakis
M. Pantic
VGen
DiffM
38
1
0
31 Jul 2023
Integrating Uncertainty into Neural Network-based Speech Enhancement
Integrating Uncertainty into Neural Network-based Speech Enhancement
Hu Fang
Dennis Becker
S. Wermter
Timo Gerkmann
UQCV
32
2
0
15 May 2023
Neural Target Speech Extraction: An Overview
Neural Target Speech Extraction: An Overview
Kateřina Žmolíková
Marc Delcroix
Tsubasa Ochiai
K. Kinoshita
JanHonza'' vCernocký
Dong Yu
23
86
0
31 Jan 2023
A weighted-variance variational autoencoder model for speech enhancement
A weighted-variance variational autoencoder model for speech enhancement
A. Golmakani
M. Sadeghi
Xavier Alameda-Pineda
Romain Serizel
25
1
0
02 Nov 2022
Visual Acoustic Matching
Visual Acoustic Matching
Changan Chen
Ruohan Gao
P. Calamia
Kristen Grauman
21
56
0
14 Feb 2022
A Novel Temporal Attentive-Pooling based Convolutional Recurrent
  Architecture for Acoustic Signal Enhancement
A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement
Tassadaq Hussain
Wei-Chien Wang
M. Gogate
K. Dashtipour
Yu Tsao
Xugang Lu
A. Ahsan
Amir Hussain
21
3
0
24 Jan 2022
SINVAD: Search-based Image Space Navigation for DNN Image Classifier
  Test Input Generation
SINVAD: Search-based Image Space Navigation for DNN Image Classifier Test Input Generation
Sungmin Kang
R. Feldt
S. Yoo
AAML
26
32
0
19 May 2020
Robust Speaker Recognition Using Speech Enhancement And Attention Model
Robust Speaker Recognition Using Speech Enhancement And Attention Model
Yanpei Shi
Qiang Huang
Thomas Hain
22
25
0
14 Jan 2020
Mixture of Inference Networks for VAE-based Audio-visual Speech
  Enhancement
Mixture of Inference Networks for VAE-based Audio-visual Speech Enhancement
M. Sadeghi
Xavier Alameda-Pineda
13
21
0
23 Dec 2019
1