2210.17456
Cited By
Audio-Visual Speech Enhancement and Separation by Utilizing Multi-Modal Self-Supervised Embeddings
31 October 2022
Ethan Chern
Kuo-Hsuan Hung
Yi-Ting Chen
Tassadaq Hussain
M. Gogate
Amir Hussain
Yu Tsao
Jen-Cheng Hou
SSL
Papers citing
"Audio-Visual Speech Enhancement and Separation by Utilizing Multi-Modal Self-Supervised Embeddings"
10 / 10 papers shown
Title
CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative Synchronization
Detao Bai
Zhiheng Ma
Xihan Wei
Liefeng Bo
55
0
0
06 May 2025
Diffusion-based Unsupervised Audio-visual Speech Enhancement
Jean-Eudes Ayilo
Mostafa Sadeghi
Romain Serizel
Xavier Alameda-Pineda
DiffM
15
0
0
04 Oct 2024
FlowAVSE: Efficient Audio-Visual Speech Enhancement with Conditional Flow Matching
Chaeyoung Jung
Suyeon Lee
Ji-Hoon Kim
Joon Son Chung
DiffM
35
4
0
13 Jun 2024
Towards Environmental Preference Based Speech Enhancement For Individualised Multi-Modal Hearing Aids
Jasper Kirton-Wingate
Shafique Ahmed
Adeel Hussain
M. Gogate
K. Dashtipour
Jen-Cheng Hou
Tassadaq Hussain
Yu Tsao
Amir Hussain
16
0
0
26 Feb 2024
Audio-Visual Speech Enhancement in Noisy Environments via Emotion-Based Contextual Cues
Tassadaq Hussain
K. Dashtipour
Yu Tsao
Amir Hussain
25
2
0
26 Feb 2024
Self-Supervised Adaptive AV Fusion Module for Pre-Trained ASR Models
Christopher Simic
Tobias Bocklet
19
5
0
21 Dec 2023
AV-Lip-Sync+: Leveraging AV-HuBERT to Exploit Multimodal Inconsistency for Video Deepfake Detection
Sahibzada Adil Shahzad
Ammarah Hashmi
Yan-Tsung Peng
Yu Tsao
Hsin-Min Wang
8
5
0
05 Nov 2023
Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement
Shafique Ahmed
Chia-Wei Chen
Wenze Ren
Chin-Jou Li
Ernie Chu
Jun-Cheng Chen
Amir Hussain
H. Wang
Yu Tsao
Jen-Cheng Hou
20
1
0
20 Sep 2023
AV2Wav: Diffusion-Based Re-synthesis from Continuous Self-supervised Features for Audio-Visual Speech Enhancement
Ju-Chieh Chou
Chung-Ming Chien
Karen Livescu
DiffM
11
4
0
14 Sep 2023
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
Ruohan Gao
Kristen Grauman
CVBM
185
196
0
08 Jan 2021