RAVSS: Robust Audio-Visual Speech Separation in Multi-Speaker Scenarios with Missing Visual Cues
ACM Multimedia (MM), 2024
Multi-Modal Multi-Correlation Learning for Audio-Visual Speech Separation
Interspeech (Interspeech), 2022
VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer
European Conference on Computer Vision (ECCV), 2022