Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.14185
Cited By
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion
25 January 2024
Samuel Pegg
Kai Li
Xiaolin Hu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion"
6 / 6 papers shown
Title
Cross-attention Inspired Selective State Space Models for Target Sound Extraction
Donghang Wu
Yiwen Wang
Xihong Wu
T. Qu
Mamba
23
3
0
07 Sep 2024
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
Xiaolin Hu
Kai Li
Weiyi Zhang
Yi Luo
Jean-Marie Lemercier
Timo Gerkmann
41
47
0
04 Dec 2021
The Right to Talk: An Audio-Visual Transformer Approach
Thanh-Dat Truong
C. Duong
T. D. Vu
H. Pham
Bhiksha Raj
Ngan Le
Khoa Luu
44
36
0
06 Aug 2021
A cappella: Audio-visual Singing Voice Separation
Juan F. Montesinos
V. S. Kadandale
G. Haro
30
13
0
20 Apr 2021
Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive Locally Recurrent Networks
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
27
26
0
13 Jan 2021
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
Ruohan Gao
Kristen Grauman
CVBM
185
196
0
08 Jan 2021
1