ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2401.14185
  4. Cited By
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down
  Fusion

TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion

25 January 2024
Samuel Pegg
Kai Li
Xiaolin Hu
ArXivPDFHTML

Papers citing "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion"

6 / 6 papers shown
Title
Cross-attention Inspired Selective State Space Models for Target Sound
  Extraction
Cross-attention Inspired Selective State Space Models for Target Sound Extraction
Donghang Wu
Yiwen Wang
Xihong Wu
T. Qu
Mamba
23
3
0
07 Sep 2024
Speech Separation Using an Asynchronous Fully Recurrent Convolutional
  Neural Network
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
Xiaolin Hu
Kai Li
Weiyi Zhang
Yi Luo
Jean-Marie Lemercier
Timo Gerkmann
41
47
0
04 Dec 2021
The Right to Talk: An Audio-Visual Transformer Approach
The Right to Talk: An Audio-Visual Transformer Approach
Thanh-Dat Truong
C. Duong
T. D. Vu
H. Pham
Bhiksha Raj
Ngan Le
Khoa Luu
44
36
0
06 Aug 2021
A cappella: Audio-visual Singing Voice Separation
A cappella: Audio-visual Singing Voice Separation
Juan F. Montesinos
V. S. Kadandale
G. Haro
30
13
0
20 Apr 2021
Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive
  Locally Recurrent Networks
Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive Locally Recurrent Networks
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
27
26
0
13 Jan 2021
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
Ruohan Gao
Kristen Grauman
CVBM
185
196
0
08 Jan 2021
1