ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2212.10744
  4. Cited By
An Audio-Visual Speech Separation Model Inspired by
  Cortico-Thalamo-Cortical Circuits

An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits

21 December 2022
Kai Li
Fenghua Xie
Hang Chen
K. Yuan
Xiaolin Hu
ArXivPDFHTML

Papers citing "An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits"

15 / 15 papers shown
Title
Apollo: Band-sequence Modeling for High-Quality Audio Restoration
Apollo: Band-sequence Modeling for High-Quality Audio Restoration
Kai Li
Yi Luo
31
0
0
08 Jan 2025
Spoofing-Aware Speaker Verification Robust Against Domain and Channel
  Mismatches
Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches
Chang Zeng
Xiaoxiao Miao
Xin Wang
Erica Cooper
Junichi Yamagishi
AAML
35
0
0
10 Sep 2024
AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech
  Separation By Leveraging Narrow- and Cross-Band Modeling
AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling
Vahid Ahmadi Kalkhorani
Cheng Yu
Anurag Kumar
Ke Tan
Buye Xu
DeLiang Wang
32
0
0
17 Jun 2024
Separate in the Speech Chain: Cross-Modal Conditional Audio-Visual
  Target Speech Extraction
Separate in the Speech Chain: Cross-Modal Conditional Audio-Visual Target Speech Extraction
Zhaoxi Mu
Xinyu Yang
27
5
0
19 Apr 2024
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down
  Fusion
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion
Samuel Pegg
Kai Li
Xiaolin Hu
24
1
0
25 Jan 2024
RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual
  Speech Separation
RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation
Samuel Pegg
Kai Li
Xiaolin Hu
19
4
0
29 Sep 2023
Rethinking the visual cues in audio-visual speaker extraction
Rethinking the visual cues in audio-visual speaker extraction
Junjie Li
Meng Ge
Zexu Pan
Rui Cao
Longbiao Wang
J. Dang
Shiliang Zhang
12
9
0
05 Jun 2023
A Survey of Knowledge Graph Reasoning on Graph Types: Static, Dynamic,
  and Multimodal
A Survey of Knowledge Graph Reasoning on Graph Types: Static, Dynamic, and Multimodal
K. Liang
Lingyuan Meng
Meng Liu
Yue Liu
Wenxuan Tu
Siwei Wang
Sihang Zhou
Xinwang Liu
Fu Sun
LRM
24
107
0
12 Dec 2022
Speech Separation Using an Asynchronous Fully Recurrent Convolutional
  Neural Network
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
Xiaolin Hu
Kai Li
Weiyi Zhang
Yi Luo
Jean-Marie Lemercier
Timo Gerkmann
44
47
0
04 Dec 2021
The Right to Talk: An Audio-Visual Transformer Approach
The Right to Talk: An Audio-Visual Transformer Approach
Thanh-Dat Truong
C. Duong
T. D. Vu
H. Pham
Bhiksha Raj
Ngan Le
Khoa Luu
55
36
0
06 Aug 2021
A cappella: Audio-visual Singing Voice Separation
A cappella: Audio-visual Singing Voice Separation
Juan F. Montesinos
V. S. Kadandale
G. Haro
38
16
0
20 Apr 2021
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
Ruohan Gao
Kristen Grauman
CVBM
185
198
0
08 Jan 2021
Lipreading using Temporal Convolutional Networks
Lipreading using Temporal Convolutional Networks
Brais Martínez
Pingchuan Ma
Stavros Petridis
M. Pantic
168
238
0
23 Jan 2020
VoxCeleb2: Deep Speaker Recognition
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
214
2,224
0
14 Jun 2018
Bridging the Gaps Between Residual Learning, Recurrent Neural Networks
  and Visual Cortex
Bridging the Gaps Between Residual Learning, Recurrent Neural Networks and Visual Cortex
Q. Liao
T. Poggio
206
255
0
13 Apr 2016
1