Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
15 September 2023
Jun Yu Li
Ruijie Tao
Zexu Pan
Meng Ge
Shuai Wang
Haizhou Li

Papers citing "Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech"

2 / 2 papers shown
M3ANet: Multi-scale and Multi-Modal Alignment Network for Brain-Assisted Target Speaker Extraction
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Cunhang Fan
Ying Chen
Jian Zhou
Zexu Pan
Jingjing Zhang
Youdian Gao
Xiaoke Yang
Zhengqi Wen
Zhao Lv
31 May 2025
On the effectiveness of enrollment speech augmentation for Target Speaker Extraction
Spoken Language Technology Workshop (SLT), 2024
Junjie Li
Ke Zhang
Shuai Wang
Haizhou Li
Man-Wai Mak
Kong Aik Lee
15 Sep 2024