ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2306.02625
  4. Cited By
Rethinking the visual cues in audio-visual speaker extraction

Rethinking the visual cues in audio-visual speaker extraction

5 June 2023
Junjie Li
Meng Ge
Zexu Pan
Rui Cao
Longbiao Wang
J. Dang
Shiliang Zhang
ArXivPDFHTML

Papers citing "Rethinking the visual cues in audio-visual speaker extraction"

2 / 2 papers shown
Title
LiMuSE: Lightweight Multi-modal Speaker Extraction
LiMuSE: Lightweight Multi-modal Speaker Extraction
Qinghua Liu
Yating Huang
Yunzhe Hao
Jiaming Xu
Bo Xu
40
6
0
07 Nov 2021
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
Ruohan Gao
Kristen Grauman
CVBM
190
198
0
08 Jan 2021
1