Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2003.04358
Cited By
Cross modal video representations for weakly supervised active speaker localization
9 March 2020
Rahul Sharma
Krishna Somandepalli
Shrikanth Narayanan
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Cross modal video representations for weakly supervised active speaker localization"
4 / 4 papers shown
Title
Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker Detection
Rahul Sharma
Shrikanth Narayanan
37
8
0
01 Dec 2022
Unsupervised active speaker detection in media content using cross-modal information
Rahul Sharma
Shrikanth Narayanan
24
3
0
24 Sep 2022
MAAS: Multi-modal Assignation for Active Speaker Detection
Juan Carlos León Alcázar
Fabian Caba Heilbron
Ali K. Thabet
Guohao Li
65
51
0
11 Jan 2021
pyannote.audio: neural building blocks for speaker diarization
H. Bredin
Ruiqing Yin
Juan Manuel Coria
G. Gelly
Pavel Korshunov
Marvin Lavechin
D. Fustes
Hadrien Titeux
Wassim Bouaziz
Marie-Philippe Gill
194
313
0
04 Nov 2019
1