UniCon: Unified Context Network for Robust Active Speaker Detection

5 August 2021

Papers citing "UniCon: Unified Context Network for Robust Active Speaker Detection"

21 / 21 papers shown

Title
CLIP-VAD: Exploiting Vision-Language Models for Voice Activity Detection Andrea Appiani Cigdem Beyan CLIP VLM 28 0 0 18 Oct 2024
Audio-Visual Talker Localization in Video for Spatial Sound Reproduction Davide Berghi Philip J. B. Jackson 29 0 0 01 Jun 2024
Robust Active Speaker Detection in Noisy Environments Siva Sai Nagender Vasireddy Chenxu Zhang Xiaohu Guo Yapeng Tian 19 0 0 27 Mar 2024
Leveraging Visual Supervision for Array-based Active Speaker Detection and Localization Davide Berghi Philip J. B. Jackson 35 5 0 21 Dec 2023
A Real-Time Active Speaker Detection System Integrating an Audio-Visual Signal with a Spatial Querying Mechanism I. Gurvich Ido Leichter Dharmendar Reddy Palle Yossi Asher Alon Vinnikov Igor Abramovski Vishak Gopal Ross Cutler Eyal Krupka 15 4 0 15 Sep 2023
Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos Sagnik Majumder Ziad Al-Halah Kristen Grauman SSL EgoV 32 4 0 10 Jul 2023
Target Active Speaker Detection with Audio-visual Cues Yiding Jiang Ruijie Tao Zexu Pan Haizhou Li 15 16 0 22 May 2023
WASD: A Wilder Active Speaker Detection Dataset Tiago Roxo Joana Cabral Costa Pedro R. M. Inácio Hugo Manuel Proença 14 3 0 09 Mar 2023
A Light Weight Model for Active Speaker Detection Junhua Liao Haihan Duan Kanghui Feng Wanbing Zhao Yanbing Yang Liangyin Chen 24 35 0 08 Mar 2023
AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis Susan Liang Chao Huang Yapeng Tian Anurag Kumar Chenliang Xu VGen 27 27 0 04 Feb 2023
LoCoNet: Long-Short Context Network for Active Speaker Detection Xizi Wang Feng Cheng Gedas Bertasius David J. Crandall 11 14 0 19 Jan 2023
Whose Emotion Matters? Speaking Activity Localisation without Prior Knowledge Hugo C. C. Carneiro C. Weber S. Wermter 17 5 0 23 Nov 2022
Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection Xuan-Bo Chen Haibin Wu H. Meng Hung-yi Lee J. Jang AAML 17 3 0 03 Oct 2022
Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection Kyle Min Sourya Roy Subarna Tripathi T. Guha Somdeb Majumdar 11 36 0 15 Jul 2022
UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2022 Yuanhang Zhang Susan Liang Shuang Yang Shiguang Shan 8 4 0 22 Jun 2022
Rethinking Audio-visual Synchronization for Active Speaker Detection Abudukelimu Wuerkaixi You Zhang Z. Duan Changshui Zhang 11 10 0 21 Jun 2022
End-to-End Active Speaker Detection Juan Carlos León Alcázar M. Cordes Chen Zhao Bernard Ghanem 22 27 0 27 Mar 2022
$Look\&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement$ Look\&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement Jun Xiong Yu Zhou Peng Zhang Lei Xie Wei Huang Yufei Zha 12 20 0 04 Mar 2022
Learning Spatial-Temporal Graphs for Active Speaker Detection Sourya Roy Kyle Min Subarna Tripathi T. Guha Somdeb Majumdar 23 3 0 02 Dec 2021
Self-supervised learning for audio-visual speaker diarization Yifan Ding Yong-mei Xu Shi-Xiong Zhang Yahuan Cong Liqiang Wang VLM 34 29 0 13 Feb 2020
VoxCeleb2: Deep Speaker Recognition Joon Son Chung Arsha Nagrani Andrew Zisserman 214 2,224 0 14 Jun 2018