Robust Audio-Visual Segmentation via Audio-Guided Visual Convergent AlignmentComputer Vision and Pattern Recognition (CVPR), 2025 |
AV-PedAware: Self-Supervised Audio-Visual Fusion for Dynamic Pedestrian AwarenessIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023 |
STNet: Deep Audio-Visual Fusion Network for Robust Speaker TrackingIEEE transactions on multimedia (IEEE TMM), 2024 |
Audio Self-supervised Learning: A SurveyPatterns (Patterns), 2022 Shuo Liu Adria Mallol-Ragolta Emilia Parada-Cabeleiro Kun Qian Xingshuo Jing Alexander Kathan Bin Hu Bjoern W. Schuller |
Pano-AVQA: Grounded Audio-Visual Question Answering on 360
VideosIEEE International Conference on Computer Vision (ICCV), 2021 |
A Survey of Sound Source Localization with Deep Learning MethodsJournal of the Acoustical Society of America (JASA), 2021 |