ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2007.13976
  4. Cited By
Self-supervised Neural Audio-Visual Sound Source Localization via
  Probabilistic Spatial Modeling

Self-supervised Neural Audio-Visual Sound Source Localization via Probabilistic Spatial Modeling

IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2020
28 July 2020
Yoshiki Masuyama
Yoshiaki Bando
Kohei Yatabe
Y. Sasaki
Masaki Onishi
Yasuhiro Oikawa
    SSL
ArXiv (abs)PDFHTML

Papers citing "Self-supervised Neural Audio-Visual Sound Source Localization via Probabilistic Spatial Modeling"

8 / 8 papers shown
Robust Audio-Visual Segmentation via Audio-Guided Visual Convergent Alignment
Robust Audio-Visual Segmentation via Audio-Guided Visual Convergent AlignmentComputer Vision and Pattern Recognition (CVPR), 2025
Chen Liu
Peike Li
Liying Yang
Dadong Wang
Lincheng Li
Xin Yu
VOS
288
4
0
17 Mar 2025
AV-PedAware: Self-Supervised Audio-Visual Fusion for Dynamic Pedestrian Awareness
AV-PedAware: Self-Supervised Audio-Visual Fusion for Dynamic Pedestrian AwarenessIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Yizhuo Yang
Shenghai Yuan
Muqing Cao
Jianfei Yang
Lihua Xie
605
15
0
11 Nov 2024
STNet: Deep Audio-Visual Fusion Network for Robust Speaker Tracking
STNet: Deep Audio-Visual Fusion Network for Robust Speaker TrackingIEEE transactions on multimedia (IEEE TMM), 2024
Yidi Li
Hong Liu
Bing Yang
458
8
0
08 Oct 2024
Deep Learning for Omnidirectional Vision: A Survey and New Perspectives
Deep Learning for Omnidirectional Vision: A Survey and New Perspectives
Hao Ai
Zidong Cao
Jin Zhu
Haotian Bai
Yucheng Chen
Ling Wang
402
45
0
21 May 2022
Audio Self-supervised Learning: A Survey
Audio Self-supervised Learning: A SurveyPatterns (Patterns), 2022
Shuo Liu
Adria Mallol-Ragolta
Emilia Parada-Cabeleiro
Kun Qian
Xingshuo Jing
Alexander Kathan
Bin Hu
Bjoern W. Schuller
SSL
353
136
0
02 Mar 2022
Multi-Modal Perception Attention Network with Self-Supervised Learning
  for Audio-Visual Speaker Tracking
Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking
Yidi Li
Hong Liu
Hao Tang
276
25
0
14 Dec 2021
Pano-AVQA: Grounded Audio-Visual Question Answering on 360$^\circ$
  Videos
Pano-AVQA: Grounded Audio-Visual Question Answering on 360∘^\circ∘ VideosIEEE International Conference on Computer Vision (ICCV), 2021
Heeseung Yun
Youngjae Yu
Wonsuk Yang
Kangil Lee
Gunhee Kim
367
124
0
11 Oct 2021
A Survey of Sound Source Localization with Deep Learning Methods
A Survey of Sound Source Localization with Deep Learning MethodsJournal of the Acoustical Society of America (JASA), 2021
Pierre-Amaury Grumiaux
Srdjan Kitić
Laurent Girin
Alexandre Guérin
394
344
0
08 Sep 2021
1
Page 1 of 1