ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2109.03465
  4. Cited By
A Survey of Sound Source Localization with Deep Learning Methods
v1v2v3 (latest)

A Survey of Sound Source Localization with Deep Learning Methods

Journal of the Acoustical Society of America (JASA), 2021
8 September 2021
Pierre-Amaury Grumiaux
Srdjan Kitić
Laurent Girin
Alexandre Guérin
ArXiv (abs)PDFHTML

Papers citing "A Survey of Sound Source Localization with Deep Learning Methods"

50 / 52 papers shown
Acoustic Imaging for UAV Detection: Dense Beamformed Energy Maps and U-Net SELD
Acoustic Imaging for UAV Detection: Dense Beamformed Energy Maps and U-Net SELD
Belman Jahir Rodriguez
Sergio Chevtchenko
Marcelo Herrera Martinez
Yeshwant Bethy
Saeed Afshar
161
0
0
30 Mar 2026
DeepASA: An Object-Oriented Multi-Purpose Network for Auditory Scene Analysis
DeepASA: An Object-Oriented Multi-Purpose Network for Auditory Scene Analysis
Dongheon Lee
Younghoo Kwon
Jung-Woo Choi
349
1
0
24 Dec 2025
Systematic Evaluation of Time-Frequency Features for Binaural Sound Source Localization
Systematic Evaluation of Time-Frequency Features for Binaural Sound Source Localization
Davoud Shariat Panah
Alessandro Ragano
Dan Barry
Jan Skoglund
Andrew Hines
164
0
0
17 Nov 2025
Multi-Speaker DOA Estimation in Binaural Hearing Aids using Deep Learning and Speaker Count Fusion
Multi-Speaker DOA Estimation in Binaural Hearing Aids using Deep Learning and Speaker Count Fusion
Farnaz Jazaeri
Homayoun Kamkar-Parsi
François Grondin
M. Bouchard
128
0
0
23 Sep 2025
Towards Low-Latency Tracking of Multiple Speakers With Short-Context Speaker Embeddings
Towards Low-Latency Tracking of Multiple Speakers With Short-Context Speaker Embeddings
Taous Iatariene
Alexandre Guérin
Romain Serizel
185
0
0
18 Aug 2025
From Waveforms to Pixels: A Survey on Audio-Visual Segmentation
From Waveforms to Pixels: A Survey on Audio-Visual Segmentation
Jia Li
Yapeng Tian
VOS
264
3
0
29 Jul 2025
Single-Microphone-Based Sound Source Localization for Mobile Robots in Reverberant Environments
Single-Microphone-Based Sound Source Localization for Mobile Robots in Reverberant Environments
Jiang Wang
Runwu Shi
Benjamin Yen
He Kong
Kazuhiro Nakadai
195
0
0
19 Jun 2025
Tracking of Intermittent and Moving Speakers : Dataset and Metrics
Tracking of Intermittent and Moving Speakers : Dataset and Metrics
Taous Iatariene
Alexandre Guérin
Romain Serizel
328
2
0
11 Jun 2025
CNN-based Robust Sound Source Localization with SRP-PHAT for the Extreme Edge
CNN-based Robust Sound Source Localization with SRP-PHAT for the Extreme EdgeACM Transactions on Embedded Computing Systems (TECS), 2023
Jun Yin
Marian Verhelst
330
1
0
03 Mar 2025
PSELDNets: Pre-trained Neural Networks on a Large-scale Synthetic Dataset for Sound Event Localization and Detection
PSELDNets: Pre-trained Neural Networks on a Large-scale Synthetic Dataset for Sound Event Localization and DetectionIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2024
Jinbo Hu
Yin Cao
Ming Wu
Fang Kang
Feiran Yang
Wenwu Wang
Mark D. Plumbley
J. Yang
437
8
0
10 Nov 2024
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
SonicSim: A customizable simulation platform for speech processing in moving sound source scenariosInternational Conference on Learning Representations (ICLR), 2024
Kai Li
Wendi Sang
Chang Zeng
Runxuan Yang
Guo Chen
Xiaolin Hu
361
9
0
02 Oct 2024
Conformal Prediction for Manifold-based Source Localization with Gaussian Processes
Conformal Prediction for Manifold-based Source Localization with Gaussian Processes
Vadim Rozenfeld
Bracha Laufer Goldshtein
347
2
0
18 Sep 2024
wav2pos: Sound Source Localization using Masked Autoencoders
wav2pos: Sound Source Localization using Masked AutoencodersInternational Conference on Indoor Positioning and Indoor Navigation (IPIN), 2024
Axel Berg
Jens Gulin
Mark O'Connor
Chuteng Zhou
Karl Åström
Magnus Oskarsson
226
6
0
28 Aug 2024
Computer Audition: From Task-Specific Machine Learning to Foundation Models
Computer Audition: From Task-Specific Machine Learning to Foundation Models
Andreas Triantafyllopoulos
Iosif Tsangko
Alexander Gebhard
A. Mesaros
Maria Sandsten
B. Schuller
474
11
0
22 Jul 2024
RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for
  Dynamic Speech Enhancement and Localization
RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization
Bing Yang
Changsheng Quan
Yabo Wang
Pengyu Wang
Yujie Yang
Ying Fang
Nian Shao
Hui Bu
Xin Xu
Xiaofei Li
257
23
0
28 Jun 2024
Winner-takes-all learners are geometry-aware conditional density
  estimators
Winner-takes-all learners are geometry-aware conditional density estimatorsInternational Conference on Machine Learning (ICML), 2024
Victor Letzelter
David Perera
Cédric Rommel
Mathieu Fontaine
S. Essid
Gael Richard
Patrick Pérez
273
6
0
07 Jun 2024
The Neural-SRP method for positional sound source localization
The Neural-SRP method for positional sound source localizationAsilomar Conference on Signals, Systems and Computers (ACSSC), 2023
Eric Grinstein
Toon van Waterschoot
Mike Brookes
Patrick A. Naylor
224
3
0
14 Mar 2024
Enhanced Sound Event Localization and Detection in Real 360-degree
  audio-visual soundscapes
Enhanced Sound Event Localization and Detection in Real 360-degree audio-visual soundscapes
Adrian S. Roman
Baladithya Balamurugan
Rithik Pothuganti
163
10
0
29 Jan 2024
Selective-Memory Meta-Learning with Environment Representations for
  Sound Event Localization and Detection
Selective-Memory Meta-Learning with Environment Representations for Sound Event Localization and Detection
Jinbo Hu
Yin Cao
Ming Wu
Qiuqiang Kong
Feiran Yang
Mark D. Plumbley
Jun Yang
207
6
0
27 Dec 2023
Sound Source Localization for a Source inside a Structure using
  Ac-CycleGAN
Sound Source Localization for a Source inside a Structure using Ac-CycleGAN
S. Kita
Choong Sik Park
Y. Kajikawa
232
0
0
08 Dec 2023
Resilient Multiple Choice Learning: A learned scoring scheme with
  application to audio scene analysis
Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysisNeural Information Processing Systems (NeurIPS), 2023
Victor Letzelter
Mathieu Fontaine
Mickaël Chen
Patrick Pérez
S. Essid
Ga¨el Richard
364
14
0
02 Nov 2023
Study of speaker localization with binaural microphone array
  incorporating auditory filters and lateral angle estimation
Study of speaker localization with binaural microphone array incorporating auditory filters and lateral angle estimationApplied Acoustics (Appl. Acoust.), 2023
Yanir Maymon
Israel Nelken
B. Rafaely
232
1
0
31 Oct 2023
Sound Source Distance Estimation in Diverse and Dynamic Acoustic
  Conditions
Sound Source Distance Estimation in Diverse and Dynamic Acoustic ConditionsIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023
Saksham Singh Kushwaha
Irán R. Román
Magdalena Fuentes
J. P. Bello
158
16
0
17 Sep 2023
A Real-Time Active Speaker Detection System Integrating an Audio-Visual
  Signal with a Spatial Querying Mechanism
A Real-Time Active Speaker Detection System Integrating an Audio-Visual Signal with a Spatial Querying MechanismIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
I. Gurvich
Ido Leichter
Dharmendar Reddy Palle
Yossi Asher
Alon Vinnikov
Igor Abramovski
Vishak Gopal
Ross Cutler
Eyal Krupka
236
4
0
15 Sep 2023
Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning
Spatial LibriSpeech: An Augmented Dataset for Spatial Audio LearningInterspeech (Interspeech), 2023
Miguel Sarabia
Elena Menyaylenko
Alessandro Toso
Skyler Seto
Zakaria Aldeneh
Shadi Pirhosseinloo
Luca Zappella
B. Theobald
N. Apostoloff
Jonathan Sheaffer
189
13
0
18 Aug 2023
Multi-goal Audio-visual Navigation using Sound Direction Map
Multi-goal Audio-visual Navigation using Sound Direction MapIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Haruo Kondoh
Asako Kanezaki
338
10
0
01 Aug 2023
Enhanced Neural Beamformer with Spatial Information for Target Speech
  Extraction
Enhanced Neural Beamformer with Spatial Information for Target Speech ExtractionAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2023
Aoqi Guo
Junnan Wu
Peng Gao
Wenbo Zhu
Qinwen Guo
Dazhi Gao
Yujun Wang
167
1
0
28 Jun 2023
How far generated data can impact Neural Networks performance?
How far generated data can impact Neural Networks performance?VISIGRAPP (VISIGRAPP), 2023
Sayeh Gholipour Picha
D. Chanti
A. Caplier
CVBM
148
1
0
27 Mar 2023
Improving trajectory localization accuracy via direction-of-arrival
  derivative estimation
Improving trajectory localization accuracy via direction-of-arrival derivative estimation
Ruchi Pandey
Shreya Jaiswal
Huy P Phan
S. Nannuru
234
0
0
07 Dec 2022
Assisted RTF-Vector-Based Binaural Direction of Arrival Estimation
  Exploiting a Calibrated External Microphone Array
Assisted RTF-Vector-Based Binaural Direction of Arrival Estimation Exploiting a Calibrated External Microphone ArrayIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Daniel Fejgin
Simon Doclo
207
5
0
30 Nov 2022
How to (virtually) train your speaker localizer
How to (virtually) train your speaker localizerInterspeech (Interspeech), 2022
Prerak Srivastava
Antoine Deleforge
Archontis Politis
Emmanuel Vincent
257
8
0
30 Nov 2022
Position tracking of a varying number of sound sources with sliding
  permutation invariant training
Position tracking of a varying number of sound sources with sliding permutation invariant trainingEuropean Signal Processing Conference (EUSIPCO), 2022
David Diaz-Guerra
Archontis Politis
Maria Sandsten
233
7
0
26 Oct 2022
Deep Learning Based Stage-wise Two-dimensional Speaker Localization with
  Large Ad-hoc Microphone Arrays
Deep Learning Based Stage-wise Two-dimensional Speaker Localization with Large Ad-hoc Microphone ArraysSpeech Communication (Speech Commun.), 2022
Shupei Liu
Linfeng Feng
Yijun Gong
Chengdong Liang
Chen Zhang
Xiao-Lei Zhang
Xuelong Li
299
4
0
19 Oct 2022
End-to-end Two-dimensional Sound Source Localization With Ad-hoc
  Microphone Arrays
End-to-end Two-dimensional Sound Source Localization With Ad-hoc Microphone ArraysAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022
Yijun Gong
Shupei Liu
Xiao-Lei Zhang
175
13
0
16 Oct 2022
Enemy Spotted: in-game gun sound dataset for gunshot classification and
  localization
Enemy Spotted: in-game gun sound dataset for gunshot classification and localization
Junwoo Park
Youngwoo Cho
Gyuhyeon Sim
Hojoon Lee
Jaegul Choo
224
11
0
12 Oct 2022
Vision+X: A Survey on Multimodal Learning in the Light of Data
Vision+X: A Survey on Multimodal Learning in the Light of DataIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Ye Zhu
Yuehua Wu
Andrii Zadaianchuk
Yan Yan
489
43
0
05 Oct 2022
Sound Event Localization and Detection for Real Spatial Sound Scenes:
  Event-Independent Network and Data Augmentation Chains
Sound Event Localization and Detection for Real Spatial Sound Scenes: Event-Independent Network and Data Augmentation ChainsWorkshop on Detection and Classification of Acoustic Scenes and Events (DCASE), 2022
Jinbo Hu
Yin Cao
Ming Wu
Qiuqiang Kong
Feiran Yang
Mark D. Plumbley
J. Yang
343
16
0
05 Sep 2022
Extending GCC-PHAT using Shift Equivariant Neural Networks
Extending GCC-PHAT using Shift Equivariant Neural NetworksInterspeech (Interspeech), 2022
Axel Berg
Mark O'Connor
Kalle Åström
Magnus Oskarsson
260
15
0
09 Aug 2022
A physically-informed Deep-Learning approach for locating sources in a
  waveguide
A physically-informed Deep-Learning approach for locating sources in a waveguideJournal of the Acoustical Society of America (JASA), 2022
Adar Kahana
Symeon Papadimitropoulos
Eli Turkel
Dmitry Batenkov
190
5
0
07 Aug 2022
AID: Open-source Anechoic Interferer Dataset
AID: Open-source Anechoic Interferer DatasetInternational Workshop on Acoustic Signal Enhancement (IWAENC), 2022
Philipp Götz
Cagdas Tuna
Andreas Walther
Emanuel Habets
190
4
0
05 Aug 2022
SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning
SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic LearningNeural Information Processing Systems (NeurIPS), 2022
Changan Chen
Carl Schissler
Sanchit Garg
Philip Kobernik
Alexander Clegg
P. Calamia
Dhruv Batra
Philip Robinson
Kristen Grauman
3DGS
399
124
0
16 Jun 2022
Neural network for multi-exponential sound energy decay analysis
Neural network for multi-exponential sound energy decay analysisJournal of the Acoustical Society of America (JASA), 2022
Georg Götz
Ricardo Falcón Pérez
Sebastian J. Schlecht
V. Pulkki
180
30
0
19 May 2022
MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D
  Scenes
MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D ScenesACM Multimedia (ACM MM), 2022
Anton Ratnarajah
Zhenyu Tang
R. Aralikatti
Tianyi Zhou
AI4CE
397
52
0
18 May 2022
Uncertainty-Based Non-Parametric Active Peak Detection
Uncertainty-Based Non-Parametric Active Peak DetectionInternational Symposium on Information Theory (ISIT), 2022
Praneeth Narayanamurthy
U. Mitra
161
0
0
05 May 2022
GWA: A Large High-Quality Acoustic Dataset for Audio Processing
GWA: A Large High-Quality Acoustic Dataset for Audio ProcessingInternational Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), 2022
Zhenyu Tang
R. Aralikatti
Anton Ratnarajah
Tianyi Zhou
433
48
0
04 Apr 2022
Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNs
Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
David Diaz-Guerra
A. Miguel
J. R. Beltrán
418
34
0
31 Mar 2022
Echo-enabled Direction-of-Arrival and range estimation of a mobile
  source in Ambisonic domain
Echo-enabled Direction-of-Arrival and range estimation of a mobile source in Ambisonic domainEuropean Signal Processing Conference (EUSIPCO), 2022
J. Daniel
Srdan Kitic
203
7
0
10 Mar 2022
Locate This, Not That: Class-Conditioned Sound Event DOA Estimation
Locate This, Not That: Class-Conditioned Sound Event DOA EstimationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Olga Slizovskaia
Gordon Wichern
Zhong-Qiu Wang
Jonathan Le Roux
320
6
0
08 Mar 2022
Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization
Egocentric Deep Multi-Channel Audio-Visual Active Speaker LocalizationComputer Vision and Pattern Recognition (CVPR), 2022
Hao Jiang
Calvin Murdock
V. Ithapu
EgoV
324
49
0
06 Jan 2022
DA-MUSIC: Data-Driven DoA Estimation via Deep Augmented MUSIC Algorithm
DA-MUSIC: Data-Driven DoA Estimation via Deep Augmented MUSIC AlgorithmIEEE Transactions on Vehicular Technology (IEEE Trans. Veh. Technol.), 2021
Julian P Merkofer
Guy Revach
Stefano Rini
T. Routtenberg
Ruud J. G. van Sloun
408
91
0
22 Sep 2021
12
Next
Page 1 of 2