v1v2v3 (latest)

A Survey of Sound Source Localization with Deep Learning Methods

Journal of the Acoustical Society of America (JASA), 2021

8 September 2021

Pierre-Amaury Grumiaux

Papers citing "A Survey of Sound Source Localization with Deep Learning Methods"

50 / 52 papers shown

Acoustic Imaging for UAV Detection: Dense Beamformed Energy Maps and U-Net SELD

Belman Jahir Rodriguez

Sergio Chevtchenko

Marcelo Herrera Martinez

Yeshwant Bethy

Saeed Afshar

161

30 Mar 2026

DeepASA: An Object-Oriented Multi-Purpose Network for Auditory Scene Analysis

Dongheon Lee

Younghoo Kwon

Jung-Woo Choi

349

24 Dec 2025

Systematic Evaluation of Time-Frequency Features for Binaural Sound Source Localization

164

17 Nov 2025

Multi-Speaker DOA Estimation in Binaural Hearing Aids using Deep Learning and Speaker Count Fusion

Farnaz Jazaeri

Homayoun Kamkar-Parsi

François Grondin

M. Bouchard

128

23 Sep 2025

Towards Low-Latency Tracking of Multiple Speakers With Short-Context Speaker Embeddings

Taous Iatariene

Alexandre Guérin

Romain Serizel

185

18 Aug 2025

From Waveforms to Pixels: A Survey on Audio-Visual Segmentation

Jia Li

Yapeng Tian

VOS

264

29 Jul 2025

Single-Microphone-Based Sound Source Localization for Mobile Robots in Reverberant Environments

195

19 Jun 2025

Tracking of Intermittent and Moving Speakers : Dataset and Metrics

Taous Iatariene

Alexandre Guérin

Romain Serizel

328

11 Jun 2025

CNN-based Robust Sound Source Localization with SRP-PHAT for the Extreme EdgeACM Transactions on Embedded Computing Systems (TECS), 2023

Jun Yin

Marian Verhelst

330

03 Mar 2025

PSELDNets: Pre-trained Neural Networks on a Large-scale Synthetic Dataset for Sound Event Localization and DetectionIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2024

437

10 Nov 2024

SonicSim: A customizable simulation platform for speech processing in moving sound source scenariosInternational Conference on Learning Representations (ICLR), 2024

Kai Li

361

02 Oct 2024

Conformal Prediction for Manifold-based Source Localization with Gaussian Processes

Vadim Rozenfeld

Bracha Laufer Goldshtein

347

18 Sep 2024

wav2pos: Sound Source Localization using Masked AutoencodersInternational Conference on Indoor Positioning and Indoor Navigation (IPIN), 2024

Karl Åström

226

28 Aug 2024

Computer Audition: From Task-Specific Machine Learning to Foundation Models

Andreas Triantafyllopoulos

474

22 Jul 2024

RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization

Yabo Wang

Xiaofei Li

257

28 Jun 2024

Winner-takes-all learners are geometry-aware conditional density estimatorsInternational Conference on Machine Learning (ICML), 2024

Gael Richard

Patrick Pérez

273

07 Jun 2024

The Neural-SRP method for positional sound source localizationAsilomar Conference on Signals, Systems and Computers (ACSSC), 2023

224

14 Mar 2024

Enhanced Sound Event Localization and Detection in Real 360-degree audio-visual soundscapes

Adrian S. Roman

Baladithya Balamurugan

Rithik Pothuganti

163

29 Jan 2024

Selective-Memory Meta-Learning with Environment Representations for Sound Event Localization and Detection

207

27 Dec 2023

Sound Source Localization for a Source inside a Structure using Ac-CycleGAN

S. Kita

Choong Sik Park

Y. Kajikawa

232

08 Dec 2023

Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysisNeural Information Processing Systems (NeurIPS), 2023

364

02 Nov 2023

Study of speaker localization with binaural microphone array incorporating auditory filters and lateral angle estimationApplied Acoustics (Appl. Acoust.), 2023

Yanir Maymon

Israel Nelken

B. Rafaely

232

31 Oct 2023

Sound Source Distance Estimation in Diverse and Dynamic Acoustic ConditionsIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023

Saksham Singh Kushwaha

Irán R. Román

Magdalena Fuentes

J. P. Bello

158

17 Sep 2023

A Real-Time Active Speaker Detection System Integrating an Audio-Visual Signal with a Spatial Querying MechanismIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

I. Gurvich

Ido Leichter

Dharmendar Reddy Palle

236

15 Sep 2023

Spatial LibriSpeech: An Augmented Dataset for Spatial Audio LearningInterspeech (Interspeech), 2023

Miguel Sarabia

Luca Zappella

189

18 Aug 2023

Multi-goal Audio-visual Navigation using Sound Direction MapIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023

Haruo Kondoh

Asako Kanezaki

338

01 Aug 2023

Enhanced Neural Beamformer with Spatial Information for Target Speech ExtractionAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2023

Yujun Wang

167

28 Jun 2023

How far generated data can impact Neural Networks performance?VISIGRAPP (VISIGRAPP), 2023

Sayeh Gholipour Picha

D. Chanti

A. Caplier

CVBM

148

27 Mar 2023

Improving trajectory localization accuracy via direction-of-arrival derivative estimation

234

07 Dec 2022

Assisted RTF-Vector-Based Binaural Direction of Arrival Estimation Exploiting a Calibrated External Microphone ArrayIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Daniel Fejgin

Simon Doclo

207

30 Nov 2022

How to (virtually) train your speaker localizerInterspeech (Interspeech), 2022

Prerak Srivastava

Antoine Deleforge

Archontis Politis

Emmanuel Vincent

257

30 Nov 2022

Position tracking of a varying number of sound sources with sliding permutation invariant trainingEuropean Signal Processing Conference (EUSIPCO), 2022

David Diaz-Guerra

Archontis Politis

Maria Sandsten

233

26 Oct 2022

Deep Learning Based Stage-wise Two-dimensional Speaker Localization with Large Ad-hoc Microphone ArraysSpeech Communication (Speech Commun.), 2022

Xuelong Li

299

19 Oct 2022

End-to-end Two-dimensional Sound Source Localization With Ad-hoc Microphone ArraysAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022

Yijun Gong

Shupei Liu

Xiao-Lei Zhang

175

16 Oct 2022

Enemy Spotted: in-game gun sound dataset for gunshot classification and localization

224

12 Oct 2022

Vision+X: A Survey on Multimodal Learning in the Light of DataIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Ye Zhu

Yuehua Wu

Andrii Zadaianchuk

Yan Yan

489

05 Oct 2022

Sound Event Localization and Detection for Real Spatial Sound Scenes: Event-Independent Network and Data Augmentation ChainsWorkshop on Detection and Classification of Acoustic Scenes and Events (DCASE), 2022

343

05 Sep 2022

Extending GCC-PHAT using Shift Equivariant Neural NetworksInterspeech (Interspeech), 2022

Axel Berg

Mark O'Connor

Kalle Åström

Magnus Oskarsson

260

09 Aug 2022

A physically-informed Deep-Learning approach for locating sources in a waveguideJournal of the Acoustical Society of America (JASA), 2022

Adar Kahana

Symeon Papadimitropoulos

Eli Turkel

Dmitry Batenkov

190

07 Aug 2022

AID: Open-source Anechoic Interferer DatasetInternational Workshop on Acoustic Signal Enhancement (IWAENC), 2022

190

05 Aug 2022

SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic LearningNeural Information Processing Systems (NeurIPS), 2022

399

124

16 Jun 2022

Neural network for multi-exponential sound energy decay analysisJournal of the Acoustical Society of America (JASA), 2022

Georg Götz

Ricardo Falcón Pérez

Sebastian J. Schlecht

V. Pulkki

180

19 May 2022

MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D ScenesACM Multimedia (ACM MM), 2022

397

18 May 2022

Uncertainty-Based Non-Parametric Active Peak DetectionInternational Symposium on Information Theory (ISIT), 2022

Praneeth Narayanamurthy

U. Mitra

161

05 May 2022

GWA: A Large High-Quality Acoustic Dataset for Audio ProcessingInternational Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), 2022

433

04 Apr 2022

Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

David Diaz-Guerra

A. Miguel

J. R. Beltrán

418

31 Mar 2022

Echo-enabled Direction-of-Arrival and range estimation of a mobile source in Ambisonic domainEuropean Signal Processing Conference (EUSIPCO), 2022

J. Daniel

Srdan Kitic

203

10 Mar 2022

Locate This, Not That: Class-Conditioned Sound Event DOA EstimationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

320

08 Mar 2022

Egocentric Deep Multi-Channel Audio-Visual Active Speaker LocalizationComputer Vision and Pattern Recognition (CVPR), 2022

324

06 Jan 2022

DA-MUSIC: Data-Driven DoA Estimation via Deep Augmented MUSIC AlgorithmIEEE Transactions on Vehicular Technology (IEEE Trans. Veh. Technol.), 2021

408

22 Sep 2021