A Survey of Sound Source Localization with Deep Learning Methods


8 September 2021
Pierre-Amaury Grumiaux
Srdjan Kitić
Laurent Girin
Alexandre Guérin

Papers citing "A Survey of Sound Source Localization with Deep Learning Methods"

31 / 31 papers shown
Deep, data-driven modeling of room acoustics: literature review and research perspectives
Toon van Waterschoot
AI4CE
19
0
0
22 Apr 2025
CNN-based Robust Sound Source Localization with SRP-PHAT for the Extreme Edge
Jun Yin
Marian Verhelst
54
1
0
03 Mar 2025
Completing Sets of Prototype Transfer Functions for Subspace-based Direction of Arrival Estimation of Multiple Speakers
Daniel Fejgin
Simon Doclo
39
0
0
13 Jan 2025
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
Kai Li
Wendi Sang
Chang Zeng
Runxuan Yang
Guo Chen
Xiaolin Hu
26
2
0
02 Oct 2024
Conformal Prediction for Manifold-based Source Localization with Gaussian Processes
Vadim Rozenfeld
Bracha Laufer Goldshtein
28
0
0
18 Sep 2024
RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization
Bing Yang
Changsheng Quan
Yabo Wang
Pengyu Wang
Yujie Yang
Ying Fang
Nian Shao
Hui Bu
Xin Xu
Xiaofei Li
38
5
0
28 Jun 2024
Sound Source Localization for a Source inside a Structure using Ac-CycleGAN
S. Kita
Choong Sik Park
Y. Kajikawa
24
0
0
08 Dec 2023
Feature Aggregation in Joint Sound Classification and Localization Neural Networks
Brendan Healy
Patrick McNamee
Z. Nili Ahmadabadi
31
0
0
29 Oct 2023
Permutation Invariant Recurrent Neural Networks for Sound Source Tracking Applications
David Diaz-Guerra
A. Politis
A. Miguel
J. R. Beltrán
Tuomas Virtanen
18
0
0
14 Jun 2023
Deep Learning Based Stage-wise Two-dimensional Speaker Localization with Large Ad-hoc Microphone Arrays
Shupei Liu
Linfeng Feng
Yijun Gong
Chengdong Liang
Chen Zhang
Xiao-Lei Zhang
Xuelong Li
16
3
0
19 Oct 2022
End-to-end Two-dimensional Sound Source Localization With Ad-hoc Microphone Arrays
Yijun Gong
Shupei Liu
Xiao-Lei Zhang
11
8
0
16 Oct 2022
Enemy Spotted: in-game gun sound dataset for gunshot classification and localization
Junwoo Park
Youngwoo Cho
Gyuhyeon Sim
Hojoon Lee
Jaegul Choo
18
7
0
12 Oct 2022
A convolutional plane wave model for sound field reconstruction
Manuel Hahmann
Efren Fernandez-Grande
23
6
0
24 Aug 2022
Extending GCC-PHAT using Shift Equivariant Neural Networks
Axel Berg
Mark O'Connor
Kalle Åström
Magnus Oskarsson
20
10
0
09 Aug 2022
A physically-informed Deep-Learning approach for locating sources in a waveguide
Adar Kahana
Symeon Papadimitropoulos
Eli Turkel
Dmitry Batenkov
21
3
0
07 Aug 2022
AID: Open-source Anechoic Interferer Dataset
Philipp Götz
Cagdas Tuna
Andreas Walther
Emanuel Habets
14
2
0
05 Aug 2022
SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning
Changan Chen
Carl Schissler
Sanchit Garg
Philip Kobernik
Alexander William Clegg
P. Calamia
Dhruv Batra
Philip Robinson
Kristen Grauman
3DGS
31
79
0
16 Jun 2022
MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D Scenes
Anton Ratnarajah
Zhenyu Tang
R. Aralikatti
Dinesh Manocha
AI4CE
10
35
0
18 May 2022
GWA: A Large High-Quality Acoustic Dataset for Audio Processing
Zhenyu Tang
R. Aralikatti
Anton Ratnarajah
Dinesh Manocha
27
31
0
04 Apr 2022
Echo-enabled Direction-of-Arrival and range estimation of a mobile source in Ambisonic domain
J. Daniel
Srdjan Kitić
26
7
0
10 Mar 2022
Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization
Hao Jiang
Calvin Murdock
V. Ithapu
EgoV
25
40
0
06 Jan 2022
BeamLearning: an end-to-end Deep Learning approach for the angular localization of sound sources using raw multichannel acoustic pressure data
Hadrien Pujol
Éric Bavu
Alexandre Garcia
42
22
0
27 Apr 2021
A Review of Speaker Diarization: Recent Advances with Deep Learning
Tae Jin Park
Naoyuki Kanda
Dimitrios Dimitriadis
Kyu Jeong Han
Shinji Watanabe
Shrikanth Narayanan
VLM
269
326
0
24 Jan 2021
TrackFormer: Multi-Object Tracking with Transformers
Tim Meinhardt
A. Kirillov
Laura Leal-Taixe
Christoph Feichtenhofer
VOT
218
742
0
07 Jan 2021
TransTrack: Multiple Object Tracking with Transformer
Pei Sun
Jinkun Cao
Yi-Xin Jiang
Rufeng Zhang
Enze Xie
Zehuan Yuan
Changhu Wang
Ping Luo
ViT
VOT
243
565
0
31 Dec 2020
Sound event localization and detection based on crnn using rectangular filters and channel rotation data augmentation
Francesca Ronchini
Daniel Arteaga
Andrés Pérez-López
13
9
0
13 Oct 2020
Self-supervised Neural Audio-Visual Sound Source Localization via Probabilistic Spatial Modeling
Yoshiki Masuyama
Yoshiaki Bando
Kohei Yatabe
Y. Sasaki
Masaki Onishi
Yasuhiro Oikawa
SSL
42
13
0
28 Jul 2020
High-Resolution Speaker Counting In Reverberant Rooms Using CRNN With Ambisonics Features
Pierre-Amaury Grumiaux
Srdjan Kitić
Laurent Girin
Alexandre Guérin
36
17
0
17 Mar 2020
UnOVOST: Unsupervised Offline Video Object Segmentation and Tracking
Jonathon Luiten
Idil Esen Zulfikar
Bastian Leibe
VOS
125
62
0
15 Jan 2020
DDSP: Differentiable Digital Signal Processing
Jesse Engel
Lamtharn Hantrakul
Chenjie Gu
Adam Roberts
DiffM
83
373
0
14 Jan 2020
Convolutional Neural Networks for Sentence Classification
Yoon Kim
AILaw
VLM
250
13,364
0
25 Aug 2014