Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2109.03465
Cited By
A Survey of Sound Source Localization with Deep Learning Methods
8 September 2021
Pierre-Amaury Grumiaux
Srdjan Kitić
Laurent Girin
Alexandre Guérin
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Survey of Sound Source Localization with Deep Learning Methods"
31 / 31 papers shown
Title
Deep, data-driven modeling of room acoustics: literature review and research perspectives
Toon van Waterschoot
AI4CE
16
0
0
22 Apr 2025
CNN-based Robust Sound Source Localization with SRP-PHAT for the Extreme Edge
Jun Yin
Marian Verhelst
54
1
0
03 Mar 2025
Completing Sets of Prototype Transfer Functions for Subspace-based Direction of Arrival Estimation of Multiple Speakers
Daniel Fejgin
Simon Doclo
39
0
0
13 Jan 2025
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
Kai Li
Wendi Sang
Chang Zeng
Runxuan Yang
Guo Chen
Xiaolin Hu
26
2
0
02 Oct 2024
Conformal Prediction for Manifold-based Source Localization with Gaussian Processes
Vadim Rozenfeld
Bracha Laufer Goldshtein
28
0
0
18 Sep 2024
RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization
Bing Yang
Changsheng Quan
Yabo Wang
Pengyu Wang
Yujie Yang
Ying Fang
Nian Shao
Hui Bu
Xin Xu
Xiaofei Li
38
5
0
28 Jun 2024
Sound Source Localization for a Source inside a Structure using Ac-CycleGAN
S. Kita
Choong Sik Park
Y. Kajikawa
24
0
0
08 Dec 2023
Feature Aggregation in Joint Sound Classification and Localization Neural Networks
Brendan Healy
Patrick McNamee
Z. Nili Ahmadabadi
31
0
0
29 Oct 2023
Permutation Invariant Recurrent Neural Networks for Sound Source Tracking Applications
David Diaz-Guerra
A. Politis
A. Miguel
J. R. Beltrán
Tuomas Virtanen
16
0
0
14 Jun 2023
Deep Learning Based Stage-wise Two-dimensional Speaker Localization with Large Ad-hoc Microphone Arrays
Shupei Liu
Linfeng Feng
Yijun Gong
Chengdong Liang
Chen Zhang
Xiao-Lei Zhang
Xuelong Li
16
3
0
19 Oct 2022
End-to-end Two-dimensional Sound Source Localization With Ad-hoc Microphone Arrays
Yijun Gong
Shupei Liu
Xiao-Lei Zhang
11
8
0
16 Oct 2022
Enemy Spotted: in-game gun sound dataset for gunshot classification and localization
Junwoo Park
Youngwoo Cho
Gyuhyeon Sim
Hojoon Lee
Jaegul Choo
16
7
0
12 Oct 2022
A convolutional plane wave model for sound field reconstruction
Manuel Hahmann
Efren Fernandez-Grande
23
6
0
24 Aug 2022
Extending GCC-PHAT using Shift Equivariant Neural Networks
Axel Berg
Mark O'Connor
Kalle Åström
Magnus Oskarsson
20
10
0
09 Aug 2022
A physically-informed Deep-Learning approach for locating sources in a waveguide
Adar Kahana
Symeon Papadimitropoulos
Eli Turkel
Dmitry Batenkov
19
3
0
07 Aug 2022
AID: Open-source Anechoic Interferer Dataset
Philipp Götz
Cagdas Tuna
Andreas Walther
Emanuel Habets
14
2
0
05 Aug 2022
SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning
Changan Chen
Carl Schissler
Sanchit Garg
Philip Kobernik
Alexander William Clegg
P. Calamia
Dhruv Batra
Philip Robinson
Kristen Grauman
3DGS
31
79
0
16 Jun 2022
MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D Scenes
Anton Ratnarajah
Zhenyu Tang
R. Aralikatti
Dinesh Manocha
AI4CE
10
35
0
18 May 2022
GWA: A Large High-Quality Acoustic Dataset for Audio Processing
Zhenyu Tang
R. Aralikatti
Anton Ratnarajah
Dinesh Manocha
27
31
0
04 Apr 2022
Echo-enabled Direction-of-Arrival and range estimation of a mobile source in Ambisonic domain
J. Daniel
Srdan Kitic
26
7
0
10 Mar 2022
Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization
Hao Jiang
Calvin Murdock
V. Ithapu
EgoV
25
40
0
06 Jan 2022
BeamLearning: an end-to-end Deep Learning approach for the angular localization of sound sources using raw multichannel acoustic pressure data
Hadrien Pujol
Éric Bavu
Alexandre Garcia
42
22
0
27 Apr 2021
A Review of Speaker Diarization: Recent Advances with Deep Learning
Tae Jin Park
Naoyuki Kanda
Dimitrios Dimitriadis
Kyu Jeong Han
Shinji Watanabe
Shrikanth Narayanan
VLM
269
326
0
24 Jan 2021
TrackFormer: Multi-Object Tracking with Transformers
Tim Meinhardt
A. Kirillov
Laura Leal-Taixe
Christoph Feichtenhofer
VOT
218
742
0
07 Jan 2021
TransTrack: Multiple Object Tracking with Transformer
Pei Sun
Jinkun Cao
Yi-Xin Jiang
Rufeng Zhang
Enze Xie
Zehuan Yuan
Changhu Wang
Ping Luo
ViT
VOT
241
565
0
31 Dec 2020
Sound event localization and detection based on crnn using rectangular filters and channel rotation data augmentation
Francesca Ronchini
Daniel Arteaga
Andrés Pérez-López
13
9
0
13 Oct 2020
Self-supervised Neural Audio-Visual Sound Source Localization via Probabilistic Spatial Modeling
Yoshiki Masuyama
Yoshiaki Bando
Kohei Yatabe
Y. Sasaki
Masaki Onishi
Yasuhiro Oikawa
SSL
42
13
0
28 Jul 2020
High-Resolution Speaker Counting In Reverberant Rooms Using CRNN With Ambisonics Features
Pierre-Amaury Grumiaux
Srdjan Kitic
Laurent Girin
Alexandre Guérin
36
17
0
17 Mar 2020
UnOVOST: Unsupervised Offline Video Object Segmentation and Tracking
Jonathon Luiten
Idil Esen Zulfikar
Bastian Leibe
VOS
125
62
0
15 Jan 2020
DDSP: Differentiable Digital Signal Processing
Jesse Engel
Lamtharn Hantrakul
Chenjie Gu
Adam Roberts
DiffM
83
373
0
14 Jan 2020
Convolutional Neural Networks for Sentence Classification
Yoon Kim
AILaw
VLM
250
13,364
0
25 Aug 2014
1