ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1807.11094
  4. Cited By
Towards End-to-End Acoustic Localization using Deep Learning: from Audio
  Signal to Source Position Coordinates

Towards End-to-End Acoustic Localization using Deep Learning: from Audio Signal to Source Position Coordinates

29 July 2018
Juan Manuel Vera-Diaz
Daniel Pizarro-Perez
Javier Macias-Guarasa
ArXiv (abs)PDFHTML

Papers citing "Towards End-to-End Acoustic Localization using Deep Learning: from Audio Signal to Source Position Coordinates"

28 / 28 papers shown
Title
CNN-based Robust Sound Source Localization with SRP-PHAT for the Extreme Edge
Jun Yin
Marian Verhelst
195
1
0
03 Mar 2025
SONNET: Enhancing Time Delay Estimation by Leveraging Simulated Audio
SONNET: Enhancing Time Delay Estimation by Leveraging Simulated Audio
Erik Tegler
Magnus Oskarsson
Kalle Åström
111
0
0
20 Nov 2024
wav2pos: Sound Source Localization using Masked Autoencoders
wav2pos: Sound Source Localization using Masked Autoencoders
Axel Berg
Jens Gulin
Mark O'Connor
Chuteng Zhou
Karl Åström
Magnus Oskarsson
64
2
0
28 Aug 2024
Interpreting End-to-End Deep Learning Models for Speech Source
  Localization Using Layer-wise Relevance Propagation
Interpreting End-to-End Deep Learning Models for Speech Source Localization Using Layer-wise Relevance Propagation
Luca Comanducci
Fabio Antonacci
Augusto Sarti
63
1
0
04 Apr 2024
Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Jinzheng Zhao
Yong-mei Xu
Xinyuan Qian
Davide Berghi
Peipei Wu
Meng Cui
Jianyuan Sun
Philip J. B. Jackson
Wenwu Wang
BDL
128
7
0
23 Oct 2023
Audio Visual Speaker Localization from EgoCentric Views
Audio Visual Speaker Localization from EgoCentric Views
Jinzheng Zhao
Yong-mei Xu
Xinyuan Qian
Wenwu Wang
EgoV
66
5
0
28 Sep 2023
Sound field decomposition based on two-stage neural networks
Sound field decomposition based on two-stage neural networks
Ryo Matsuda
Makoto Otani
33
0
0
13 Sep 2023
Dual input neural networks for positional sound source localization
Dual input neural networks for positional sound source localization
Eric Grinstein
Vincent W. Neo
Patrick A. Naylor
42
6
0
08 Aug 2023
Deep Learning Based Stage-wise Two-dimensional Speaker Localization with
  Large Ad-hoc Microphone Arrays
Deep Learning Based Stage-wise Two-dimensional Speaker Localization with Large Ad-hoc Microphone Arrays
Shupei Liu
Linfeng Feng
Yijun Gong
Chengdong Liang
Chen Zhang
Xiao-Lei Zhang
Xuelong Li
89
3
0
19 Oct 2022
Extending GCC-PHAT using Shift Equivariant Neural Networks
Extending GCC-PHAT using Shift Equivariant Neural Networks
Axel Berg
Mark O'Connor
Kalle Åström
Magnus Oskarsson
55
12
0
09 Aug 2022
Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNs
Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNs
David Diaz-Guerra
A. Miguel
J. R. Beltrán
107
22
0
31 Mar 2022
A Deep Reinforcement Learning Approach for Audio-based Navigation and
  Audio Source Localization in Multi-speaker Environments
A Deep Reinforcement Learning Approach for Audio-based Navigation and Audio Source Localization in Multi-speaker Environments
Petros Giannakopoulos
Aggelos Pikrakis
Y. Cotronis
143
3
0
25 Oct 2021
Emergency Vehicles Audio Detection and Localization in Autonomous
  Driving
Emergency Vehicles Audio Detection and Localization in Autonomous Driving
Hongyi Sun
Xinyi Liu
Kecheng Xu
Jinghao Miao
Qi Luo
59
20
0
30 Sep 2021
A Survey of Sound Source Localization with Deep Learning Methods
A Survey of Sound Source Localization with Deep Learning Methods
Pierre-Amaury Grumiaux
Srdjan Kitić
Laurent Girin
Alexandre Guérin
80
257
0
08 Sep 2021
SALADnet: Self-Attentive multisource Localization in the Ambisonics
  Domain
SALADnet: Self-Attentive multisource Localization in the Ambisonics Domain
Pierre-Amaury Grumiaux
Srdan Kitic
Prerak Srivastava
Laurent Girin
Alexandre Guérin
39
8
0
23 Jul 2021
A Deep Reinforcement Learning Approach to Audio-Based Navigation in a
  Multi-Speaker Environment
A Deep Reinforcement Learning Approach to Audio-Based Navigation in a Multi-Speaker Environment
Petros Giannakopoulos
A. Pikrakis
Y. Cotronis
72
7
0
10 May 2021
Improved feature extraction for CRNN-based multiple sound source
  localization
Improved feature extraction for CRNN-based multiple sound source localization
Pierre-Amaury Grumiaux
Srdan Kitic
Laurent Girin
Alexandre Guérin
44
23
0
05 May 2021
BeamLearning: an end-to-end Deep Learning approach for the angular
  localization of sound sources using raw multichannel acoustic pressure data
BeamLearning: an end-to-end Deep Learning approach for the angular localization of sound sources using raw multichannel acoustic pressure data
Hadrien Pujol
Éric Bavu
Alexandre Garcia
88
22
0
27 Apr 2021
Data-Efficient Framework for Real-world Multiple Sound Source 2D
  Localization
Data-Efficient Framework for Real-world Multiple Sound Source 2D Localization
G. L. Moing
Phongtharin Vinayavekhin
Don Joven Agravante
Tadanobu Inoue
J. Vongkulbhisal
Asim Munawar
Ryuki Tachibana
60
9
0
10 Dec 2020
Ensemble of Discriminators for Domain Adaptation in Multiple Sound
  Source 2D Localization
Ensemble of Discriminators for Domain Adaptation in Multiple Sound Source 2D Localization
G. L. Moing
Don Joven Agravante
Tadanobu Inoue
J. Vongkulbhisal
Asim Munawar
Ryuki Tachibana
Phongtharin Vinayavekhin
27
1
0
10 Dec 2020
Learning Multiple Sound Source 2D Localization
Learning Multiple Sound Source 2D Localization
G. L. Moing
Phongtharin Vinayavekhin
Tadanobu Inoue
J. Vongkulbhisal
Asim Munawar
Ryuki Tachibana
Don Joven Agravante
52
13
0
10 Dec 2020
FCN Approach for Dynamically Locating Multiple Speakers
FCN Approach for Dynamically Locating Multiple Speakers
Hodaya Hammer
Shlomo E. Chazan
Jacob Goldberger
Sharon Gannot
51
3
0
26 Aug 2020
Sound Event Localization and Detection using Squeeze-Excitation Residual
  CNNs
Sound Event Localization and Detection using Squeeze-Excitation Residual CNNs
Javier Naranjo-Alcazar
Sergi Perez-Castanos
J. Ferrandis
P. Zuccarello
M. Cobos
69
12
0
25 Jun 2020
Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural
  Networks
Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks
David Diaz-Guerra
A. Miguel
J. R. Beltrán
93
89
0
16 Jun 2020
Time Difference of Arrival Estimation from Frequency-Sliding Generalized
  Cross-Correlations Using Convolutional Neural Networks
Time Difference of Arrival Estimation from Frequency-Sliding Generalized Cross-Correlations Using Convolutional Neural Networks
Luca Comanducci
M. Cobos
Fabio Antonacci
Augusto Sarti
64
27
0
03 Feb 2020
Sound source detection, localization and classification using
  consecutive ensemble of CRNN models
Sound source detection, localization and classification using consecutive ensemble of CRNN models
Slawomir Kapka
M. Lewandowski
122
66
0
02 Aug 2019
Regression and Classification for Direction-of-Arrival Estimation with
  Convolutional Recurrent Neural Networks
Regression and Classification for Direction-of-Arrival Estimation with Convolutional Recurrent Neural Networks
Zhenyu Tang
John D. Kanu
Kevin Hogan
Tianyi Zhou
88
43
0
17 Apr 2019
Listening for Sirens: Locating and Classifying Acoustic Alarms in City
  Scenes
Listening for Sirens: Locating and Classifying Acoustic Alarms in City Scenes
Letizia Marchegiani
Paul Newman
57
37
0
11 Oct 2018
1