Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks

30 June 2018
Sharath Adavanne
Archontis Politis
Joonas Nikunen
Tuomas Virtanen

Papers citing "Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks"

50 / 89 papers shown
Stereo sound event localization and detection based on PSELDnet pretraining and BiMamba sequence modeling
Wenmiao Gao
Yang Xiao
Mamba
38
0
0
16 Jun 2025
Spatial Audio Processing with Large Language Model on Wearable Devices
Ayushi Mishra
Yang Bai
Priyadarshan Narayanasamy
Nakul Garg
Nirupam Roy
107
1
0
11 Apr 2025
Location-Oriented Sound Event Localization and Detection with Spatial Mapping and Regression Localization
Xueping Zhang
Yaxiong Chen
Ruilin Yao
Yunfei Zi
Shengwu Xiong
84
0
0
11 Apr 2025
CNN-based Robust Sound Source Localization with SRP-PHAT for the Extreme Edge
Jun Yin
Marian Verhelst
195
1
0
03 Mar 2025
PSELDNets: Pre-trained Neural Networks on a Large-scale Synthetic Dataset for Sound Event Localization and Detection
Jinbo Hu
Yin Cao
Ming Wu
Fang Kang
Feiran Yang
Wenwu Wang
Mark D. Plumbley
J. Yang
68
1
0
10 Nov 2024
Exploring Text-Queried Sound Event Detection with Audio Source Separation
Han Yin
Jisheng Bai
Yang Xiao
Hui Wang
Siqi Zheng
Yafeng Chen
Rohan Kumar Das
Chong Deng
Jianfeng Chen
106
4
0
20 Sep 2024
Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis
Masahiro Yasuda
Noboru Harada
Yasunori Ohishi
Shoichiro Saito
Akira Nakayama
Nobutaka Ono
94
4
0
12 Apr 2024
BAT: Learning to Reason about Spatial Sounds with Large Language Models
Zhisheng Zheng
Puyuan Peng
Ziyang Ma
Xie Chen
Eunsol Choi
David Harwath
LRM
125
19
0
02 Feb 2024
Enhanced Sound Event Localization and Detection in Real 360-degree audio-visual soundscapes
Adrian S. Roman
Baladithya Balamurugan
Rithik Pothuganti
67
5
0
29 Jan 2024
Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Jinzheng Zhao
Yong-mei Xu
Xinyuan Qian
Davide Berghi
Peipei Wu
Meng Cui
Jianyuan Sun
Philip J. B. Jackson
Wenwu Wang
BDL
128
7
0
23 Oct 2023
Improving trajectory localization accuracy via direction-of-arrival derivative estimation
Ruchi Pandey
Shreya Jaiswal
Huy P Phan
S. Nannuru
70
0
0
07 Dec 2022
Panoramic Video Salient Object Detection with Ambisonic Audio Guidance
Xiang Li
H. Cao
Shijie Zhao
Junlin Li
Li Zhang
Bhiksha Raj
92
13
0
26 Nov 2022
Position tracking of a varying number of sound sources with sliding permutation invariant training
David Diaz-Guerra
Archontis Politis
Tuomas Virtanen
68
6
0
26 Oct 2022
CoLoC: Conditioned Localizer and Classifier for Sound Event Localization and Detection
Slawomir Kapka
J. Tkaczuk
52
0
0
25 Oct 2022
Neural Sound Field Decomposition with Super-resolution of Sound Direction
Qiuqiang Kong
Shilei Liu
Junjie Shi
Xuzhou Ye
Yin Cao
Qiaoxi Zhu
Yong-mei Xu
Yuxuan Wang
56
0
0
22 Oct 2022
Deep Learning Based Stage-wise Two-dimensional Speaker Localization with Large Ad-hoc Microphone Arrays
Shupei Liu
Linfeng Feng
Yijun Gong
Chengdong Liang
Chen Zhang
Xiao-Lei Zhang
Xuelong Li
83
3
0
19 Oct 2022
Binaural Signal Representations for Joint Sound Event Detection and Acoustic Scene Classification
D. Krause
A. Mesaros
33
3
0
13 Sep 2022
Sound Event Localization and Detection for Real Spatial Sound Scenes: Event-Independent Network and Data Augmentation Chains
Jinbo Hu
Yin Cao
Ming Wu
Qiuqiang Kong
Feiran Yang
Mark D. Plumbley
J. Yang
86
10
0
05 Sep 2022
Data Augmentation and Squeeze-and-Excitation Network on Multiple Dimension for Sound Event Localization and Detection in Real Scenes
Byeongil Ko
Hyeonuk Nam
Seong-Hu Kim
D. Min
Seung-Deok Choi
Yong-Hwa Park
77
7
0
24 Jun 2022
STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
Archontis Politis
Kazuki Shimada
Parthasaarathy Sudarsanam
Sharath Adavanne
D. Krause
Yuichiro Koyama
Naoya Takahashi
Shusuke Takahashi
Yuki Mitsufuji
Tuomas Virtanen
147
64
0
04 Jun 2022
Sonic Interactions in Virtual Environments: the Egocentric Audio Perspective of the Digital Twin
M. Geronazzo
Stefania Serafin
27
6
0
21 Apr 2022
Dual Quaternion Ambisonics Array for Six-Degree-of-Freedom Acoustic Representation
Eleonora Grassucci
Gioia Mancini
Christian Brignone
A. Uncini
Danilo Comminiello
72
16
0
04 Apr 2022
Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNs
David Diaz-Guerra
A. Miguel
J. R. Beltrán
107
22
0
31 Mar 2022
A Track-Wise Ensemble Event Independent Network for Polyphonic Sound Event Localization and Detection
Jinbo Hu
Yin Cao
Ming Wu
Qiuqiang Kong
Feiran Yang
Mark D. Plumbley
J. Yang
64
23
0
19 Mar 2022
Locate This, Not That: Class-Conditioned Sound Event DOA Estimation
Olga Slizovskaia
Gordon Wichern
Zhong-Qiu Wang
Jonathan Le Roux
59
4
0
08 Mar 2022
L3DAS22 Challenge: Learning 3D Audio Sources in a Real Office Environment
E. Guizzo
Christian Marinoni
Marco Pennese
Xinlei Ren
Xiguang Zheng
Chen Zhang
Bruno Masiero
A. Uncini
Danilo Comminiello
113
54
0
21 Feb 2022
L-SpEx: Localized Target Speaker Extraction
Meng Ge
Chenglin Xu
Longbiao Wang
Eng Siong Chng
Jianwu Dang
Haizhou Li
52
24
0
21 Feb 2022
Multi-view and Multi-modal Event Detection Utilizing Transformer-based Multi-sensor fusion
Masahiro Yasuda
Yasunori Ohishi
Shoichiro Saito
Noboru Harada
74
13
0
18 Feb 2022
Wearable SELD dataset: Dataset for sound event localization and detection using wearable devices around head
Kento Nagatomo
Masahiro Yasuda
Kohei Yatabe
Shoichiro Saito
Yasuhiro Oikawa
133
9
0
17 Feb 2022
SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization
Bing Yang
Hong Liu
Xiaofei Li
98
20
0
16 Feb 2022
Visual Sound Localization in the Wild by Cross-Modal Interference Erasing
Xian Liu
Rui Qian
Hang Zhou
Di Hu
Weiyao Lin
Ziwei Liu
Bolei Zhou
Xiaowei Zhou
62
26
0
13 Feb 2022
Self-Supervised Moving Vehicle Detection from Audio-Visual Cues
Jannik Zürn
Wolfram Burgard
SSL
87
8
0
30 Jan 2022
Signal-Aware Direction-of-Arrival Estimation Using Attention Mechanisms
Wolfgang Mack
Julian Wechsler
Emanuel Habets
124
11
0
03 Jan 2022
End-to-end Alexa Device Arbitration
Jarred Barber
Yifeng Fan
Tao Zhang
46
3
0
08 Dec 2021
SALSA-Lite: A Fast and Effective Feature for Polyphonic Sound Event Localization and Detection with Microphone Arrays
Thi Ngoc Tho Nguyen
Douglas L. Jones
Karn N. Watcharasupat
Huy P Phan
W. Gan
70
37
0
16 Nov 2021
Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers
Sharath Adavanne
Archontis Politis
Tuomas Virtanen
72
17
0
29 Oct 2021
Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training
Kazuki Shimada
Yuichiro Koyama
Shusuke Takahashi
Naoya Takahashi
E. Tsunoo
Yuki Mitsufuji
70
66
0
14 Oct 2021
Spatial mixup: Directional loudness modification as data augmentation for sound event localization and detection
Ricardo Falcón Pérez
Kazuki Shimada
Yuichiro Koyama
Shusuke Takahashi
Yuki Mitsufuji
128
5
0
12 Oct 2021
Direct source and early reflections localization using deep deconvolution network under reverberant environment
Shan Gao
Xihong Wu
T. Qu
18
0
0
10 Oct 2021
PHNNs: Lightweight Neural Networks via Parameterized Hypercomplex Convolutions
Eleonora Grassucci
Aston Zhang
Danilo Comminiello
83
37
0
08 Oct 2021
SALSA: Spatial Cue-Augmented Log-Spectrogram Features for Polyphonic Sound Event Localization and Detection
Thi Ngoc Tho Nguyen
Karn N. Watcharasupat
Ngoc Khanh Nguyen
Douglas L. Jones
W. Gan
67
49
0
01 Oct 2021
A Few-Shot Learning Approach for Sound Source Distance Estimation Using Relation Networks
Amirreza Sobhdel
R. Razavi-Far
75
4
0
22 Sep 2021
A Survey of Sound Source Localization with Deep Learning Methods
Pierre-Amaury Grumiaux
Srdjan Kitić
Laurent Girin
Alexandre Guérin
80
257
0
08 Sep 2021
Joint Direction and Proximity Classification of Overlapping Sound Events from Binaural Audio
D. Krause
Archontis Politis
A. Mesaros
44
8
0
26 Jul 2021
Assessment of Self-Attention on Learned Features For Sound Event Localization and Detection
Parthasaarathy Sudarsanam
Archontis Politis
Konstantinos Drossos
54
13
0
20 Jul 2021
DCASE 2021 Task 3: Spectrotemporally-aligned Features for Polyphonic Sound Event Localization and Detection
Thi Ngoc Tho Nguyen
Karn N. Watcharasupat
Ngoc Khanh Nguyen
Douglas L. Jones
W. Gan
70
16
0
29 Jun 2021
A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection
Archontis Politis
Sharath Adavanne
D. Krause
Antoine Deleforge
Prerak Srivastava
Tuomas Virtanen
75
68
0
13 Jun 2021
PILOT: Introducing Transformers for Probabilistic Sound Event Localization
C. Schymura
Benedikt T. Bönninghoff
Tsubasa Ochiai
Marc Delcroix
K. Kinoshita
Tomohiro Nakatani
S. Araki
D. Kolossa
63
24
0
07 Jun 2021
Refinement of Direction of Arrival Estimators by Majorization-Minimization Optimization on the Array Manifold
Robin Scheibler
M. Togami
26
3
0
02 Jun 2021
Weakly Supervised Source-Specific Sound Level Estimation in Noisy Soundscapes
Aurora Cramer
M. Cartwright
Fatemeh Pishdadian
J. P. Bello
58
2
0
06 May 2021