Polyphonic Sound Event Detection and Localization using a Two-Stage Strategy

1 May 2019
Yin Cao
Qiuqiang Kong
Turab Iqbal
Fengyan An
Wenwu Wang
Mark D. Plumbley

Papers citing "Polyphonic Sound Event Detection and Localization using a Two-Stage Strategy"

46 / 46 papers shown
PSELDNets: Pre-trained Neural Networks on a Large-scale Synthetic Dataset for Sound Event Localization and Detection
Jinbo Hu
Yin Cao
Ming Wu
Fang Kang
Feiran Yang
Wenwu Wang
Mark D. Plumbley
J. Yang
68
1
0
10 Nov 2024
Audio-Visual Talker Localization in Video for Spatial Sound Reproduction
Davide Berghi
Philip J. B. Jackson
78
0
0
01 Jun 2024
The Neural-SRP method for positional sound source localization
Eric Grinstein
Toon van Waterschoot
Mike Brookes
Patrick A. Naylor
54
2
0
14 Mar 2024
Leveraging Visual Supervision for Array-based Active Speaker Detection and Localization
Davide Berghi
Philip J. B. Jackson
64
5
0
21 Dec 2023
Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection
Davide Berghi
Peipei Wu
Jinzheng Zhao
Wenwu Wang
Philip J. B. Jackson
85
11
0
14 Dec 2023
Feature Aggregation in Joint Sound Classification and Localization Neural Networks
Brendan Healy
Patrick McNamee
Z. Nili Ahmadabadi
67
0
0
29 Oct 2023
Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Jinzheng Zhao
Yong-mei Xu
Xinyuan Qian
Davide Berghi
Peipei Wu
Meng Cui
Jianyuan Sun
Philip J. B. Jackson
Wenwu Wang
BDL
128
7
0
23 Oct 2023
LocSelect: Target Speaker Localization with an Auditory Selective Hearing Mechanism
Yu Chen
Xinyuan Qian
Zexu Pan
Kainan Chen
Haizhou Li
53
3
0
16 Oct 2023
Audio Visual Speaker Localization from EgoCentric Views
Jinzheng Zhao
Yong-mei Xu
Xinyuan Qian
Wenwu Wang
EgoV
66
5
0
28 Sep 2023
Exploring Self-Supervised Contrastive Learning of Spatial Sound Event Representation
Xilin Jiang
Cong Han
Yinghao Aaron Li
N. Mesgarani
SSL
83
1
0
27 Sep 2023
Dual input neural networks for positional sound source localization
Eric Grinstein
Vincent W. Neo
Patrick A. Naylor
42
6
0
08 Aug 2023
In search of strong embedding extractors for speaker diarisation
Jee-weon Jung
Hee-Soo Heo
Bong-Jin Lee
Jaesung Huh
A. Brown
Youngki Kwon
Shinji Watanabe
Joon Son Chung
83
16
0
26 Oct 2022
Binaural Signal Representations for Joint Sound Event Detection and Acoustic Scene Classification
D. Krause
A. Mesaros
33
3
0
13 Sep 2022
Polyphonic sound event detection for highly dense birdsong scenes
Alberto García Arroba Parrilla
D. Stowell
30
3
0
13 Jul 2022
Data Augmentation and Squeeze-and-Excitation Network on Multiple Dimension for Sound Event Localization and Detection in Real Scenes
Byeongil Ko
Hyeonuk Nam
Seong-Hu Kim
D. Min
Seung-Deok Choi
Yong-Hwa Park
77
7
0
24 Jun 2022
STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
Archontis Politis
Kazuki Shimada
Parthasaarathy Sudarsanam
Sharath Adavanne
D. Krause
Yuichiro Koyama
Naoya Takahashi
Shusuke Takahashi
Yuki Mitsufuji
Tuomas Virtanen
152
64
0
04 Jun 2022
Self-supervised Learning of Audio Representations from Audio-Visual Data using Spatial Alignment
Shanshan Wang
Archontis Politis
A. Mesaros
Tuomas Virtanen
SSL
42
7
0
02 Jun 2022
A Track-Wise Ensemble Event Independent Network for Polyphonic Sound Event Localization and Detection
Jinbo Hu
Yin Cao
Ming Wu
Qiuqiang Kong
Feiran Yang
Mark D. Plumbley
J. Yang
64
23
0
19 Mar 2022
L3DAS22 Challenge: Learning 3D Audio Sources in a Real Office Environment
E. Guizzo
Christian Marinoni
Marco Pennese
Xinlei Ren
Xiguang Zheng
Chen Zhang
Bruno Masiero
A. Uncini
Danilo Comminiello
113
54
0
21 Feb 2022
SALSA-Lite: A Fast and Effective Feature for Polyphonic Sound Event Localization and Detection with Microphone Arrays
Thi Ngoc Tho Nguyen
Douglas L. Jones
Karn N. Watcharasupat
Huy P Phan
W. Gan
70
37
0
16 Nov 2021
Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training
Kazuki Shimada
Yuichiro Koyama
Shusuke Takahashi
Naoya Takahashi
E. Tsunoo
Yuki Mitsufuji
72
66
0
14 Oct 2021
Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection
Yuichiro Koyama
Kazuhide Shigemi
Masafumi Takahashi
Kazuki Shimada
Naoya Takahashi
E. Tsunoo
Shusuke Takahashi
Yuki Mitsufuji
67
12
0
13 Oct 2021
Spatial mixup: Directional loudness modification as data augmentation for sound event localization and detection
Ricardo Falcón Pérez
Kazuki Shimada
Yuichiro Koyama
Shusuke Takahashi
Yuki Mitsufuji
128
5
0
12 Oct 2021
SALSA: Spatial Cue-Augmented Log-Spectrogram Features for Polyphonic Sound Event Localization and Detection
Thi Ngoc Tho Nguyen
Karn N. Watcharasupat
Ngoc Khanh Nguyen
Douglas L. Jones
W. Gan
71
49
0
01 Oct 2021
A Survey of Sound Source Localization with Deep Learning Methods
Pierre-Amaury Grumiaux
Srdjan Kitić
Laurent Girin
Alexandre Guérin
80
257
0
08 Sep 2021
SLoClas: A Database for Joint Sound Localization and Classification
Xinyuan Qian
Bidisha Sharma
Amine El Abridi
Haizhou Li
31
3
0
05 Aug 2021
What Makes Sound Event Localization and Detection Difficult? Insights from Error Analysis
Thi Ngoc Tho Nguyen
Karn N. Watcharasupat
Zhen Jian Lee
Ngoc Khanh Nguyen
Douglas L. Jones
W. Gan
52
6
0
22 Jul 2021
Assessment of Self-Attention on Learned Features For Sound Event Localization and Detection
Parthasaarathy Sudarsanam
Archontis Politis
Konstantinos Drossos
54
13
0
20 Jul 2021
DCASE 2021 Task 3: Spectrotemporally-aligned Features for Polyphonic Sound Event Localization and Detection
Thi Ngoc Tho Nguyen
Karn N. Watcharasupat
Ngoc Khanh Nguyen
Douglas L. Jones
W. Gan
70
16
0
29 Jun 2021
A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection
Archontis Politis
Sharath Adavanne
D. Krause
Antoine Deleforge
Prerak Srivastava
Tuomas Virtanen
75
68
0
13 Jun 2021
SoundDet: Polyphonic Moving Sound Event Detection and Localization from Raw Waveform
Yuhang He
A. Trigoni
Andrew Markham
67
19
0
13 Jun 2021
L3DAS21 Challenge: Machine Learning for 3D Audio Signal Processing
E. Guizzo
R. F. Gramaccioni
Saeid Jamili
Christian Marinoni
Edoardo Massaro
...
Marco Pennese
Sveva Pepe
Enrico Rocchi
A. Uncini
Danilo Comminiello
154
27
0
12 Apr 2021
Three-class Overlapped Speech Detection using a Convolutional Recurrent Neural Network
Jee-weon Jung
Hee-Soo Heo
Youngki Kwon
Joon Son Chung
Bong-Jin Lee
122
19
0
07 Apr 2021
A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection
Qing Wang
Jun Du
Hua-Xin Wu
Jia Pan
Feng Ma
Chin-Hui Lee
64
83
0
08 Jan 2021
ACCDOA: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization and Detection
Kazuki Shimada
Yuichiro Koyama
Naoya Takahashi
Shusuke Takahashi
Yuki Mitsufuji
112
89
0
29 Oct 2020
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection
Yin Cao
Turab Iqbal
Qiuqiang Kong
Y. Zhong
Wenwu Wang
Mark D. Plumbley
71
78
0
25 Oct 2020
Sound event localization and detection based on CRNN using rectangular filters and channel rotation data augmentation
Francesca Ronchini
Daniel Arteaga
Andrés Pérez-López
50
9
0
13 Oct 2020
Event-Independent Network for Polyphonic Sound Event Localization and Detection
Yin Cao
Turab Iqbal
Qiuqiang Kong
Yue Zhong
Wenwu Wang
Mark D. Plumbley
60
32
0
30 Sep 2020
On Multitask Loss Function for Audio Event Detection and Localization
Huy P Phan
L. D. Pham
P. Koch
Ngoc Q. K. Duong
Ian Mcloughlin
Alfred Mertins
87
14
0
11 Sep 2020
Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019
Archontis Politis
A. Mesaros
Sharath Adavanne
Toni Heittola
Tuomas Virtanen
133
128
0
06 Sep 2020
Sound Event Localization and Detection Using Activity-Coupled Cartesian DOA Vector and RD3net
Kazuki Shimada
Naoya Takahashi
Shusuke Takahashi
Yuki Mitsufuji
121
19
0
22 Jun 2020
A Dataset of Reverberant Spatial Sound Scenes with Moving Sources for Sound Event Localization and Detection
Archontis Politis
Sharath Adavanne
Tuomas Virtanen
94
109
0
02 Jun 2020
A two-step system for sound event localization and detection
T. N. T. Nguyen
D. L. Jones
Rishabh Ranjan
S. Jayabalan
W. Gan
43
4
0
26 Nov 2019
Sound Event Localization and Detection Using CRNN on Pairs of Microphones
François Grondin
James R. Glass
I. Sobieraj
Mark D. Plumbley
77
33
0
22 Oct 2019
First Order Ambisonics Domain Spatial Augmentation for DNN-based Direction of Arrival Estimation
Luca Mazzon
Yuma Koizumi
Masahiro Yasuda
Noboru Harada
67
44
0
10 Oct 2019
Sound source detection, localization and classification using consecutive ensemble of CRNN models
Slawomir Kapka
M. Lewandowski
122
66
0
02 Aug 2019