DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels

12 June 2024

Romain Serizel


Papers citing "DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels"


CASTELLA: Long Audio Dataset with Captions and Temporal Boundaries

145

19 Nov 2025

Metric Analysis for Spatial Semantic Segmentation of Sound Scenes

Mayank Mishra

P. Magron

Romain Serizel

159

10 Nov 2025

Not in Sync: Unveiling Temporal Bias in Audio Chat Models

137

14 Oct 2025

Detect Any Sound: Open-Vocabulary Sound Event Detection with Multi-Modal Queries

336

22 Jul 2025

FLAM: Frame-Wise Language-Audio Modeling

511

08 May 2025

Policy Optimization Algorithms in a Unified Framework

Shuang Wu

299

04 Apr 2025

Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event Detection

497

02 Nov 2024

A decade of DCASE: Achievements, practices, evaluations and future challenges

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Romain Serizel

228

07 Oct 2024

Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event Detection

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

283

26 Sep 2024

Effective Pre-Training of Audio Transformers for Sound Event Detection

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

358

14 Sep 2024

Energy Consumption Trends in Sound Event Detection Systems

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Constance Douwes

Romain Serizel

348

13 Sep 2024

MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Zhicheng Zhang

238

10 Sep 2024

Improving Audio Spectrogram Transformers for Sound Event Detection Through Multi-Stage Training

315

17 Jul 2024

FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels

313

29 Jun 2024

Self Training and Ensembling Frequency Dependent Networks with Coarse Prediction Pooling and Sound Event Bounding Boxes

213

22 Jun 2024