ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.08056
  4. Cited By
DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and
  Missing Labels

DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels

12 June 2024
Samuele Cornell
Janek Ebbers
Constance Douwes
Irene Martín-Morató
Manu Harju
A. Mesaros
Romain Serizel
ArXiv (abs)PDFHTMLGithub (40★)

Papers citing "DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels"

15 / 15 papers shown
CASTELLA: Long Audio Dataset with Captions and Temporal Boundaries
CASTELLA: Long Audio Dataset with Captions and Temporal Boundaries
Hokuto Munakata
Takehiro Imamura
Taichi Nishimura
Tatsuya Komatsu
137
0
0
19 Nov 2025
Metric Analysis for Spatial Semantic Segmentation of Sound Scenes
Metric Analysis for Spatial Semantic Segmentation of Sound Scenes
Mayank Mishra
P. Magron
Romain Serizel
153
0
0
10 Nov 2025
Not in Sync: Unveiling Temporal Bias in Audio Chat Models
Not in Sync: Unveiling Temporal Bias in Audio Chat Models
Jiayu Yao
Shenghua Liu
Yiwei Wang
Rundong Cheng
Lingrui Mei
Baolong Bi
Zhen Xiong
Xueqi Cheng
137
1
0
14 Oct 2025
Detect Any Sound: Open-Vocabulary Sound Event Detection with Multi-Modal Queries
Detect Any Sound: Open-Vocabulary Sound Event Detection with Multi-Modal Queries
Pengfei Cai
Yan Song
Qing Gu
Nan Jiang
Haoyu Song
Ian Mcloughlin
VLM
321
3
0
22 Jul 2025
FLAM: Frame-Wise Language-Audio Modeling
FLAM: Frame-Wise Language-Audio Modeling
Yusong Wu
Christos Tsirigotis
Ke Chen
Cheng-Zhi Anna Huang
Rameswar Panda
Oriol Nieto
Prem Seetharaman
Justin Salamon
505
12
0
08 May 2025
Policy Optimization Algorithms in a Unified Framework
Policy Optimization Algorithms in a Unified Framework
Shuang Wu
294
1
0
04 Apr 2025
Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event Detection
Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event Detection
Han Yin
Yang Xiao
Jisheng Bai
Rohan Kumar Das
495
2
0
02 Nov 2024
A decade of DCASE: Achievements, practices, evaluations and future
  challenges
A decade of DCASE: Achievements, practices, evaluations and future challengesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
A. Mesaros
Romain Serizel
Toni Heittola
Maria Sandsten
Mark D. Plumbley
225
11
0
07 Oct 2024
Prototype based Masked Audio Model for Self-Supervised Learning of Sound
  Event Detection
Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event DetectionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Pengfei Cai
Yan Song
Nan Jiang
Qing Gu
Ian Mcloughlin
275
5
0
26 Sep 2024
Effective Pre-Training of Audio Transformers for Sound Event Detection
Effective Pre-Training of Audio Transformers for Sound Event DetectionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Florian Schmid
T. Morocutti
Francesco Foscarin
Jan Schluter
Paul Primus
Gerhard Widmer
ViT
350
12
0
14 Sep 2024
Energy Consumption Trends in Sound Event Detection Systems
Energy Consumption Trends in Sound Event Detection SystemsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Constance Douwes
Romain Serizel
346
2
0
13 Sep 2024
MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for
  Heterogeneous Sound Event Detection
MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event DetectionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Zehao Wang
Haobo Yue
Zhicheng Zhang
Da Mu
Jin Tang
Jianqin Yin
237
0
0
10 Sep 2024
Improving Audio Spectrogram Transformers for Sound Event Detection
  Through Multi-Stage Training
Improving Audio Spectrogram Transformers for Sound Event Detection Through Multi-Stage Training
Florian Schmid
Paul Primus
T. Morocutti
Jonathan Greif
Gerhard Widmer
308
13
0
17 Jul 2024
FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with
  Heterogeneous Training Dataset and Potentially Missing Labels
FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels
Yang Xiao
Han Yin
Jisheng Bai
Rohan Kumar Das
308
7
0
29 Jun 2024
Self Training and Ensembling Frequency Dependent Networks with Coarse
  Prediction Pooling and Sound Event Bounding Boxes
Self Training and Ensembling Frequency Dependent Networks with Coarse Prediction Pooling and Sound Event Bounding Boxes
Hyeonuk Nam
D. Min
Seungdeok Choi
Inhan Choi
Yong-Hwa Park
204
9
0
22 Jun 2024
1
Page 1 of 1