A UNIFIED LOSS FUNCTION TO TACKLE INTER-CLASS AND INTRA-CLASS DATA IMBALANCE IN SOUND EVENT DETECTION

Research output: Chapter in Book/Conference paperConference paperpeer-review

1 Citation (Scopus)

Abstract

Data imbalance is an important issue in data-driven deep-learning methodologies. In sound event detection (SED), there are two types of data imbalance issues caused by the diverse time duration of sound events: the data imbalance between sound event classes (inter-class imbalance) and the active/inactive imbalance within the class (intra-class imbalance). In this paper, we propose a unified loss function (ULF), which adeptly addresses both the inter-class imbalance and intra-class imbalance simultaneously. Evaluation experiments substantiate that the ULF consistently yields superior and more stable performance compared to existing loss functions that singularly address either type of imbalance. Furthermore, the ULF loss also enhances the model's capacity to detect hard-to-detect sound events.

Original languageEnglish
Title of host publication ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
PublisherIEEE, Institute of Electrical and Electronics Engineers
Pages996-1000
Number of pages5
ISBN (Electronic)9798350344851
ISBN (Print)979-8-3503-4486-8
DOIs
Publication statusPublished - 2024
Event49th IEEE International Conference on Acoustics, Speech, and Signal Processing, - Seoul, Korea, Republic of
Duration: 14 Apr 202419 Apr 2024

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print)1520-6149

Conference

Conference49th IEEE International Conference on Acoustics, Speech, and Signal Processing,
Abbreviated titleICASSP 2024
Country/TerritoryKorea, Republic of
CitySeoul
Period14/04/2419/04/24

Fingerprint

Dive into the research topics of 'A UNIFIED LOSS FUNCTION TO TACKLE INTER-CLASS AND INTRA-CLASS DATA IMBALANCE IN SOUND EVENT DETECTION'. Together they form a unique fingerprint.

Cite this