Abstract
This paper introduces the use of local binary patterns (LBP) extracted from a time-frequency representation (TFR) for acoustic scene classification. As LBP provides a description of the global TFR texture we propose a novel zoning mechanism that provides a simple solution to extract spectrally relevant local features which better characterize the audio TFRs. To further improve the classification performance, we perform feature and score level fusion of the proposed LBP (with zoning) with histogram of gradients (HOG) of the TFR images. Our technique demonstrates an improved performance by achieving a classification accuracy of 95.2% using a fusion of time-frequency derived features.
Original language | English |
---|---|
Title of host publication | 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing Proceedings |
Editors | Magdy A. Bayoumi |
Place of Publication | New Orleans |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 626-630 |
Number of pages | 5 |
ISBN (Electronic) | 9781509041176 |
DOIs | |
Publication status | Published - 19 Jun 2017 |
Event | 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - New Orleans, United States Duration: 5 Mar 2017 → 9 Mar 2017 |
Conference
Conference | 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 |
---|---|
Abbreviated title | ICASSP 2017 |
Country/Territory | United States |
City | New Orleans |
Period | 5/03/17 → 9/03/17 |