Local Binary Pattern with Random Forest for Acoustic Scene Classification

Shamsiah Abidin, Xianjun Xia, Robert Togneri, Ferdous Sohel

Research output: Chapter in Book/Conference paperConference paperpeer-review

8 Citations (Scopus)

Abstract

This paper presents an approach for acoustic scene classification using the local binary pattern (LBP) and random forest (RF). The audio signal is converted to a Constant-Q transform (CQT) representation and LBP is used to extract the features from this time-frequency representation. The CQT representations are divided into a number of sub-bands to obtain more localized features relevant to the spectral information. We then use random forest to select the most important features for each band of extracted LBP features. For further performance enhancement, we use feature level fusion of LBP and HOG features. The proposed system has achieved an accuracy of 85% on the DCASE 2016 dataset.

Original languageEnglish
Title of host publication2018 IEEE International Conference on Multimedia and Expo, ICME 2018
PublisherIEEE, Institute of Electrical and Electronics Engineers
Volume2018-July
ISBN (Electronic)9781538617373
DOIs
Publication statusPublished - 8 Oct 2018
Event2018 IEEE International Conference on Multimedia and Expo, ICME 2018 - San Diego, United States
Duration: 23 Jul 201827 Jul 2018

Conference

Conference2018 IEEE International Conference on Multimedia and Expo, ICME 2018
Country/TerritoryUnited States
CitySan Diego
Period23/07/1827/07/18

Fingerprint

Dive into the research topics of 'Local Binary Pattern with Random Forest for Acoustic Scene Classification'. Together they form a unique fingerprint.

Cite this