Chroma-Actions Dataset: Acoustic Images

  • Naveed Akhtar (Creator)
  • Douglas Chai (Creator)
  • Syed Mohammed Shamsul Islam (Contributor)
  • Muhammad Bilal Shaikh (Creator)

Dataset

Description

Chromagram-based representation of audio extracted from videos. These representations were extracted from the UCF-101 Human Action Recognition dataset. Only videos with audio channels were considered.

Steps to reproduce:
How the data were acquired. Audios of human actions were extracted from UCF101, which was originally collected from YouTube. A script was devised to extract audios of actions from fifty-one different action categories: Archery, Cricket Shot, Hair Cutting, Playing Flute, Rafting, Sky Diving and so on. Data were arranged in two folders train and test to help researchers in evaluating their models.
Date made available10 Jan 2023
PublisherMendeley Data

Cite this