Description
Chromagram-based representation of audio extracted from videos. These representations were extracted from the UCF-101 Human Action Recognition dataset. Only videos with audio channels were considered.
Steps to reproduce:
How the data were acquired. Audios of human actions were extracted from UCF101, which was originally collected from YouTube. A script was devised to extract audios of actions from fifty-one different action categories: Archery, Cricket Shot, Hair Cutting, Playing Flute, Rafting, Sky Diving and so on. Data were arranged in two folders train and test to help researchers in evaluating their models.
Steps to reproduce:
How the data were acquired. Audios of human actions were extracted from UCF101, which was originally collected from YouTube. A script was devised to extract audios of actions from fifty-one different action categories: Archery, Cricket Shot, Hair Cutting, Playing Flute, Rafting, Sky Diving and so on. Data were arranged in two folders train and test to help researchers in evaluating their models.
Date made available | 10 Jan 2023 |
---|---|
Publisher | Mendeley Data |