Abstract
We present PyMAiVAR, a versatile toolbox that encompasses the generation of image representations for audio data including Wave plots, Spectral Centroids, Spectral Roll Offs, Mel Frequency Cepstral Coefficients (MFCC), MFCC Feature Scaling, and Chromagrams. This wide-ranging toolkit generates rich audio-image representations, playing a pivotal role in reshaping human action recognition. By fully exploiting audio data's latent potential, PyMAiVAR stands as a significant advancement in the field. The package is implemented in Python and can be used across different operating systems.
Original language | English |
---|---|
Article number | 100544 |
Journal | Software Impacts |
Volume | 17 |
DOIs | |
Publication status | Published - Sept 2023 |
Fingerprint
Dive into the research topics of 'PyMAiVAR: An open-source Python suit for audio-image representation in human action recognition[Formula presented]'. Together they form a unique fingerprint.Datasets
-
MFFCs for Multi-class Human Action Analysis : A Benchmark Dataset
Shaikh, M. B. (Creator), Chai, D. (Contributor), Islam, S. M. S. (Contributor) & Akhtar, N. (Contributor), Mendeley Data, 26 Jul 2023
DOI: 10.17632/6ng2kgvnwk.1, https://data.mendeley.com/datasets/6ng2kgvnwk
Dataset
-
Waveplot-based Dataset for Multi-class Human Action Analysis
Shaikh, M. B. (Creator), Chai, D. (Contributor), Islam, S. M. S. (Contributor) & Akhtar, N. (Contributor), Mendeley Data, 26 Jul 2023
DOI: 10.17632/3vsz7v53pn.1, https://data.mendeley.com/datasets/3vsz7v53pn
Dataset
-
Spectral Rolloff Images for Multi-class Human Action Analysis : A Benchmark Dataset
Shaikh, M. B. (Creator), Chai, D. (Contributor), Islam, S. M. S. (Contributor) & Akhtar, N. (Contributor), Mendeley Data, 26 Jul 2023
DOI: 10.17632/nd5kftbhyj.1, https://data.mendeley.com/datasets/nd5kftbhyj
Dataset