TY - JOUR
T1 - ECM-LSE
T2 - Prediction of Extracellular Matrix Proteins Using Deep Latent Space Encoding of k-Spaced Amino Acid Pairs
AU - Al-Saggaf, Ubaid M.
AU - Usman, Muhammad
AU - Naseem, Imran
AU - Moinuddin, Muhammad
AU - Jiman, Ahmad A.
AU - Alsaggaf, Mohammed U.
AU - Alshoubaki, Hitham K.
AU - Khan, Shujaat
PY - 2021/10/14
Y1 - 2021/10/14
N2 - Extracelluar matrix (ECM) proteins create complex networks of macromolecules which fill-in the extracellular spaces of living tissues. They provide structural support and play an important role in maintaining cellular functions. Identification of ECM proteins can play a vital role in studying various types of diseases. Conventional wet lab–based methods are reliable; however, they are expensive and time consuming and are, therefore, not scalable. In this research, we propose a sequence-based novel machine learning approach for the prediction of ECM proteins. In the proposed method, composition of k-spaced amino acid pair (CKSAAP) features are encoded into a classifiable latent space (LS) with the help of deep latent space encoding (LSE). A comprehensive ablation analysis is conducted for performance evaluation of the proposed method. Results are compared with other state-of-the-art methods on the benchmark dataset, and the proposed ECM-LSE approach has shown to comprehensively outperform the contemporary methods.
AB - Extracelluar matrix (ECM) proteins create complex networks of macromolecules which fill-in the extracellular spaces of living tissues. They provide structural support and play an important role in maintaining cellular functions. Identification of ECM proteins can play a vital role in studying various types of diseases. Conventional wet lab–based methods are reliable; however, they are expensive and time consuming and are, therefore, not scalable. In this research, we propose a sequence-based novel machine learning approach for the prediction of ECM proteins. In the proposed method, composition of k-spaced amino acid pair (CKSAAP) features are encoded into a classifiable latent space (LS) with the help of deep latent space encoding (LSE). A comprehensive ablation analysis is conducted for performance evaluation of the proposed method. Results are compared with other state-of-the-art methods on the benchmark dataset, and the proposed ECM-LSE approach has shown to comprehensively outperform the contemporary methods.
KW - amino acid composition (AAC)
KW - auto-encoder
KW - classification
KW - composition of k-spaced amino acid pair (CKSAAP)
KW - extracellular matrix (ECM)
KW - latent space learning
KW - neural network
UR - http://www.scopus.com/inward/record.url?scp=85118139048&partnerID=8YFLogxK
U2 - 10.3389/fbioe.2021.752658
DO - 10.3389/fbioe.2021.752658
M3 - Article
C2 - 34722479
AN - SCOPUS:85118139048
SN - 2296-4185
VL - 9
JO - Frontiers in Bioengineering and Biotechnology
JF - Frontiers in Bioengineering and Biotechnology
M1 - 752658
ER -