Classification of MALDI-MS imaging data of tissue microarrays using canonical correlation analysis-based variable selection

Lyron Winderbaum, Inge Koch, Parul Mittal, Peter Hoffmann

Research output: Contribution to journalArticlepeer-review

9 Citations (Scopus)

Abstract

Applying MALDI-MS imaging to tissue microarrays (TMAs) provides access to proteomics data from large cohorts of patients in a cost- and time-efficient way, and opens the potential for applying this technology in clinical diagnosis. The complexity of these TMA data—high-dimensional low sample size—provides challenges for the statistical analysis, as classical methods typically require a nonsingular covariance matrix that cannot be satisfied if the dimension is greater than the sample size. We use TMAs to collect data from endometrial primary carcinomas from 43 patients. Each patient has a lymph node metastasis (LNM) status of positive or negative, which we predict on the basis of the MALDI-MS imaging TMA data. We propose a variable selection approach based on canonical correlation analysis that explicitly uses the LNM information. We apply LDA to the selected variables only. Our method misclassifies 2.3–20.9% of patients by leave-one-out cross-validation and strongly outperforms LDA after reduction of the original data with principle component analysis.

Original languageEnglish
Pages (from-to)1731-1735
Number of pages5
JournalProteomics
Volume16
Issue number11-12
DOIs
Publication statusPublished - 1 Jun 2016
Externally publishedYes

Fingerprint

Dive into the research topics of 'Classification of MALDI-MS imaging data of tissue microarrays using canonical correlation analysis-based variable selection'. Together they form a unique fingerprint.

Cite this