TY - JOUR
T1 - Sequential estimation of Spearman rank correlation using Hermite series estimators
AU - Stephanou, Michael
AU - Varughese, Melvin
PY - 2021/11
Y1 - 2021/11
N2 - In this article we describe a new Hermite series based sequential estimator for the Spearman rank correlation coefficient and provide algorithms applicable in both the stationary and non-stationary settings. To treat the non-stationary setting, we introduce a novel, exponentially weighted estimator for the Spearman rank correlation, which allows the local nonparametric correlation of a bivariate data stream to be tracked. To the best of our knowledge this is the first algorithm to be proposed for estimating a time varying Spearman rank correlation that does not rely on a moving window approach. We explore the practical effectiveness of the Hermite series based estimators through real data and simulation studies demonstrating good practical performance. The simulation studies in particular reveal competitive performance compared to an existing algorithm. The potential applications of this work are manifold. The Hermite series based Spearman rank correlation estimator can be applied to fast and robust online calculation of correlation which may vary over time. Possible machine learning applications include, amongst others, fast feature selection and hierarchical clustering on massive data sets.
AB - In this article we describe a new Hermite series based sequential estimator for the Spearman rank correlation coefficient and provide algorithms applicable in both the stationary and non-stationary settings. To treat the non-stationary setting, we introduce a novel, exponentially weighted estimator for the Spearman rank correlation, which allows the local nonparametric correlation of a bivariate data stream to be tracked. To the best of our knowledge this is the first algorithm to be proposed for estimating a time varying Spearman rank correlation that does not rely on a moving window approach. We explore the practical effectiveness of the Hermite series based estimators through real data and simulation studies demonstrating good practical performance. The simulation studies in particular reveal competitive performance compared to an existing algorithm. The potential applications of this work are manifold. The Hermite series based Spearman rank correlation estimator can be applied to fast and robust online calculation of correlation which may vary over time. Possible machine learning applications include, amongst others, fast feature selection and hierarchical clustering on massive data sets.
KW - Hermite series estimators
KW - Incremental estimation
KW - Nonparametric correlation
KW - O(1) update algorithm
KW - Online estimation
KW - Sequential estimation
KW - Spearman correlation coefficient
UR - http://www.scopus.com/inward/record.url?scp=85110774502&partnerID=8YFLogxK
U2 - 10.1016/j.jmva.2021.104783
DO - 10.1016/j.jmva.2021.104783
M3 - Article
AN - SCOPUS:85110774502
SN - 0047-259X
VL - 186
JO - Journal of Multivariate Analysis
JF - Journal of Multivariate Analysis
M1 - 104783
ER -