Ultrahigh dimensional variable selection for interpolation of point referenced spatial data: A digital soil mapping case study

Benjamin R. Fitzpatrick, David W. Lamb, Kerrie Mengersen

Research output: Contribution to journalArticlepeer-review

10 Citations (Scopus)

Abstract

Modern soil mapping is characterised by the need to interpolate point referenced (geostatistical) observations and the availability of large numbers of environmental characteristics for consideration as covariates to aid this interpolation. Modelling tasks of this nature also occur in other fields such as biogeography and environmental science. This analysis employs the Least Angle Regression (LAR) algorithm for fitting Least Absolute Shrinkage and Selection Operator (LASSO) penalized Multiple Linear Regressions models. This analysis demonstrates the efficiency of the LAR algorithm at selecting covariates to aid the interpolation of geostatistical soil carbon observations. Where an exhaustive search of the models that could be constructed from 800 potential covariate terms and 60 observations would be prohibitively demanding, LASSO variable selection is accomplished with trivial computational investment.

Original languageEnglish
Article numbere0162489
Number of pages19
JournalPLoS One
Volume11
Issue number9
DOIs
Publication statusPublished - Sept 2016
Externally publishedYes

Fingerprint

Dive into the research topics of 'Ultrahigh dimensional variable selection for interpolation of point referenced spatial data: A digital soil mapping case study'. Together they form a unique fingerprint.

Cite this