TY - JOUR
T1 - Polynomial histograms for multivariate density and mode estimation
AU - Jing, Junmei
AU - Koch, Inge
AU - Naito, Kanta
PY - 2012/3/1
Y1 - 2012/3/1
N2 - We consider the problem of efficiently estimating multivariate densities and their modes for moderate dimensions and an abundance of data. We propose polynomial histograms to solve this estimation problem. We present first- and second-order polynomial histogram estimators for a general d-dimensional setting. Our theoretical results include pointwise bias and variance of these estimators, their asymptotic mean integrated square error (AMISE), and optimal binwidth. The asymptotic performance of the first-order estimator matches that of the kernel density estimator, while the second-order estimator has the faster rate of O(n^{-6/(d+6)}). For a bivariate normal setting, we present explicit expressions for the AMISE constants which show the much larger binwidths of the second-order estimator and hence also more efficient computations of multivariate densities. We apply polynomial histogram estimators to real data from biotechnology and find the number and location of modes in such data.
AB - We consider the problem of efficiently estimating multivariate densities and their modes for moderate dimensions and an abundance of data. We propose polynomial histograms to solve this estimation problem. We present first- and second-order polynomial histogram estimators for a general d-dimensional setting. Our theoretical results include pointwise bias and variance of these estimators, their asymptotic mean integrated square error (AMISE), and optimal binwidth. The asymptotic performance of the first-order estimator matches that of the kernel density estimator, while the second-order estimator has the faster rate of O(n^{-6/(d+6)}). For a bivariate normal setting, we present explicit expressions for the AMISE constants which show the much larger binwidths of the second-order estimator and hence also more efficient computations of multivariate densities. We apply polynomial histogram estimators to real data from biotechnology and find the number and location of modes in such data.
KW - Asymptotic performance
KW - Estimation of modes
KW - Multivariate density estimation
KW - Polynomial histogram estimators
UR - http://www.scopus.com/inward/record.url?scp=84857035947&partnerID=8YFLogxK
U2 - 10.1111/j.1467-9469.2011.00764.x
DO - 10.1111/j.1467-9469.2011.00764.x
M3 - Article
AN - SCOPUS:84857035947
SN - 0303-6898
VL - 39
SP - 75
EP - 96
JO - Scandinavian Journal of Statistics
JF - Scandinavian Journal of Statistics
IS - 1
ER -