Feature significance for multivariate kernel density estimation

被引:71
作者
Duong, Tam [1 ]
Cowling, Arianna [1 ]
Koch, Inge [1 ]
Wand, M. P. [1 ]
机构
[1] Univ New S Wales, Sch Math & Stat, Sydney, NSW, Australia
基金
澳大利亚研究理事会;
关键词
D O I
10.1016/j.csda.2008.02.035
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Multivariate kernel density estimation provides information about structure in data. Feature significance is a technique for deciding whether features - such as local extrema - are statistically significant. This paper proposes a framework for feature significance in d-dimensional data which combines kernel density derivative estimators and hypothesis tests for modal regions. For the gradient and curvature estimators distributional properties are given, and pointwise test statistics are derived. The hypothesis tests extend the two-dimensional feature significance ideas of Godtliebsen et al. [Godtliebsen, E, Marron, J.S., Chaudhuri, P., 2002. Significance in scale space for bivariate density estimation. Journal of Computational and Graphical Statistics 11, 1-21]. The theoretical framework is complemented by novel visualization for three-dimensional data. Applications to real data sets show that tests based on the kernel curvature estimators perform well in identifying modal regions. These results can be enhanced by corresponding tests with kernel gradient estimators. (c) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:4225 / 4242
页数:18
相关论文
共 25 条
[1]  
ADLER D, 2006, RGL 3D VISUALIZATION
[2]  
[Anonymous], 1992, MULTIVARIATE DENSITY
[3]  
Bowman AW, 1997, Applied Smoothing Techniques for Data Analysis: the Kernel Approach with S-Plus Illustrations
[4]   Scale space view of curve estimation [J].
Chaudhuri, P ;
Marron, JS .
ANNALS OF STATISTICS, 2000, 28 (02) :408-428
[5]   SiZer for exploration of structures in curves [J].
Chaudhuri, P ;
Marron, JS .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (447) :807-823
[6]   Cross-validation bandwidth matrices for multivariate kernel density estimation [J].
Duong, T ;
Hazelton, ML .
SCANDINAVIAN JOURNAL OF STATISTICS, 2005, 32 (03) :485-506
[7]   Plug-in bandwidth matrices for bivariate kernel density estimation [J].
Duong, T ;
Hazelton, ML .
JOURNAL OF NONPARAMETRIC STATISTICS, 2003, 15 (01) :17-30
[8]  
FENG D, 2005, MISC3D MISCELLANEOUS
[9]  
Givan, 2001, FLOW CYTOMETRY 1 PRI, DOI 10.1002/0471223948
[10]   Significance in scale space for bivariate density estimation [J].
Godtliebsen, F ;
Marron, JS ;
Chaudhuri, P .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2002, 11 (01) :1-21