An algorithm for data-driven bandwidth selection

被引:252
作者
Comaniciu, D [1 ]
机构
[1] Siemens Corp Res, Real Time Vis & Modeling Dept, Princeton, NJ 08540 USA
关键词
variable-bandwidth mean shift; bandwidth selection; multiscale analysis; Jensen-Shannon divergence; feature space;
D O I
10.1109/TPAMI.2003.1177159
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The analysis of a feature space that exhibits multiscale patterns often requires kernel estimation techniques with locally adaptive bandwidths, such as the variable-bandwidth mean shift. Proper selection of the kernel bandwidth is, however, a critical step for superior space analysis and partitioning. This paper presents a mean shift-based approach for local bandwidth selection in the multimodal, multivariate case. Our method is based on a fundamental property of normal distributions regarding the bias of the normalized density gradient. We demonstrate that, within the large sample approximation, the local covariance is estimated by the matrix that maximizes the magnitude of the normalized mean shift vector. Using this property, we develop a reliable algorithm which takes into account the stability of local bandwidth estimates across scales. The validity of our theoretical results is proven in various space partitioning experiments involving the variable-bandwidth mean shift.
引用
收藏
页码:281 / 288
页数:8
相关论文
共 28 条
[11]   IMPROVED VARIABLE WINDOW KERNEL ESTIMATES OF PROBABILITY DENSITIES [J].
HALL, P ;
HU, TC ;
MARRON, JS .
ANNALS OF STATISTICS, 1995, 23 (01) :1-10
[12]  
Irani M, 2000, LECT NOTES COMPUT SC, V1842, P539
[13]  
Jain A.K., 1988, Algorithms for Clustering Data
[14]   A brief survey of bandwidth selection for density estimation [J].
Jones, MC ;
Marron, JS ;
Sheather, SJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (433) :401-407
[15]  
Kauffman L.R., 1990, FINDING GROUPS DATA
[16]  
KULLBACK S, 1997, INFORMATION THEORY S
[17]   Clustering by scale-space filtering [J].
Leung, Y ;
Zhang, JS ;
Xu, ZB .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (12) :1396-1410
[18]   DIVERGENCE MEASURES BASED ON THE SHANNON ENTROPY [J].
LIN, JH .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1991, 37 (01) :145-151
[19]   Feature detection with automatic scale selection [J].
Lindeberg, T .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1998, 30 (02) :79-116
[20]   AN EXAMINATION OF PROCEDURES FOR DETERMINING THE NUMBER OF CLUSTERS IN A DATA SET [J].
MILLIGAN, GW ;
COOPER, MC .
PSYCHOMETRIKA, 1985, 50 (02) :159-179