Fast implementations of nearest neighbor classifiers

被引:54
作者
Grother, PJ [1 ]
Candela, GT [1 ]
Blue, JL [1 ]
机构
[1] NIST,INFORMAT TECHNOL LAB,MATH MODELLING GRP,GAITHERSBURG,MD 20899
关键词
non-parametric classifiers; kd trees; k nearest neighbors; distance metrics; Karhunen-Loeve transform; OCR;
D O I
10.1016/S0031-3203(96)00098-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Standard implementations of non-parametric classifiers have large computational requirements. Parzen classifiers use the distances of an unknown vector to all N prototype samples, and consequently exhibit O(N) behavior in both memory and time. We describe four techniques for expediting the nearest neighbor methods: replacing the linear search with a new kd tree method, exhibiting approximately O(N-1/2) behavior; employing an L-infinity instead of L-2 distance metric; using variance-ordered features; and rejecting prototypes by evaluating distances in low dimensionality subspaces. We demonstrate that variance-ordered features yield significant efficiency gains over the same features linearly transformed to have uniform variance. We give results for a large OCR problem, but note that the techniques expedite recognition for arbitrary applications. Three of four techniques preserve recognition accuracy. (C) 1997 Pattern Recognition Society.
引用
收藏
页码:459 / 465
页数:7
相关论文
共 14 条
[1]   MULTIDIMENSIONAL BINARY SEARCH TREES USED FOR ASSOCIATIVE SEARCHING [J].
BENTLEY, JL .
COMMUNICATIONS OF THE ACM, 1975, 18 (09) :509-517
[2]   EVALUATION OF PATTERN CLASSIFIERS FOR FINGERPRINT AND OCR APPLICATIONS [J].
BLUE, JL ;
CANDELA, GT ;
GROTHER, PJ ;
CHELLAPPA, R ;
WILSON, CL .
PATTERN RECOGNITION, 1994, 27 (04) :485-501
[3]  
CANDELA GT, 1993, 5163 NISTIR NAT I ST
[4]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[5]  
Dasarathy B.V., 1991, IEEE COMPUTER SOC TU
[6]  
Friedman J. H., 1977, ACM Transactions on Mathematical Software, V3, P209, DOI 10.1145/355744.355745
[7]   THE REDUCED PARZEN CLASSIFIER [J].
FUKUNAGA, K ;
HAYES, RR .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1989, 11 (04) :423-425
[8]  
Fukunaga K., 1990, INTRO STAT PATTERN R, P254
[9]  
GARRIS MD, 1994, 5469 NISTIR NAT I ST
[10]  
GROTHER PJ, 1995, 19 HFCD NAT I STAND