Discriminant adaptive nearest neighbor classification

被引:524
作者
Hastie, T
Tibshirani, R
机构
[1] STANFORD UNIV, DEPT BIOSTAT, PALO ALTO, CA 94305 USA
[2] UNIV TORONTO, DEPT PREVENT MED & BIOSTAT, TORONTO, ON M5S 1A1, CANADA
[3] UNIV TORONTO, DEPT STAT, TORONTO, ON M5S 1A1, CANADA
基金
美国国家科学基金会; 加拿大自然科学与工程研究理事会;
关键词
classification; nearest neighbors; linear discriminant analysis; curse of dimensionality;
D O I
10.1109/34.506411
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nearest neighbor classification expects the class conditional probabilities to be locally constant, and suffers from bias in high dimensions. We propose a locally adaptive form of nearest neighbor classification to try to ameliorate this curse of dimensionality. We use a local linear discriminant analysis to estimate an effective metric for computing neighborhoods. We determine the local decision boundaries from centroid information, and then shrink neighborhoods in directions orthogonal to these local decision boundaries, and elongate them parallel to the boundaries. Thereafter, any neighborhood-based classifier can be employed, using the modified neighborhoods. The posterior probabilities tend to be more homogeneous in the modified neighborhoods. We also propose a method for global dimension reduction, that combines local dimension information. In a number of examples, the methods demonstrate the potential for substantial improvements over nearest neighbor classification.
引用
收藏
页码:607 / 616
页数:10
相关论文
共 17 条
[1]  
Breiman L., 1984, Classification and Regression Trees, DOI DOI 10.2307/2530946
[2]   ROBUST LOCALLY WEIGHTED REGRESSION AND SMOOTHING SCATTERPLOTS [J].
CLEVELAND, WS .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (368) :829-836
[3]  
COVER TM, 1968, 1ST P ANN HAW C SYST, P413
[4]   SLICING REGRESSION - A LINK-FREE REGRESSION METHOD [J].
DUAN, N ;
LI, KC .
ANNALS OF STATISTICS, 1991, 19 (02) :505-530
[5]  
Duda R. O., 1973, PATTERN CLASSIFICATI, V3
[6]  
Friedman JeromeH., 1994, FLEXIBLE METRIC NEAR
[7]  
HASTIE T, 1993, LEARNING PROTOTYPE M
[8]  
HASTIE T, 1994, ANN STAT
[9]  
LOWE DG, 1993, SIMILARITY METRIC LE
[10]  
McLachlan G. J., 2005, Discriminant analysis and statistical pattern recognition