Locally adaptive metric nearest-neighbor classification

被引:193
作者
Domeniconi, C
Peng, J
Gunopulos, D
机构
[1] Univ Calif Riverside, Comp Sci Dept, Riverside, CA 92521 USA
[2] Tulane Univ, Elect Engn & Comp Sci Dept, New Orleans, LA 70118 USA
关键词
Chi-squared distance; classification; feature relevance; nearest neighbors;
D O I
10.1109/TPAMI.2002.1033219
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nearest-neighbor classification assumes locally constant class conditional probabilities. This assumption becomes invalid in high dimensions with finite samples due to the curse of dimensionality. Severe bias can be introduced under these conditions when using the nearest-neighbor rule. We propose a locally adaptive nearest-neighbor classification method to try to minimize bias. We use a Chi-squared distance analysis to compute a flexible metric for producing neighborhoods that are highly adaptive to query locations. Neighborhoods are elongated along less relevant feature dimensions and constricted along most influential ones. As a result, the class conditional probabilities are smoother in the modified neighborhoods, whereby better classification performance can e achieved. The efficacy of our method is validated and compared against other techniques using both simulated and real-world data.
引用
收藏
页码:1281 / 1285
页数:5
相关论文
共 15 条