BEST-CASE RESULTS FOR NEAREST-NEIGHBOR LEARNING

被引:20
作者
SALZBERG, S [1 ]
DELCHER, AL [1 ]
HEATH, D [1 ]
KASIF, S [1 ]
机构
[1] LOYOLA COLL,DEPT COMP SCI,BALTIMORE,MD 21210
基金
美国国家科学基金会;
关键词
MACHINE LEARNING; NEAREST-NEIGHBOR; GEOMETRIC CONCEPTS;
D O I
10.1109/34.387506
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a theoretical model for analysis of classification methods, in which the teacher knows the classification algorithm and chooses examples in the best way possible. We apply this model using the nearest-neighbor learning algorithm, and develop upper and lower bounds on sample complexity for several different concept classes. For some concept classes, the sample complexity turns out to be exponential even using this best-case model, which implies that the concept class is inherently difficult for the NN algorithm. We identify several geometric properties that make learning certain concepts relatively easy. Finally we discuss the relation of our work to helpful teacher models, its application to decision tree learning algorithms, and some of its implications for current experimental work.
引用
收藏
页码:599 / 608
页数:10
相关论文
共 29 条
[1]  
Aha D, 1991, MACHINE LEARNING, V6
[2]   LEARNING REGULAR SETS FROM QUERIES AND COUNTEREXAMPLES [J].
ANGLUIN, D .
INFORMATION AND COMPUTATION, 1987, 75 (02) :87-106
[3]   NONOBTUSE TRIANGULATION OF POLYGONS [J].
BAKER, BS ;
GROSSE, E ;
RAFFERTY, CS .
DISCRETE & COMPUTATIONAL GEOMETRY, 1988, 3 (02) :147-168
[4]  
BERN M, 1992, LATIN 92, P46
[5]  
BERN M, 1991, 7TH ACM S COMPUTATIO, P342
[6]  
BERN M, 1994, 10TH P ANN ACM S COM, P221
[7]  
BHATTACHARYA B, 1992, SOCS9219 TECH REP
[8]   FINDING PROTOTYPES FOR NEAREST NEIGHBOR CLASSIFIERS [J].
CHANG, CL .
IEEE TRANSACTIONS ON COMPUTERS, 1974, C 23 (11) :1179-1184
[9]   A WEIGHTED NEAREST NEIGHBOR ALGORITHM FOR LEARNING WITH SYMBOLIC FEATURES [J].
COST, S ;
SALZBERG, S .
MACHINE LEARNING, 1993, 10 (01) :57-78
[10]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+