A new fast prototype selection method based on clustering

被引:88
作者
Arturo Olvera-Lopez, J. [1 ]
Ariel Carrasco-Ochoa, J. [1 ]
Francisco Martinez-Trinidad, J. [1 ]
机构
[1] Natl Inst Astrophys Opt & Elect, Dept Comp Sci, Puebla 72000, Mexico
关键词
Prototype selection; Supervised classification; Instance-based classifiers; Border prototypes; Data reduction; Clustering; NEAREST; CLASSIFICATION;
D O I
10.1007/s10044-008-0142-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In supervised classification, a training set T is given to a classifier for classifying new prototypes. In practice, not all information in T is useful for classifiers, therefore, it is convenient to discard irrelevant prototypes from T. This process is known as prototype selection, which is an important task for classifiers since through this process the time for classification or training could be reduced. In this work, we propose a new fast prototype selection method for large datasets, based on clustering, which selects border prototypes and some interior prototypes. Experimental results showing the performance of our method and comparing accuracy and runtimes against other prototype selection methods are reported.
引用
收藏
页码:131 / 141
页数:11
相关论文
共 29 条
[1]  
[Anonymous], 2014, C4. 5: programs for machine learning
[2]  
[Anonymous], 1973, Pattern Classification and Scene Analysis
[3]  
Atkeson CG, 1997, ARTIF INTELL REV, V11, P11, DOI 10.1023/A:1006559212014
[4]   Nearest prototype classifier designs: An experimental study [J].
Bezdek, JC ;
Kuncheva, LI .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2001, 16 (12) :1445-1473
[5]   Advances in instance selection for instance-based learning algorithms [J].
Brighton, H ;
Mellish, C .
DATA MINING AND KNOWLEDGE DISCOVERY, 2002, 6 (02) :153-172
[6]  
Chou CH, 2006, INT C PATT RECOG, P556
[7]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[8]  
CRISTANNI N, 2000, INTRO SUPPORT VECTOR
[9]  
Devijver P. A., 1980, Proceedings of the 5th International Conference on Pattern Recognition, P72
[10]   Approximate statistical tests for comparing supervised classification learning algorithms [J].
Dietterich, TG .
NEURAL COMPUTATION, 1998, 10 (07) :1895-1923