A nonparametric classification method based on K-associated graphs

被引:41
作者
Bertini, Joao Roberto, Jr. [1 ]
Zhao, Liang [1 ]
Motta, Robson [1 ]
Lopes, Alneu de Andrade [1 ]
机构
[1] Univ Sao Paulo, Inst Math & Comp Sci, BR-13560970 Sao Carlos, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
Graph-based learning; Nonparametric classification; Graph formation; Multi-class classification; K-associated graph; Purity measure; HIGH-DIMENSIONAL DATA;
D O I
10.1016/j.ins.2011.07.043
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Graph is a powerful representation formalism that has been widely employed in machine learning and data mining. In this paper, we present a graph-based classification method, consisting of the construction of a special graph referred to as K-associated graph, which is capable of representing similarity relationships among data cases and proportion of classes overlapping. The main properties of the K-associated graphs as well as the classification algorithm are described. Experimental evaluation indicates that the proposed technique captures topological structure of the training data and leads to good results on classification task particularly for noisy data. In comparison to other well-known classification techniques, the proposed approach shows the following interesting features: (1) A new measure, called purity, is introduced not only to characterize the degree of overlap among classes in the input data set, but also to construct the K-associated optimal graph for classification; (2) nonlinear classification with automatic local adaptation according to the input data. Contrasting to K-nearest neighbor classifier, which uses a fixed K, the proposed algorithm is able to automatically consider different values of K, in order to best fit the corresponding overlap of classes in different data subspaces, revealing both the local and global structure of input data. (3) The proposed classification algorithm is nonparametric, implicating high efficiency and no need for model selection in practical applications. (C) 2011 Elsevier Inc. All rights reserved.
引用
收藏
页码:5435 / 5456
页数:22
相关论文
共 55 条
[1]  
[Anonymous], 2008, Semi-Supervised Learning Literature Survey
[2]  
[Anonymous], 2003, KDD
[3]  
[Anonymous], 2005, AISTAT
[4]  
[Anonymous], 2001, Pattern Classification
[5]  
[Anonymous], 2007, Uci machine learning repository
[6]  
[Anonymous], 2005, ICML
[7]   Semi-supervised learning on Riemannian manifolds [J].
Belkin, M ;
Niyogi, P .
MACHINE LEARNING, 2004, 56 (1-3) :209-239
[8]   Laplacian eigenmaps for dimensionality reduction and data representation [J].
Belkin, M ;
Niyogi, P .
NEURAL COMPUTATION, 2003, 15 (06) :1373-1396
[9]  
Belkin M., 2006, J MACHINE LEARNING R, V1, P1
[10]  
Callut J, 2008, LECT NOTES ARTIF INT, V5211, P162, DOI 10.1007/978-3-540-87479-9_29