Data dimensionality reduction with application to simplifying RBF network structure and improving classification performance

被引:253
作者
Fu, XJ [1 ]
Wang, LP [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 2263, Singapore
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2003年 / 33卷 / 03期
关键词
classifier; data dimensionality reduction; overlaps; RBF neural networks; SCM;
D O I
10.1109/TSMCB.2003.810911
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 [计算机科学与技术];
摘要
For high dimensional data, if no preprocessing is carried out before inputting patterns to classifiers, the computation required may be too heavy. For example, the number of hidden units of a radial basis function (RBF) neural network can be too large. This is not suitable for some practical applications due to speed and memory constraints. In many cases, some attributes are not relevant to concepts in the data at all. In this paper, we propose a novel separability-correlation measure (SCM) to rank the importance of attributes. According to the attribute ranking results, different attribute subsets are used as inputs to a classifier, such as an RBF neural network. Those attributes that increase the validation error are deemed irrelevant and are deleted. The complexity of the classifier can thus be reduced and its classification performance improved. Computer simulations show that our method for attribute importance ranking leads to smaller attribute subsets with higher accuracies compared with the existing SUD and Relief-F methods. We also propose a modified method for efficient construction of an RBF classifier. In this method we allow for large overlaps between clusters corresponding to the same class label. Our approach significantly reduces the structural complexity of the RBF network and improves the classification performance.
引用
收藏
页码:399 / 409
页数:11
相关论文
共 20 条
[1]
USING MUTUAL INFORMATION FOR SELECTING FEATURES IN SUPERVISED NEURAL-NET LEARNING [J].
BATTITI, R .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (04) :537-550
[2]
Bishop C. M., 1995, NEURAL NETWORKS PATT
[3]
BOLLACKER KD, 1996, P IEEE INT C NEUR NE, V3, P1528
[4]
AN ADAPTIVE LEARNING ALGORITHM FOR PRINCIPAL COMPONENT ANALYSIS [J].
CHEN, LH ;
CHANG, SY .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1995, 6 (05) :1255-1263
[5]
Dimensionality reduction of unsupervised data [J].
Dash, M ;
Liu, H ;
Yao, L .
NINTH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1997, :532-539
[6]
Devijver P., 1982, PATTERN RECOGN
[7]
KAMBHATLA N, 1993, 1993 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, P1213, DOI 10.1109/ICNN.1993.298730
[8]
Kawatani T, 1998, INT C PATT RECOG, P1301, DOI 10.1109/ICPR.1998.711940
[9]
KAYLANI T, 1994, P IEEE INT C MAN CYB, V3, P2584
[10]
Kononenko I, 1994, EUR C MACH LEARN, P171, DOI 10.1007/3-540-57868-4_57