Supervised feature selection by clustering using conditional mutual information-based distances

被引:163
作者
Martinez Sotoca, Jose [1 ]
Pla, Filiberto [1 ]
机构
[1] Univ Jaume 1, Inst New Imaging Technol, Dept Llenguatges & Sistemes Informat, Castellon de La Plana 12071, Spain
关键词
Supervised feature selection; Clustering; Conditional mutual information; INPUT FEATURE-SELECTION;
D O I
10.1016/j.patcog.2009.12.013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a supervised feature selection approach is presented, which is based on metric applied on continuous and discrete data representations. This method builds a dissimilarity space using information theoretic measures, in particular conditional mutual information between features with respect to a relevant variable that represents the class labels. Applying a hierarchical clustering, the algorithm searches for a compression of the information contained in the original set of features. The proposed technique is compared with other state of art methods also based on information measures. Eventually, several experiments are presented to show the effectiveness of the features selected from the point of view of classification accuracy. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2068 / 2081
页数:14
相关论文
共 30 条
[1]  
[Anonymous], 1993, 31 ANN M ASS COMPUTA, DOI [10.3115/981574.981598, DOI 10.3115/981574.981598]
[2]  
[Anonymous], 1984, OLSHEN STONE CLASSIF, DOI 10.2307/2530946
[3]  
[Anonymous], 2006, Series Studies in Fuzziness and Soft Computing
[4]  
[Anonymous], 2002, A first course in information theory
[5]  
BAKER LD, 1998, 21 ANN INT ACM SIGIR, P96
[6]   USING MUTUAL INFORMATION FOR SELECTING FEATURES IN SUPERVISED NEURAL-NET LEARNING [J].
BATTITI, R .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (04) :537-550
[7]   Selection of relevant features and examples in machine learning [J].
Blum, AL ;
Langley, P .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :245-271
[8]  
Christianini N., 2000, INTRO SUPPORT VECTOR, P189
[9]  
Cover T.M., 2006, ELEMENTS INFORM THEO, V2nd ed
[10]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+