Classifier-independent feature selection on the basis of divergence criterion

被引:11
作者
Abe, Naoto [1 ]
Kudo, Mineichi
Toyama, Jun
Shimbo, Masaru
机构
[1] Hokkaido Univ, Div Comp Sci, Grad Sch Informat Sci & Technol, Sapporo, Hokkaido 0600814, Japan
[2] Hokkaido Informat Univ, Fac Informat Media, Ebetsu, Hokkaido 0698585, Japan
关键词
classifier-independent feature selection; Bayes classifier; Gaussian mixture; garbage feature; J-divergence; two-stage feature selection;
D O I
10.1007/s10044-006-0030-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection aims to choose a feature subset that has the most discriminative information from the original feature set. In practical cases, it is preferable to select a feature subset that is universally effective for any kind of classifier because there is no underlying information about a given dataset. Such a trial is called classifier-independent feature selection. We took notice of Novovicova et al.'s study as a classifier-independent feature selection method. However, the number of features have to be selected beforehand in their method. It is more desirable to determine a feature subset size automatically so as to remove only garbage features. In this study, we propose a divergence criterion on the basis of Novovicova et al.'s method.
引用
收藏
页码:127 / 137
页数:11
相关论文
共 34 条
[1]  
ALMUALLIM H, 1991, PROCEEDINGS : NINTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, P547
[2]  
[Anonymous], [No title captured]
[3]   Selection of relevant features and examples in machine learning [J].
Blum, AL ;
Langley, P .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :245-271
[4]   SOME ASPECTS OF ERROR-BOUNDS IN FEATURE-SELECTION [J].
BOEKEE, DE ;
VANDERLUBBE, JCA .
PATTERN RECOGNITION, 1979, 11 (5-6) :353-360
[5]   SVMTorch: Support vector machines for large-scale regression problems [J].
Collobert, R ;
Bengio, S .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) :143-160
[6]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[7]  
FERRI FJ, 1994, MACH INTELL PATT REC, V16, P403
[8]  
FRIEDMAN JH, 1977, IEEE T COMPUT, V26, P404, DOI 10.1109/TC.1977.1674849
[9]  
HALL M, 2000, P 17 INT C MACH LEAR, P259
[10]  
Ho TK, 2002, IEEE T PATTERN ANAL, V24, P289, DOI 10.1109/34.990132