Pairwise feature evaluation for constructing reduced representations

被引:18
作者
Harol, Artsiom [1 ]
Lai, Carmen
Pezkalska, Elzbieta
Duin, Robert P. W.
机构
[1] Delft Univ Technol, Fac Elect Engn Math & Comp Sci, Informat & Commun Theory Grp, Delft, Netherlands
[2] Univ Manchester, Sch Comp Sci, Manchester, Lancs, England
关键词
feature selection; prototype selection; pairwise feature evaluation; pattern classification;
D O I
10.1007/s10044-006-0050-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection methods are often used to determine a small set of informative features that guarantee good classification results. Such procedures usually consist of two components: a separability criterion and a selection strategy. The most basic choices for the latter are individual ranking, forward search and backward search. Many intermediate methods such as floating search are also available. The forward as well as backward selection may cause lossy evaluation of the criterion and/or overtraining of the final classifier in case of high-dimensional spaces and small sample size problems. Backward selection may also become computationally prohibitive. Individual ranking, on the other hand, suffers as it neglects dependencies between features. A new strategy based on a pairwise evaluation has recently been proposed by Bo and Jonassen (Genome Biol 3, 2002) and Pekalska et al. (International Conference on Computer Recognition Systems, Poland, pp 271-278, 2005). Since it considers interactions between features, but always restricted to two-dimensional spaces, it may circumvent the small sample size problem. In this paper, we evaluate this idea in a more general framework for the selection of features as well as prototypes. Our finding is that such a pairwise selection may improve over traditional procedures and we present some artificial and real-world examples to support this claim. Additionally, we have also discovered that the set of problems for which the pairwise selection may be effective is small.
引用
收藏
页码:55 / 68
页数:14
相关论文
共 32 条
[1]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[2]  
[Anonymous], 1996, TEXTURES PHOTOGRAPHI
[3]  
[Anonymous], THESIS U WAIKATO
[4]   Information distance [J].
Bennett, CH ;
Gacs, P ;
Li, M ;
Vitanyi, FMB ;
Zurek, WH .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1998, 44 (04) :1407-1423
[5]  
Bo TH, 2002, GENOME BIOL, V3
[6]  
Bunke H., 1990, SYNTACTIC STRUCTURAL
[7]   POSSIBLE ORDERINGS IN MEASUREMENT SELECTION PROBLEM [J].
COVER, TM ;
VANCAMPENHOUT, JM .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1977, 7 (09) :657-661
[8]  
Das S., 2001, P 18 INT C MACHINE L, P74, DOI DOI 10.5555/645530.658297
[9]  
DUBUISSON MP, 1994, INT C PATT RECOG, P566, DOI 10.1109/ICPR.1994.576361
[10]  
Duda RO, 2006, PATTERN CLASSIFICATI