Minimal approximate hitting sets and rule templates

被引:107
作者
Vinterbo, S [1 ]
Ohrn, A [1 ]
机构
[1] Norwegian Univ Sci & Technol, Dept Informat & Comp Sci, Knowledge Syst Grp, N-7491 Trondheim, Norway
关键词
hitting set; genetic algorithm; classification; rough sets; reducts;
D O I
10.1016/S0888-613X(00)00051-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A set S that has a non-empty intersection with every set in a collection of sets C is called a hitting set of C. If no elements can be removed from S without violating the hitting set property, then we say that S is minimal. Several interesting problems can in part be formulated as that of having to find one or more minimal hitting sets. Many of these problems require proper solutions, but sometimes approximate solutions suffice. We define an r-approximate hitting set as a set that intersects at least a fraction r of the sets in C. This notion is extended to the case, where C is a weighted multiset, and properties of I are explored with respect to simplification of C by absorption of supersets, Also, approximations of reducts from rough set theory are defined by means of minimal r-approximate hitting sets, and some links to the notion of dynamic reducts are established. The most common use of reducts are as templates for the generation of minimal classification rules from empirical data. A genetic algorithm is devised to compute r-approximate hitting sets, and applied to induce classifiers from selected real-world databases. These classifiers are then compared to those generated from proper reducts and dynamic reducts, respectively. The improvement in discriminatory ability yielded by the r-approximate reduct based classifiers was statistically significant (p < 0.05)., Furthermore, they were smaller, i.e., had fewer rules. (C) 2000 Elsevier Science Inc. All rights reserved.
引用
收藏
页码:123 / 143
页数:21
相关论文
共 28 条
[1]   Combined 5 x 2 cv F test for comparing supervised classification learning algorithms [J].
Alpaydin, E .
NEURAL COMPUTATION, 1999, 11 (08) :1885-1892
[2]  
ANDREEV AE, 1996, LECT NOTES COMPUTER, P357
[3]  
[Anonymous], 1995, P INT WORKSHOP ROUGH
[4]  
[Anonymous], UCI Repository of Machine Learning Databases
[5]   STRUCTURE PRESERVING REDUCTIONS AMONG CONVEX-OPTIMIZATION PROBLEMS [J].
AUSIELLO, G ;
DATRI, A ;
PROTASI, M .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1980, 21 (01) :136-153
[6]  
BAZAN J, 1994, LECT NOTES ARTIF INT, V869, P346
[7]  
Bazan Y., 1998, ROUGH SETS KNOWLEDGE, P321
[8]  
Brown F.M., 1990, BOOLEAN REASONING
[9]   Approximate statistical tests for comparing supervised classification learning algorithms [J].
Dietterich, TG .
NEURAL COMPUTATION, 1998, 10 (07) :1895-1923
[10]   Prediction of outcome in critically ill patients using artificial neural network synthesised by genetic algorithm [J].
Dybowski, R ;
Weller, P ;
Chang, R ;
Gant, V .
LANCET, 1996, 347 (9009) :1146-1150