Building classification trees using the total uncertainty criterion

被引:118
作者
Abellán, J [1 ]
Moral, S [1 ]
机构
[1] Univ Granada, ETSI Informat, Dept Ciencias Computac & Inteligencia Artificial, E-18071 Granada, Spain
关键词
D O I
10.1002/int.10143
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an application of the measure of total uncertainty on convex sets of probability distributions, also called credal sets, to the construction of classification trees. In these classification trees the probabilities of the classes in each one of its leaves is estimated by using the imprecise Dirichlet model. In this way, smaller samples give rise to wider probability intervals. Branching a classification tree can decrease the entropy associated with the classes but, at the same time, as the sample is divided among the branches the nonspecificity increases. We use a total uncertainty measure (entropy + nonspecificity) as branching criterion. The stopping rule is not to increase the total uncertainty. The good behavior of this procedure for the standard classification problems is shown. It is important to remark that it does not experience of overfitting, with similar results in the training and test samples. (C) 2003 Wiley Periodicals, Inc.
引用
收藏
页码:1215 / 1225
页数:11
相关论文
共 25 条
[1]   Completing a total uncertainty measure in the Dempster-Shafer Theory [J].
Abellán, J ;
Moral, S .
INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 1999, 28 (4-5) :299-314
[2]   A non-specificity measure for convex sets of probability distributions [J].
Abellan, J ;
Moral, S .
INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2000, 8 (03) :357-367
[3]  
ABELLAN J, 2003, IN PRESS INT J UNCER
[4]  
ACID S, 1999, THESIS U GRANADA
[5]  
[Anonymous], 1993, C4 5 PROGRAMS MACHIN
[6]  
[Anonymous], [No title captured]
[7]  
[Anonymous], 1998, UNCERTAINTY BASED IN
[8]  
Breiman L., 1984, BIOMETRICS, DOI DOI 10.2307/2530946
[9]  
Choquet G., 1954, ANN I FOURIER GRENOB, V5, P131, DOI [10.5802/aif.53, DOI 10.5802/AIF.53]
[10]   UPPER AND LOWER PROBABILITIES INDUCED BY A MULTIVALUED MAPPING [J].
DEMPSTER, AP .
ANNALS OF MATHEMATICAL STATISTICS, 1967, 38 (02) :325-&