IMPROVED ESTIMATES FOR THE ACCURACY OF SMALL DISJUNCTS

被引：31

作者：

QUINLAN, JR

机构：

[1] Basser Department of Computer Science, University of Sydney, Sydney

来源：

MACHINE LEARNING | 1991年 / 6卷 / 01期

关键词：

DISJUNCTIVE CONCEPTS; EMPIRICAL LEARNING; ESTIMATION;

D O I：

10.1023/A:1022646118217

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];

摘要：

Learning systems often describe a target class as a disjunction of conjunctions of conditions. Recent work has noted that small disjuncts, i.e., those supported by few training examples, typically have poor predictive accuracy. One model of this accuracy is provided by the Bayes-Laplace formula based on the number of training examples covered by the disjunct and the number of them belonging to the target class. However, experiments show that small disjunts associated with target classes of different relative frequencies tend to have different error rates. This note defines the context of a disjunct as the set of training examples that fail to satisfy at most one of its conditions. An empirical adaptation of the Bayes-Laplace formula is presented that also makes use of the relative frequency of the target class in this context. Trials are reported comparing the performance of the original formula and the adaptation in six learning tasks.

引用

页码：93 / 98

页数：6