AUTOMATED EXTRACTION OF ATTRIBUTE HIERARCHIES FOR AN IMPROVED DECISION-TREE CLASSIFIER

被引:4
作者
NAKASUKA, S
KOISHI, T
机构
[1] Nakasuka, Shinichi
[2] Koishi, Takazo
关键词
DECISION TREE CLASSIFIERS; CONCEPTUAL DISTANCE; GENERALIZATION; KNOWLEDGE ACQUISITION; KNOWLEDGE-BASES; FAULT DIAGNOSIS;
D O I
10.1016/0952-1976(95)00022-S
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A method to automatically extract inherent hierarchies of attribute values from data is proposed to improve the performance of a decision-tree classifier. When attributes used for a decision tree take continuous values, simple decision rules such as ''IF a certain attribute is less than a certain value'' yield good results in many cases. The rationale of this type of rule is the natural tendency that two nearby attribute values have a ''similar meaning'' in the sense that they suggest the same class with high probability. When only discrete-type, unordered attributes are available for tree generation, however, such efficient decision rules are hard to obtain because ''distance relationships'' between the attribute values are seldom known beforehand. In order to solve this problem, the proposed method estimates from the training data set a conceptual distance between each pair of attribute values, and by iteratively grouping two attribute values with minimum distance, generates hierarchies of attribute values, which are then utilized for making the decision tree. This method is applied to the task of fault diagnosis of a certain printed circuit board, and it is indicated that the generated attribute hierarchies can reduce the size of a decision tree sufficiently, which results in a significant improvement of its classification accuracy.
引用
收藏
页码:391 / 399
页数:9
相关论文
共 15 条
[1]   AN AUTOMATED APPROACH TO THE DESIGN OF DECISION TREE CLASSIFIERS [J].
ARGENTIERO, P ;
CHIN, R ;
BEAUDET, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1982, 4 (01) :51-57
[2]   BAYESIAN AND DECISION TREE APPROACHES FOR PATTERN-RECOGNITION INCLUDING FEATURE MEASUREMENT COSTS [J].
DATTATREYA, GR ;
SARMA, VVS .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1981, 3 (03) :293-298
[3]   APPLICATION OF A MULTILAYER DECISION TREE IN COMPUTER RECOGNITION OF CHINESE-CHARACTERS [J].
GU, YX ;
WANG, QR ;
SUEN, CY .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1983, 5 (01) :83-89
[4]   LEARNING BASED ON CONCEPTUAL DISTANCE [J].
KODRATOFF, Y ;
TECUCI, G .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1988, 10 (06) :897-909
[5]  
Mitchell T. M., 1983, MACHINE LEARNING ART, P163
[6]   DYNAMIC SCHEDULING SYSTEM UTILIZING MACHINE LEARNING AS A KNOWLEDGE ACQUISITION TOOL [J].
NAKASUKA, S ;
YOSHIDA, T .
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 1992, 30 (02) :411-431
[7]  
NUIK JK, 1980, IEEE T PATTERN ANAL, V2, P429
[8]  
Quinlan J. R., 1986, Machine Learning, V1, P81, DOI 10.1023/A:1022643204877
[9]  
Quinlan J. R., 1990, MACH LEARN, P140
[10]  
Quinlan J.R., 1986, MACH LEARN ARTIF INT, V2, P149