CLASS-DEPENDENT DISCRETIZATION FOR INDUCTIVE LEARNING FROM CONTINUOUS AND MIXED-MODE DATA

被引:156
作者
CHING, JY
WONG, AKC
CHAN, KCC
机构
[1] HONG KONG POLYTECH UNIV,DEPT COMP,KOWLOON,HONG KONG
[2] UNIV WESTERN ONTARIO,LONDON,ON N6A 3K7,CANADA
关键词
INDUCTIVE LEARNING; CLASSIFICATION; DISCRETIZATION; CONTINUOUS ATTRIBUTES; MIXED-MODE ATTRIBUTES; MAXIMUM ENTROPY; MUTUAL INFORMATION; UNCERTAINTY;
D O I
10.1109/34.391407
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Inductive learning systems can be effectively used to acquire classification knowledge from examples. Many existing symbolic learning algorithms can be applied in domains with continuous attributes when integrated with a discretization algorithm to transform the continuous attributes into ordered discrete ones. In this paper, a new information theoretic discretization method optimized for supervised learning is proposed and described. This approach seeks to maximize the mutual dependence as measured by the interdependence redundancy between the discrete intervals and the class labels, and can automatically determine the most preferred number of intervals for an inductive learning application. The method has been tested in a number of inductive learning examples to show that the class-dependent discretizer can significantly improve the classification performance of many existing learning algorithms in domains containing numeric attributes.
引用
收藏
页码:641 / 651
页数:11
相关论文
共 26 条
[1]  
BRATKO I, 1987, INTERACTIONS ARTIFIC
[2]  
Brieman L., 1984, CLASSIFICATION REGRE
[3]   AN IMPROVED RULE GENERATION METHOD FOR EVIDENCE-BASED CLASSIFICATION SYSTEMS [J].
CAELLI, T ;
PENNINGTON, A .
PATTERN RECOGNITION, 1993, 26 (05) :733-740
[4]  
Catlett J., 1991, P EUR WORK SESS LEAR, P164
[5]  
Chan K. C. C., 1991, Knowledge discovery in databases, P107
[6]  
CHAN KCC, 1990, COMPUTATIONAL INTELL, V6
[7]  
CHAN KCC, 1992, 5TH P IEEE COMP BAS
[8]  
CHAN KCC, 1992, P C ARTIFICIAL INTEL
[9]  
CHING JY, 1992, THESIS U WATERLOO CA
[10]   ON THE HANDLING OF CONTINUOUS-VALUED ATTRIBUTES IN DECISION TREE GENERATION [J].
FAYYAD, UM ;
IRANI, KB .
MACHINE LEARNING, 1992, 8 (01) :87-102