CLASS-DEPENDENT DISCRETIZATION FOR INDUCTIVE LEARNING FROM CONTINUOUS AND MIXED-MODE DATA

被引:156
作者
CHING, JY
WONG, AKC
CHAN, KCC
机构
[1] HONG KONG POLYTECH UNIV,DEPT COMP,KOWLOON,HONG KONG
[2] UNIV WESTERN ONTARIO,LONDON,ON N6A 3K7,CANADA
关键词
INDUCTIVE LEARNING; CLASSIFICATION; DISCRETIZATION; CONTINUOUS ATTRIBUTES; MIXED-MODE ATTRIBUTES; MAXIMUM ENTROPY; MUTUAL INFORMATION; UNCERTAINTY;
D O I
10.1109/34.391407
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Inductive learning systems can be effectively used to acquire classification knowledge from examples. Many existing symbolic learning algorithms can be applied in domains with continuous attributes when integrated with a discretization algorithm to transform the continuous attributes into ordered discrete ones. In this paper, a new information theoretic discretization method optimized for supervised learning is proposed and described. This approach seeks to maximize the mutual dependence as measured by the interdependence redundancy between the discrete intervals and the class labels, and can automatically determine the most preferred number of intervals for an inductive learning application. The method has been tested in a number of inductive learning examples to show that the class-dependent discretizer can significantly improve the classification performance of many existing learning algorithms in domains containing numeric attributes.
引用
收藏
页码:641 / 651
页数:11
相关论文
共 26 条
[21]  
Quinlan R., 1987, APPL EXPERT SYSTEMS, P157
[22]   ATTEMPT TO DEFINE THE NATURE OF CHEMICAL DIABETES USING A MULTIDIMENSIONAL-ANALYSIS [J].
REAVEN, GM ;
MILLER, RG .
DIABETOLOGIA, 1979, 16 (01) :17-24
[23]  
STASHUK DW, 1992, IEEE T BIOMEDICA JUN
[24]   AN EVENT-COVERING METHOD FOR EFFECTIVE PROBABILISTIC INFERENCE [J].
WONG, AKC ;
CHIU, DKY .
PATTERN RECOGNITION, 1987, 20 (02) :245-255
[25]   TYPICALITY, DIVERSITY, AND FEATURE PATTERN OF AN ENSEMBLE [J].
WONG, AKC ;
LIU, TS .
IEEE TRANSACTIONS ON COMPUTERS, 1975, C 24 (02) :158-181
[26]   SYNTHESIZING STATISTICAL KNOWLEDGE FROM INCOMPLETE MIXED-MODE DATA [J].
WONG, AKC ;
CHIU, DKY .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1987, 9 (06) :796-805