A Two-Phase Model for Learning Rules from Incomplete Data

Cited by: 7
Authors
Li, Huaxiong [1]
Yao, Yiyu [2]
Zhou, Xianzhong [1, 4]
Huang, Bing [3]
Affiliations
[1] Nanjing Univ, Sch Management & Engn, Nanjing 210008, Peoples R China
[2] Univ Regina, Dept Comp Sci, Regina, SK S4S 0A2, Canada
[3] Nanjing Audit Univ, Sch Informat Sci, Nanjing, Peoples R China
[4] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210008, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
missing attribute values; filled-in values; two-phase rule induction; MISSING VALUES;
DOI
10.3233/FI-2009-127
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
A two-phase learning strategy for rule induction from incomplete data is proposed, and a new form of rules is introduced so that a user can easily identify attributes with or without missing values in a rule. Two levels of measurement are assigned to a rule. An algorithm for two-phase rule induction is presented. Instead of filling in missing attribute values before or during the process of rule induction, we divide rule induction into two phases. In the first phase, rules and partial rules are induced based on non-missing values. In the second phase, partial rules are modified and refined by the imputation of some missing values. Such rules truthfully reflect the knowledge embedded in the incomplete data. The study not only presents a new view of rule induction from incomplete data, but also provides a practical solution. Experiments validate the effectiveness of the proposed method.
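The two phases described in the abstract can be illustrated with a small sketch. It is not the authors' algorithm: the single-condition rule format, the `partial` flag, the most-common-value imputation, and all function and field names (`phase_one`, `phase_two`, `filled_support`, `filled_confidence`) are assumptions chosen for illustration, and the toy data is invented.

```python
from collections import Counter

MISSING = None  # assumed marker for a missing attribute value


def phase_one(rows, cond_attr, decision_attr):
    """Phase 1 (sketch): induce one-condition rules from observed values only."""
    observed = [r for r in rows if r[cond_attr] is not MISSING]
    n_missing = len(rows) - len(observed)
    rules = []
    for value in sorted({r[cond_attr] for r in observed}):
        matched = [r for r in observed if r[cond_attr] == value]
        decision, hits = Counter(r[decision_attr] for r in matched).most_common(1)[0]
        rules.append({
            "if": (cond_attr, value),
            "then": (decision_attr, decision),
            "support": len(matched),
            "confidence": hits / len(matched),
            # Assumption: a rule is "partial" if its condition attribute
            # has missing values somewhere in the data.
            "partial": n_missing > 0,
        })
    return rules


def phase_two(rows, rules, cond_attr, decision_attr):
    """Phase 2 (sketch): re-measure partial rules after imputing missing values."""
    observed_values = [r[cond_attr] for r in rows if r[cond_attr] is not MISSING]
    # Assumption: impute with the most common observed value.
    fill = Counter(observed_values).most_common(1)[0][0]
    filled = [
        {**r, cond_attr: fill} if r[cond_attr] is MISSING else r
        for r in rows
    ]
    refined = []
    for rule in rules:
        if not rule["partial"]:
            refined.append(rule)
            continue
        _, value = rule["if"]
        _, decision = rule["then"]
        matched = [r for r in filled if r[cond_attr] == value]
        hits = sum(r[decision_attr] == decision for r in matched)
        # Keep the phase-1 measures and add measures on the filled-in data.
        refined.append({
            **rule,
            "filled_support": len(matched),
            "filled_confidence": hits / len(matched),
        })
    return refined


# Invented toy data: two rows have a missing "outlook" value.
rows = [
    {"outlook": "sunny", "play": "no"},
    {"outlook": "sunny", "play": "no"},
    {"outlook": "rain", "play": "yes"},
    {"outlook": MISSING, "play": "yes"},
    {"outlook": MISSING, "play": "no"},
]

rules = phase_one(rows, "outlook", "play")
refined = phase_two(rows, rules, "outlook", "play")
for rule in refined:
    print(rule)
```

In this toy run the rule "outlook = sunny implies play = no" has confidence 1.0 on the observed rows, and its confidence on the filled-in data drops to 0.75. Reporting both numbers side by side shows how much of a rule's apparent strength depends on imputed values.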
Pages: 219-232
Number of pages: 14