A grey-based nearest neighbor approach for missing attribute value prediction

被引:41
作者
Huang, CC [1 ]
Lee, HM
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Elect Engn, Taipei 106, Taiwan
[2] Natl Taiwan Univ Sci & Technol, Dept Comp Sci & Informat Engn, Taipei 106, Taiwan
关键词
missing attribute values; grey-based nearest neighbor approach; grey relational analysis; the nearest neighbor concept;
D O I
10.1023/B:APIN.0000021416.41043.0f
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a grey-based nearest neighbor approach to predict accurately missing attribute values. First, grey relational analysis is employed to determine the nearest neighbors of an instance with missing attribute values. Accordingly, the known attribute values derived from these nearest neighbors are used to infer those missing values. Two datasets were used to demonstrate the performance of the proposed method. Experimental results show that our method outperforms both multiple imputation and mean substitution. Moreover, the proposed method was evaluated using five classification problems with incomplete data. Experimental results indicate that the accuracy of classification is maintained or even increased when the proposed method is applied for missing attribute value prediction.
引用
收藏
页码:239 / 252
页数:14
相关论文
共 35 条
[1]  
AHA DW, 1991, MACH LEARN, V6, P37, DOI 10.1007/BF00153759
[2]  
[Anonymous], P 16 INT C MACH LEAR
[3]   A constraint-based approach to shape management in multimedia databases [J].
Bertino, E ;
Catania, B .
MULTIMEDIA SYSTEMS, 1998, 6 (01) :2-16
[4]  
Blake C.L., 1998, UCI repository of machine learning databases
[5]  
Buntine W. L., 1991, Complex Systems, V5, P603
[6]  
Cestnik B, 1987, Progress in Machine Learning, P31
[7]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[8]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[9]  
Deng Julong, 1989, Journal of Grey Systems, V1, P1
[10]  
Deng Julong, 1989, Journal of Grey Systems, V1, P103