Training algorithm with incomplete data for feed-forward neural networks

Cited by: 46
Authors
Yoon, SY [1]
Lee, SY [1]
Affiliation
[1] Korea Adv Inst Sci & Technol, Dept Elect Engn, Computat & Neural Syst Lab, Taejon 305701, South Korea
Keywords
missing data; incomplete data; estimation; supervised learning
DOI
10.1023/A:1018772122605
CLC classification
TP18 [Artificial Intelligence Theory]
Discipline codes
081104 [Pattern Recognition and Intelligent Systems]; 0812 [Computer Science and Technology]; 0835 [Software Engineering]; 1405 [Intelligent Science and Technology]
Abstract
A new algorithm is developed to train feed-forward neural networks for non-linear input-to-output mappings from small, incomplete data sets with arbitrary distributions. The Training-EStimation-Training (TEST) algorithm consists of three steps: (1) training with the complete portion of the training data set, (2) estimating the missing attributes with the trained neural networks, and (3) re-training the neural networks with the whole data set. Error back-propagation is also applicable to estimating the missing attributes. Unlike other training methods for missing data, the algorithm does not assume a data distribution model, which may be inappropriate for small training sets. The TEST algorithm is first tested on the Iris benchmark data: by randomly removing some attributes from the complete data set and later estimating their values, the accuracy of the TEST algorithm is demonstrated. It is then applied to the Diabetes benchmark data, in which about 50% of the samples contain missing attributes. Compared with existing algorithms, the proposed TEST algorithm achieves much better recognition accuracy on test data.
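As a concrete illustration of the three steps, the sketch below implements the TEST idea for a toy one-hidden-layer tanh network in NumPy: the weights are first trained on the complete samples by back-propagation; the same error gradient is then propagated one layer further, to the inputs, with the weights frozen, so that only the missing attributes are adjusted; finally the network is re-trained on the completed data set. The network size, learning rates, iteration counts, and synthetic data are illustrative assumptions, not values from the paper.

import numpy as np

rng = np.random.default_rng(0)

# Tiny one-hidden-layer tanh network (illustrative, not the paper's architecture).
def init_net(n_in, n_hid, n_out):
    return {"W1": rng.normal(0.0, 0.5, (n_in, n_hid)), "b1": np.zeros(n_hid),
            "W2": rng.normal(0.0, 0.5, (n_hid, n_out)), "b2": np.zeros(n_out)}

def forward(net, x):
    h = np.tanh(x @ net["W1"] + net["b1"])
    return h, np.tanh(h @ net["W2"] + net["b2"])

def train(net, X, T, lr=0.05, epochs=300):
    # Steps (1) and (3): plain error back-propagation on the weights.
    for _ in range(epochs):
        for x, t in zip(X, T):
            h, y = forward(net, x)
            d2 = (y - t) * (1.0 - y ** 2)              # output-layer delta
            d1 = (d2 @ net["W2"].T) * (1.0 - h ** 2)   # hidden-layer delta
            net["W2"] -= lr * np.outer(h, d2)
            net["b2"] -= lr * d2
            net["W1"] -= lr * np.outer(x, d1)
            net["b1"] -= lr * d1

def estimate_missing(net, x, t, mask, lr=0.1, steps=200):
    # Step (2): freeze the weights and back-propagate the output error one
    # layer further, to the inputs; only masked (missing) attributes move.
    x = x.copy()
    x[mask] = 0.0                                      # neutral initial guess
    for _ in range(steps):
        h, y = forward(net, x)
        d2 = (y - t) * (1.0 - y ** 2)
        d1 = (d2 @ net["W2"].T) * (1.0 - h ** 2)
        x[mask] -= lr * (d1 @ net["W1"].T)[mask]       # gradient w.r.t. inputs
    return x

# Toy demonstration on synthetic data: first half complete, second half
# with roughly 20% of its attributes treated as missing.
X = rng.uniform(-1.0, 1.0, (60, 4))
T = np.tanh(X[:, :1] - X[:, 1:2] * X[:, 2:3])          # arbitrary smooth target
mask = rng.random(X.shape) < 0.2
mask[:30] = False

net = init_net(4, 8, 1)
train(net, X[:30], T[:30])                             # (1) complete portion only
X_est = np.array([estimate_missing(net, x, t, m)
                  for x, t, m in zip(X[30:], T[30:], mask[30:])])  # (2)
train(net, np.vstack([X[:30], X_est]), T)              # (3) whole data set

On the real benchmarks the paper reports, steps (2) and (3) would iterate over the incomplete Iris or Diabetes samples in the same way, rather than the synthetic mask used here.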
Pages: 171-179
Page count: 9