Generalisation performance of artificial neural networks for near infrared spectral analysis

被引:15
作者
Wang, W. [1 ]
Paliwal, J. [1 ]
机构
[1] Univ Manitoba, Dept Biosyst Engn, Winnipeg, MB R3T 5V6, Canada
关键词
D O I
10.1016/j.biosystemseng.2006.02.001
中图分类号
S2 [农业工程];
学科分类号
0828 ;
摘要
Generalisation performance of artificial neural networks (ANNs) is very important when a trained network analyses unseen data. It is associated with factors such as, representativeness and number of training samples, model structure and complexity, training procedures, and appropriate data representation. It is, however, difficult to set common benchmarks to assess the generalisation capabilities of different types of networks. This paper discusses the methods to improve generalisation of multi-layer perceptron (MLP) ANNs and provide experimental proof using near-infrared spectral data. Near-infrared spectra of wheat kernels infested with rice weevil (Sitophilus oryzae) at II infestation levels were collected and MLP networks were trained to quantitatively determine the insect infestation levels. Spectral data were pre-processed with principal component analysis (PCA) to reduce the input dimensionality and outliers were successfully detected using Hotelling T-2 and Q statistics. Optimal network complexity was selected by evaluating generalisation performance of neural network using the Schwarz's Bayesian criteria, Akaike's information criterion, and root mean squared errors of cross-validation (RMSECV) derived by 10-fold cross-validation. Model order assessed by RMSECV provided most economic network complexity. Stacked regression and network committees were shown to overcome the drawbacks of winner-takes-all strategy and gave prediction performance on test set with a lowest root mean squared error of prediction (RMSEP) of 3(.)5% and coefficient of determination r(2) >= 0.9. Prediction performance on average spectra had a lowest RMSEP of 1(.)2% with higher r(2) >= 0.97, and prediction performance for low infestation levels (<= 10%) had a lowest RMSEP of 0(.)4%. (c) 2006 IAgrE. All rights reserved Published by Elsevier Ltd.
引用
收藏
页码:7 / 18
页数:12
相关论文
共 33 条
[11]  
Breiman L, 1996, MACH LEARN, V24, P49
[12]   Evaluation of neural-network classifiers for weed species discrimination [J].
Burks, TF ;
Shearer, SA ;
Heath, JR ;
Donohue, KD .
BIOSYSTEMS ENGINEERING, 2005, 91 (03) :293-304
[13]  
CHERKASSKY V, 1994, SELECTING MODELS DAT
[14]  
Davies T., 1998, Analusis Magazine, V26, P17
[15]   Protein content of single kernels of wheat by near-infrared reflectance spectroscopy [J].
Delwiche, SR .
JOURNAL OF CEREAL SCIENCE, 1998, 27 (03) :241-254
[16]   Automated nondestructive detection of internal insect infestation of wheat kernels by using near-infrared reflectance spectroscopy [J].
Dowell, FE ;
Throne, JE ;
Baker, JE .
JOURNAL OF ECONOMIC ENTOMOLOGY, 1998, 91 (04) :899-904
[17]  
EFRON B, 1973, J ROY STAT SOC B MET, V35, P379
[18]   LINEARIZATION AND SCATTER-CORRECTION FOR NEAR-INFRARED REFLECTANCE SPECTRA OF MEAT [J].
GELADI, P ;
MACDOUGALL, D ;
MARTENS, H .
APPLIED SPECTROSCOPY, 1985, 39 (03) :491-500
[19]   The generalization of Student's ratio [J].
Hotelling, H .
ANNALS OF MATHEMATICAL STATISTICS, 1931, 2 :360-378
[20]  
Jollife I., 1986, Principal Component Analysis