Optimal linear combinations of neural networks

被引:319
作者
Hashem, S [1 ]
机构
[1] PACIFIC NW LAB, RICHLAND, WA USA
关键词
optimal linear combination; model selection; function approximation; collinearity; robust estimation; mixture of experts;
D O I
10.1016/S0893-6080(96)00098-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural network-based modeling often involves trying multiple networks with different architectures and training parameters in order to achieve acceptable model accuracy. Typically, one of the trained networks is chosen as best, while the rest are discarded. Hashem and Schmeiser (1995) proposed using optimal linear combinations of a number of trained neural networks instead of using a single best network. Combining the trained networks may help integrate the knowledge acquired by the components networks and thus improve model accuracy. In this paper, we extend the idea of optimal linear combinations (OLCs) of neural networks and discuss issues related to the generalization ability of the combined model. We then present two algorithms for selecting the component networks for the combination to improve the generalization ability of OLCs. Our experimental results demonstrate significant improvements in model accuracy, as a result of using OLCs, compared to using the apparent best network. (C) 1997 Elsevier Science Ltd.
引用
收藏
页码:599 / 614
页数:16
相关论文
共 57 条
[1]  
ALPAYDIN E, 1993, 1993 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, P9, DOI 10.1109/ICNN.1993.298539
[2]   DEMOCRACY IN NEURAL NETS - VOTING SCHEMES FOR CLASSIFICATION [J].
BATTITI, R ;
COLLA, AM .
NEURAL NETWORKS, 1994, 7 (04) :691-707
[4]  
Belsley D.A., 1991, Conditioning Diagnostics: Collinearity and Weak Data in Regression
[5]  
Belsley DA, 1980, Regression diagnostics: Identifying influential data and sources of collinearity
[6]  
BENEDIKTSSON JA, 1993, 1993 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, P27, DOI 10.1109/ICNN.1993.298536
[7]  
BREIMAN L, 1994, 367 U CAL DEP STAT
[8]   FORECASTING WITH MORE THAN ONE MODEL [J].
BUNN, D .
JOURNAL OF FORECASTING, 1989, 8 (03) :161-166
[9]   CONSTRAINED TOPOLOGICAL MAPPING FOR NONPARAMETRIC REGRESSION-ANALYSIS [J].
CHERKASSKY, V ;
LARINAJAFI, H .
NEURAL NETWORKS, 1991, 4 (01) :27-40
[10]  
CHERKASSKY V, 1995, P WORLD C NEUR NETW, V2, P917