Empirical comparison of various methods for training feed-forward neural networks for salinity forecasting

Cited by: 46

Authors:

Maier, HR [1]

Dandy, GC [1]

Affiliations:

[1] Univ Adelaide, Dept Civil & Environm Engn, Adelaide, SA 5005, Australia

Source:

WATER RESOURCES RESEARCH | 1999 / Vol. 35 / No. 08

Keywords:

DOI:

10.1029/1999WR900150

Chinese Library Classification (CLC) number:

X [Environmental science, safety science];

Discipline classification code:

08; 0830

Abstract:

Feed-forward artificial neural networks (ANNs) are being used increasingly to model water resources variables. In this technical note, six methods for optimizing the connection weights of feed-forward ANNs are investigated in terms of generalization ability, parsimony, and training speed. These include the generalized delta (GD) rule, the normalized cumulative delta (NCD) rule, the delta-bar-delta (DBD) algorithm, the extended-delta-bar-delta (EDBD) algorithm, the QuickProp (QP) algorithm, and the MaxProp (MP) algorithm. Each of these algorithms is applied to a particular case study, the forecasting of salinity in the River Murray at Murray Bridge, South Australia. Thirty models are developed for each algorithm, starting from different positions in weight space. The results obtained indicate that the generalization ability of the first-order methods investigated (i.e., GD, NCD, DBD, and EDBD) is better than that of the second-order algorithms (i.e., QP and MP). When the prediction errors are averaged over the 30 trials carried out, the performance of the first-order methods in which the size of the steps taken in weight space is automatically adjusted in response to changes in the error surface (i.e., DBD and EDBD) is better than that obtained when predetermined step sizes are used (i.e., GD and NCD). However, the reverse applies when the best forecasts of the 30 trials are considered. The results obtained indicate that the EDBD algorithm is the most parsimonious and the MP algorithm is the least parsimonious. It was found that any impact different learning rules have on training speed is masked by the effect of epoch size and the number of hidden nodes required for optimal model performance.
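The contrast the abstract draws between predetermined step sizes (GD) and step sizes adjusted automatically in response to the error surface (DBD) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the parameter names `kappa`, `phi`, and `theta`, their default values, and the toy quadratic error surface are all assumptions chosen for illustration, and the DBD update follows the commonly described form of the rule rather than any code from the paper.

```python
import numpy as np


def gd_step(w, grad, velocity, lr=0.05, momentum=0.0):
    """Fixed-step update (generalized delta rule with optional momentum)."""
    velocity = momentum * velocity - lr * grad
    return w + velocity, velocity


def dbd_step(w, grad, lr, bar, kappa=0.01, phi=0.1, theta=0.7):
    """Delta-bar-delta style update with one learning rate per weight.

    `bar` is an exponential average of past gradients. Where it agrees in
    sign with the current gradient, the rate grows additively by `kappa`;
    where it disagrees, the rate shrinks by the factor (1 - phi).
    """
    agree = bar * grad
    lr = np.where(agree > 0, lr + kappa,
                  np.where(agree < 0, lr * (1.0 - phi), lr))
    w = w - lr * grad
    bar = (1.0 - theta) * grad + theta * bar
    return w, lr, bar


def run_demo(steps=100):
    """Compare both rules on a toy surface E(w) = 0.5 * sum(c * w**2)."""
    c = np.array([1.0, 10.0])  # hypothetical curvatures, not salinity data

    def error(w):
        return 0.5 * float(np.sum(c * w ** 2))

    w_gd, vel = np.array([1.0, 1.0]), np.zeros(2)
    w_dbd = np.array([1.0, 1.0])
    lr, bar = np.full(2, 0.05), np.zeros(2)
    for _ in range(steps):
        w_gd, vel = gd_step(w_gd, c * w_gd, vel)
        w_dbd, lr, bar = dbd_step(w_dbd, c * w_dbd, lr, bar)
    return error(w_gd), error(w_dbd)


if __name__ == "__main__":
    e_gd, e_dbd = run_demo()
    print(f"final error, fixed step: {e_gd:.3e}")
    print(f"final error, adaptive step: {e_dbd:.3e}")
```

On a real network the gradients would come from backpropagation over the training data rather than from a closed-form surface, and the relative performance of the two rules may differ from this toy case, as the abstract's mixed results suggest.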


Pages: 2591-2596

Number of pages: 6

