Optimal division of data for neural network models in water resources applications

被引:228
作者
Bowden, GJ [1 ]
Maier, HR [1 ]
Dandy, GC [1 ]
机构
[1] Univ Adelaide, Dept Civil & Environm Engn, Ctr Appl Modelling Water Engn, Adelaide, SA, Australia
关键词
artificial neural network; data division; self-organizing map; genetic algorithm; forecasting; salinity model;
D O I
10.1029/2001WR000266
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The way that available data are divided into training, testing, and validation subsets can have a significant influence on the performance of an artificial neural network (ANN). Despite numerous studies, no systematic approach has been developed for the optimal division of data for ANN models. This paper presents two methodologies for dividing data into representative subsets, namely, a genetic algorithm (GA) and a self-organizing map (SOM). These two methods are compared with the conventional approach commonly used in the literature, which involves an arbitrary division of the data. A case study is presented in which ANN models developed using each data division technique are used to forecast salinity in the River Murray at Murray Bridge (South Australia) 14 days in advance, When tested on a validation data set from July 1992 to March 1998, the models developed using the GA and SOM data division techniques resulted in a reduction in RMS error of 24.2% and 9.9%, respectively, over the conventional data division method. It was found that a SOM could be used to diagnose why an ANN model has performed poorly, given that the poor performance is primarily related to the data themselves and not the choice of the ANN's parameters or architecture.
引用
收藏
页码:2 / 1
页数:11
相关论文
共 32 条
[1]  
*AM SOC CIV ENG TA, 2000, J HYDROL ENG, V5, P124, DOI DOI 10.1061/(ASCE)1084-0699(2000)5:2(124)
[2]  
[Anonymous], 1989, GENETIC ALGORITHM SE
[3]  
Braddock RD, 1998, ENVIRONMETRICS, V9, P419, DOI 10.1002/(SICI)1099-095X(199807/08)9:4<419::AID-ENV312>3.0.CO
[4]  
2-D
[5]   NEURAL-NETWORK-BASED OBJECTIVE FLOW REGIME IDENTIFICATION IN AIR-WATER 2-PHASE FLOW [J].
CAI, SQ ;
TORAL, H ;
QIU, JH ;
ARCHER, JS .
CANADIAN JOURNAL OF CHEMICAL ENGINEERING, 1994, 72 (03) :440-445
[6]   Forecasting river flow rate during low-pow periods using neural networks [J].
Campolo, M ;
Soldati, A ;
Andreussi, P .
WATER RESOURCES RESEARCH, 1999, 35 (11) :3547-3552
[7]  
CYBENKO G, 1989, MATH CONTROL SIGNAL, V2, P203
[8]   OPTIMUM OPERATION OF A MULTIPLE RESERVOIR SYSTEM INCLUDING SALINITY EFFECTS [J].
DANDY, G ;
CRAWLEY, P .
WATER RESOURCES RESEARCH, 1992, 28 (04) :979-990
[9]   An improved genetic algorithm for pipe network optimization [J].
Dandy, GC ;
Simpson, AR ;
Murphy, LJ .
WATER RESOURCES RESEARCH, 1996, 32 (02) :449-458
[10]   An artificial neural network approach to rainfall-runoff modelling [J].
Dawson, CW ;
Wilby, R .
HYDROLOGICAL SCIENCES JOURNAL-JOURNAL DES SCIENCES HYDROLOGIQUES, 1998, 43 (01) :47-66