Short-term water quality variable prediction using a hybrid CNN-LSTM deep learning model

被引:332
作者
Barzegar, Rahim [1 ,2 ]
Aalami, Mohammad Taghi [2 ]
Adamowski, Jan [1 ]
机构
[1] McGill Univ, Dept Bioresource Engn, 21111 Lakeshore, Ste Anne De Bellevue, PQ H9X3V9, Canada
[2] Univ Tabriz, Fac Civil Engn, 29 Bahman Blvd, Tabriz 5166616471, Iran
基金
加拿大自然科学与工程研究理事会;
关键词
Long short-term memory; Convolutional neural network; Water quality modeling; Deep learning; SUPPORT VECTOR MACHINE; NONPOINT-SOURCE POLLUTION; NEURAL-NETWORK; RANDOM FOREST; IMPLEMENTATION; PARAMETERS; FORECASTS; CLIMATE; DESIGN; MEMORY;
D O I
10.1007/s00477-020-01776-2
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Water quality monitoring is an important component of water resources management. In order to predict two water quality variables, namely dissolved oxygen (DO; mg/L) and chlorophyll-a (Chl-a; mu g/L) in the Small Prespa Lake in Greece, two standalone deep learning (DL) models, the long short-term memory (LSTM) and convolutional neural network (CNN) models, along with their hybrid, the CNN-LSTM model, were developed. The main novelty of this study was to build a coupled CNN-LSTM model to predict water quality variables. Two traditional machine learning models, support-vector regression (SVR) and decision tree (DT), were also developed to compare with the DL models. Time series of the physicochemical water quality variables, specifically pH, oxidation-reduction potential (ORP; mV), water temperature (degrees C), electrical conductivity (EC; mu S/cm), DO and Chl-a, were obtained using a sensor at 15-min intervals from June 1, 2012 to May 31, 2013 for model development. Lag times of up to one (t - 1) and two (t - 2) for input variables pH, ORP, water temperature, and EC were used to predict DO and Chl-a concentrations, respectively. Each model's performance in both training and testing phases was assessed using statistical metrics including the correlation coefficient (r), root mean square error (RMSE), mean absolute error (MAE), their normalized equivalents (RRMSE, RMAE; %), percentage of bias (PBIAS), Nash-Sutcliffe coefficient Willmott's Index, and graphical plots (Taylor diagram, box plot and spider diagram). Results showed that LSTM outperformed the CNN model for DO prediction, but the standalone DL models yielded similar performances for Chl-a prediction. Generally, the hybrid CNN-LSTM models outperformed the standalone models (LSTM, CNN, SVR and DT models) in predicting both DO and Chl-a. By integrating the LSTM and CNN models, the hybrid model successfully captured both the low and high levels of the water quality variables, particularly for the DO concentrations.
引用
收藏
页码:415 / 433
页数:19
相关论文
共 88 条
[71]  
Song Y, 2015, AER ADV ENG RES, V2, P130
[72]   A Deep Neural Network Modeling Framework to Reduce Bias in Satellite Precipitation Products [J].
Tao, Yumeng ;
Gao, Xiaogang ;
Hsu, Kuolin ;
Sorooshian, Soroosh ;
Ihler, Alexander .
JOURNAL OF HYDROMETEOROLOGY, 2016, 17 (03) :931-945
[73]   Environmental monitoring of Micro Prespa Lake basin (Western Macedonia, Greece): hydrogeochemical characteristics of water resources and quality trends [J].
Tziritis, Evangelos P. .
ENVIRONMENTAL MONITORING AND ASSESSMENT, 2014, 186 (07) :4553-4568
[74]  
Wang Y, 2019, NANOMATER NANOTECHNO, V9, DOI [10.1177/1847980418825034, 10.3390/nano9081123]
[75]   Daily urban air quality index forecasting based on variational mode decomposition, sample entropy and LSTM neural network [J].
Wu, Qunli ;
Lin, Huaxing .
SUSTAINABLE CITIES AND SOCIETY, 2019, 50
[76]   Investigating the effects of point source and nonpoint source pollution on the water quality of the East River (Dongjiang) in South China [J].
Wu, Yiping ;
Chen, Ji .
ECOLOGICAL INDICATORS, 2013, 32 :294-304
[77]   Modeling of land use and reservoir effects on nonpoint source pollution in a highly agricultural basin [J].
Wu, Yiping ;
Liu, Shuguang .
JOURNAL OF ENVIRONMENTAL MONITORING, 2012, 14 (09) :2350-2361
[78]   Deep spatiotemporal residual early-late fusion network for city region vehicle emission pollution prediction [J].
Xu, Zhenyi ;
Cao, Yang ;
Kang, Yu .
NEUROCOMPUTING, 2019, 355 :183-199
[79]   Application of the Random Forest model for chlorophyll-a forecasts in fresh and brackish water bodies in Japan, using multivariate long-term databases [J].
Yajima, Hiroshi ;
Derot, Jonathan .
JOURNAL OF HYDROINFORMATICS, 2018, 20 (01) :206-220
[80]   Hybrid deep learning and empirical mode decomposition model for time series applications [J].
Yang, Hao-Fan ;
Chen, Yi-Ping Phoebe .
EXPERT SYSTEMS WITH APPLICATIONS, 2019, 120 :128-138