On the use of multivariate statistical methods for combining in-stream monitoring data and spatial analysis to characterize water quality conditions in the White River Basin, Indiana, USA

被引:53
作者
Gamble, Andrew [1 ]
Babbar-Sebens, Meghna [1 ]
机构
[1] Indiana Univ Purdue Univ, Dept Earth Sci, Indianapolis, IN 46202 USA
关键词
Water quality; Principal component analysis; Linear discriminant analysis; Kohonen self-organizing map; Support vector machine; Cluster analysis; CLASSIFICATION;
D O I
10.1007/s10661-011-2005-y
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Mechanistic hydrologic and water quality models provide useful alternatives for estimating water quality in unmonitored streams. However, developing these elaborate models for large watersheds can be time-consuming and expensive, in addition to challenges that arise during calibration when there is limited spatial and/or temporal monitored in-stream water quality data. The main objective of this research was to investigate different approaches for developing multivariate analysis models as alternative methods for rapidly assessing relationships between spatio-temporal physical attributes of the watershed and water quality conditions in monitored streams, and then using the developed relationships for estimating water quality conditions in unmonitored streams. The study compares the use of various statistical estimates (mean, geometric mean, trimmed mean, and median) of monitored water quality variables to represent annual and seasonal water quality conditions. The relationship between these estimates and the spatial data is then modeled via linear and non-linear multivariate methods. Overall, the non-linear techniques for classification outperformed the linear techniques with an average cross-validation accuracy of 79.7%. Additionally, the geometric mean based models outperformed models based on other statistical indicators with an average cross-validation accuracy of 80.2%. Dividing the data into annual and quarterly datasets also offered important insights into the behavior of certain water quality variables impacted by seasonal variations. The research provides useful guidance on the use and interpretation of the various statistical estimates and statistical models for multivariate water quality analyses.
引用
收藏
页码:845 / 875
页数:31
相关论文
共 45 条
[1]  
[Anonymous], 2002, SAS 913 HELP DOCUMEN
[2]  
[Anonymous], 2002, VYCHISL TEKHNOL
[3]  
[Anonymous], CEES PUBLICATION
[4]  
[Anonymous], SAS SUGI 30 P STAT D
[5]  
[Anonymous], SOM TOOLBOX 2 0 SOFT
[6]   Some new indexes of cluster validity [J].
Bezdek, JC ;
Pal, NR .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1998, 28 (03) :301-315
[7]   AN ANALYSIS OF TRANSFORMATIONS [J].
BOX, GEP ;
COX, DR .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1964, 26 (02) :211-252
[8]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[9]  
Chen YW, 2006, STUD FUZZ SOFT COMP, V207, P315
[10]   The role of hydrology in annual organic carbon loads and terrestrial organic matter export from a midwestern agricultural watershed [J].
Dalzell, Brent J. ;
Filley, Timothy R. ;
Harbor, Jon M. .
GEOCHIMICA ET COSMOCHIMICA ACTA, 2007, 71 (06) :1448-1462