A program to perform Ward's clustering method on several regionalized variables

被引:61
作者
Hervada-Sala, C
Jarauta-Bragulat, E
机构
[1] Univ Politecn Catalunya, Dept Phys & Nucl Engn, Terrassa 08222, Spain
[2] Univ Politecn Catalunya, Dept Appl Math 3, E-08028 Barcelona, Spain
关键词
FORTRAN program; regionalized variables; generalized ward's method; fast Fourier transform;
D O I
10.1016/j.cageo.2004.07.003
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
There are many statistical techniques that allow finding similarities or differences among data and variables. Cluster analysis encompasses many diverse techniques for discovering structure within complex sets of data. The objective of cluster analysis is to group either the data or the variables into clusters such that the elements within a cluster have a high degree of "natural association" among themselves while clusters are "relatively distinct" from one another. To do so, many criteria have been described: partitioning methods, arbitrary origin methods, mutual similarity procedures and hierarchical clustering techniques. One of the most widespread hierarchical clustering methods is the Ward's method. Earth science studies deal in general with multivariate and regionalized observations which may be compositional, i.e. data such as percentages, concentrations, mg/kg (ppm). Sometimes, it is interesting to know whether these data have to be divided into different subpopulations. This problem cannot be studied with traditional Ward's method because samples are not independent. In that case, an extension of Ward's clustering method to spatially dependent samples can be used. This methodology is based on a generalized Mahalanobis distance, which uses the covariance and cross-covariance (or variogram and cross-variogram) matrices. This paper describes a refinement of this method previously defined, which was iterative and tedious, as it was necessary to re-estimate the spatial covariance structure at each step. In this paper, we stay within the same theoretical framework, but we improve the methodology using the fast fourier Transform method to find the covariance structure. Thus, we obtain a generalization to several variables of adapted Ward's clustering method. (C) 2004 Elsevier Ltd. All rights reserved.
引用
收藏
页码:881 / 886
页数:6
相关论文
共 9 条
[1]   A stochastic wire length distribution for gigascale integration (GSI) [J].
Davis, JA ;
De, VK ;
Meindl, JD .
PROCEEDINGS OF THE IEEE 1997 CUSTOM INTEGRATED CIRCUITS CONFERENCE, 1997, :145-150
[2]  
Deutsch C., 1998, Geostatistical software library and users guide (gslib)
[3]  
Everitt B., 1993, CLUSTER ANAL
[4]  
JIMENEZGONZALEZ S, 1998, P 4 M INT ASS MATH G, P445
[5]  
JIMENEZGONZALEZ S, 1999, AGRUPACION DATOS DEP
[6]  
PAWLOWSKYGLAHN V, 1997, P 3 M INT ASS MATH G, P175
[7]  
TAUBER F, 1997, P IAMG 97 3 ANN C IN, P169
[9]  
YAO T, 1998, MATHEMATICAL GEOLOGY, V30, P615