Finding the number of natural clusters in groundwater data sets using the concept of equivalence class

被引:22
作者
Pacheco, FAL [1 ]
机构
[1] Univ Tras Os Montes & Alto Douro, Seccao Geol, P-5000 Vila Real, Portugal
关键词
cluster analysis; groundwater data set; equivalence class; graph theory;
D O I
10.1016/S0098-3004(97)00140-4
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Cluster analysis has numerous scientific and practical applications. This paper presents a computer program to find an adequate (natural) number of clusters and to isolate anomalous samples in a data set. The program incorporates an algorithm that is based on the mathematical concept of equivalence class and uses the framework of the graph theory to identify equivalence classes in multivariate data bases. This type of clustering algorithm is particularly useful when one is dealing with groundwater data sets, because anomalies are frequent in these sets, and because the number of groups that are present often are impossible to estimate; the number will depend on the combined effect of many factors, including geology, morphology, climate and pollution. As an example of the utility of this program, a set of groundwater samples is clustered, and the average chemistry of nine identified equivalence classes is related to weathering reactions of plagioclase in a Portuguese granitoid area. (C) 1998 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:7 / 15
页数:9
相关论文
共 21 条
[1]  
Bezdek JC, 1974, J CYBERNETICS, V3, P58, DOI [10.1080/01969727308546047, DOI 10.1080/01969727308546047]
[2]  
Christofides N, 1975, GRAPH THEORY ALGORIT
[3]  
COSTA CV, 1971, PUBLICACOES MUSEU LA, V71, P1
[4]  
Deer W.A., 1962, Rock-forming Minerals, Vone
[5]  
Everitt B., 1977, CLUSTER ANAL
[6]  
Ferreira M.P., 1985, MEMORIAS NOTICIAS PU, V99, P167
[7]  
FERREIRA MP, 1982, MEMORIAS NOTICIAS, V94, P31
[8]  
FORGY EW, 1965, BIOMETRICS, V21, P768
[9]  
Garrels R.M., 1967, Researches in Geochemistry, P405
[10]  
Hartigan J. A., 1975, CLUSTERING ALGORITHM