Data structures and data transformations for clustering chemical data

被引:14
作者
Massart, B
Guo, Q
Questier, F
Massart, DL
Boucon, C
de Jong, S
Vandeginste, BGM
机构
[1] Free Univ Brussels, Farmaceut Inst, Chemo AC, B-1090 Brussels, Belgium
[2] Unilever Res Labs Vlaardingen, NL-3130 AC Vlaardingen, Netherlands
关键词
chemometrics; clustering; data transforms;
D O I
10.1016/S0167-2940(01)90097-4
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The quality of a clustering of chemical data is determined by a proper choice of distance measures and data transformations. The latter aspect is often neglected and its importance is shown here. It is also shown that the V-shaped data structure that is often obtained in a principal component analysis of chemical data may indicate that the clustering of the raw data can lead to classifications that are not relevant from a chemical point of view and that the log double centering transform should be considered as a possible alternative. (C) 2001 Published by Elsevier Science B.V.
引用
收藏
页码:35 / 41
页数:7
相关论文
共 5 条
[1]   USING GENERALIZED DISTANCES IN CLASSIFICATION OF GROUPS [J].
DAGNELIE, P ;
MERCKX, A .
BIOMETRICAL JOURNAL, 1991, 33 (06) :683-695
[2]   CLUSTERING USING A SIMILARITY MEASURE BASED ON SHARED NEAR NEIGHBORS [J].
JARVIS, RA ;
PATRICK, EA .
IEEE TRANSACTIONS ON COMPUTERS, 1973, C-22 (11) :1025-1034
[3]  
LEWI PJ, 1976, ARZNEIMITTEL-FORSCH, V26, P1295
[4]   THE USE OF PRINCIPAL COMPONENTS-ANALYSIS FOR THE INVESTIGATION OF AN ORGANIC AIR-POLLUTANTS DATA SET [J].
SMEYERSVERBEKE, J ;
DENHARTOG, JC ;
DEKKER, WH ;
COOMANS, D ;
BUYDENS, L ;
MASSART, DL .
ATMOSPHERIC ENVIRONMENT, 1984, 18 (11) :2471-2478
[5]  
Vandeginste B. G. M., 1998, HDB CHEMOMETRICS Q B