Acceptance areas for multivariate classification derived by projection methods

被引:111
作者
Pomerantsev, Alexey L. [1 ]
机构
[1] Russian Acad Sci, Inst Chem Phys, Moscow 119991, Russia
关键词
PCA; SIMCA; leverage distribution; residual variance distribution; type I error; acceptance area; classification; influence plot; outlier;
D O I
10.1002/cem.1147
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 [计算机科学与技术];
摘要
In the projection methods (PCA, PLS) two distance measures are of importance. They are the score distance (SD, a.k.a. leverage) and the orthogonal distance (OD, a.k.a. the residual variance). This paper shows that both distance measures can be modeled by the chi(2)-distribution. Each model includes a scaling factor that can be described by an explicit equation. Moreover, the models depend on an unknown number of degrees of freedom, which have to be estimated using a training dataset. Such modeling is further applied to classification within the SIMCA framework, and various acceptance areas are built for a given significance level. A triangular area, constructed using the sum of the normalized SD and OD, is deemed to be the most practical. This theoretical notion is supported by three examples. The first is based on a simulated dataset, while the other two employ real world data. Copyright (C) 2008 John Wiley & Sons, Ltd.
引用
收藏
页码:601 / 609
页数:9
相关论文
共 34 条
[1]
Abramowitz M., 1964, HDB MATH FUNCTIONS F, V55
[2]
[Anonymous], CHEMOM INTELL LAB SY
[3]
SOME THEOREMS ON QUADRATIC FORMS APPLIED IN THE STUDY OF ANALYSIS OF VARIANCE PROBLEMS .1. EFFECT OF INEQUALITY OF VARIANCE IN THE ONE-WAY CLASSIFICATION [J].
BOX, GEP .
ANNALS OF MATHEMATICAL STATISTICS, 1954, 25 (02) :290-302
[4]
Robust statistics in data analysis - A review basic concepts [J].
Daszykowski, M. ;
Kaczmarek, K. ;
Heyden, Y. Vander ;
Walczak, B. .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2007, 85 (02) :203-219
[5]
Robust SIMCA-bounding influence of outliers [J].
Department of Chemometrics, The University of Silesia, 9 Szkolna Street, 40-006 Katowice, Poland ;
不详 .
Chemometr. Intelligent Lab. Syst., 2007, 1 (95-103) :95-103
[6]
Projection methods in chemistry [J].
Daszykowski, M ;
Walczak, B ;
Massart, DL .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2003, 65 (01) :97-112
[7]
On-line application of the orthogonal projection approach (OPA) and the soft independent modelling of class analogy approach (SIMCA) for the detection of the end point of a polymorph conversion reaction by near infrared spectroscopy (NIR) [J].
De Braekeleer, K ;
De Maesschalck, R ;
Hailey, PA ;
Sharp, DCA ;
Massart, DL .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1999, 46 (02) :103-116
[8]
The Mahalanobis distance [J].
De Maesschalck, R ;
Jouan-Rimbaud, D ;
Massart, DL .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2000, 50 (01) :1-18
[9]
Decision criteria for soft independent modelling of class analogy applied to near infrared data [J].
De Maesschalck, R ;
Candolfi, A ;
Massart, DL ;
Heuerding, S .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1999, 47 (01) :65-77
[10]
Pharmaceutical counterfeiting [J].
Deisingh, AK .
ANALYST, 2005, 130 (03) :271-279