A method of robust multivariate outlier replacement

被引:37
作者
Hoo, KA
Tvarlapati, KJ
Piovoso, MJ
Hajare, R
机构
[1] Texas Tech Univ, Dept Chem Engn, Lubbock, TX 79409 USA
[2] Univ S Carolina, Dept Chem Engn, Columbia, SC USA
[3] Penn State Univ, Sch Grad Prof Studies, Malvern, PA 19355 USA
[4] Exxon Chem Co, Core Engn Specialists, Baytown, TX 77522 USA
关键词
principal component analysis; multivariate outliers; winsorizing; location; scale; MADM;
D O I
10.1016/S0098-1354(01)00734-7
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Robust multivariate methods for dealing with problems caused by outliers in the data are essential especially when process data are used to validate mechanistic models, develop regression models, and in applications such as controller design and process monitoring. Gross outliers are detected easily by simple methods such as range checking, however, a multivariate outlier is very difficult to discern and techniques that rely on data to generate empirical models may produce erroneous results. In this work, a methodology to perform multivariate outlier replacement in the score space generated by principal component analysis (PCA) is proposed. The objective was to find an accurate estimate of the covariance matrix of the data so that a PCA model might be developed that could then be used for monitoring and fault detection and identification. The methodology uses the concept of winsorization to provide robust estimates of the mean (location) and S.D. (scale) iteratively, yielding a robust set of data. The paper develops the approach, discusses the concept of robust statistics and winsorization, and presents the procedures for robust multivariate outlier filtering. One simulated and two industrial examples are provided to demonstrate the approach. (C) 2002 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:17 / 39
页数:23
相关论文
共 31 条
[21]   ROBUST ESTIMATION OF LOCATION PARAMETER [J].
HUBER, PJ .
ANNALS OF MATHEMATICAL STATISTICS, 1964, 35 (01) :73-&
[22]  
Jackson JE, 1991, A user's guide to principal components
[23]   CHEMOMETRIC METHODS FOR PROCESS MONITORING AND HIGH-PERFORMANCE CONTROLLER-DESIGN [J].
KASPAR, MH ;
RAY, WH .
AICHE JOURNAL, 1992, 38 (10) :1593-1608
[24]  
KOSANOVICH KA, 1997, INTELLIGENCE DATA AN, V1
[25]   PROJECTION-PURSUIT APPROACH TO ROBUST DISPERSION MATRICES AND PRINCIPAL COMPONENTS - PRIMARY THEORY AND MONTE-CARLO [J].
LI, GY ;
CHEN, ZL .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1985, 80 (391) :759-766
[26]  
Mark H. L., 1985, ANAL CHEM, V57, P1449, DOI DOI 10.1021/AC00284A061
[27]   ROBUST M-ESTIMATORS OF MULTIVARIATE LOCATION AND SCATTER [J].
MARONNA, RA .
ANNALS OF STATISTICS, 1976, 4 (01) :51-67
[28]  
MARTENS H, 1979, MULTIVARIATE CALIBRA
[29]   PROCESS DATA CHEMOMETRICS [J].
PIOVOSO, MJ ;
KOSANOVICH, KA ;
YUK, JP .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 1992, 41 (02) :262-268
[30]  
RICKER NL, 1993, J PROCESS CONTR, V3, P2