A method of robust multivariate outlier replacement

被引:37
作者
Hoo, KA
Tvarlapati, KJ
Piovoso, MJ
Hajare, R
机构
[1] Texas Tech Univ, Dept Chem Engn, Lubbock, TX 79409 USA
[2] Univ S Carolina, Dept Chem Engn, Columbia, SC USA
[3] Penn State Univ, Sch Grad Prof Studies, Malvern, PA 19355 USA
[4] Exxon Chem Co, Core Engn Specialists, Baytown, TX 77522 USA
关键词
principal component analysis; multivariate outliers; winsorizing; location; scale; MADM;
D O I
10.1016/S0098-1354(01)00734-7
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Robust multivariate methods for dealing with problems caused by outliers in the data are essential especially when process data are used to validate mechanistic models, develop regression models, and in applications such as controller design and process monitoring. Gross outliers are detected easily by simple methods such as range checking, however, a multivariate outlier is very difficult to discern and techniques that rely on data to generate empirical models may produce erroneous results. In this work, a methodology to perform multivariate outlier replacement in the score space generated by principal component analysis (PCA) is proposed. The objective was to find an accurate estimate of the covariance matrix of the data so that a PCA model might be developed that could then be used for monitoring and fault detection and identification. The methodology uses the concept of winsorization to provide robust estimates of the mean (location) and S.D. (scale) iteratively, yielding a robust set of data. The paper develops the approach, discusses the concept of robust statistics and winsorization, and presents the procedures for robust multivariate outlier filtering. One simulated and two industrial examples are provided to demonstrate the approach. (C) 2002 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:17 / 39
页数:23
相关论文
共 31 条
[1]  
ANDREWS DF, 1972, ROBUST ESTIMATION LO
[2]  
[Anonymous], 2017, Introduction to robust estimation and hypothesis testing
[3]   Multivariate image analysis for real-time process monitoring and control [J].
Bharati, MH ;
MacGregor, JF .
INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 1998, 37 (12) :4715-4724
[4]   ON SOME ROBUST ESTIMATES OF LOCATION [J].
BICKEL, PJ .
ANNALS OF MATHEMATICAL STATISTICS, 1965, 36 (03) :847-858
[5]  
BOX GEP, 1953, BIOMETRIKA, V40, P318, DOI 10.1093/biomet/40.3-4.318
[6]  
Campbell N. A., 1980, Applied Statistics, V29, P231, DOI 10.2307/2346896
[7]   Robust statistical process monitoring [J].
Chen, J ;
Bandoni, A ;
Romagnoli, JA .
COMPUTERS & CHEMICAL ENGINEERING, 1996, 20 :S497-S502
[8]   Robust PCA and normal region in multivariate statistical process monitoring [J].
Chen, JG ;
Bandoni, JA ;
Romagnoli, JA .
AICHE JOURNAL, 1996, 42 (12) :3563-3566
[9]  
CHEN Z, 1981, ROBUST PRINCIPAL COM
[10]   ROBUST ESTIMATION OF DISPERSION MATRICES AND PRINCIPAL COMPONENTS [J].
DEVLIN, SJ ;
GNANADESIKAN, R ;
KETTENRING, JR .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1981, 76 (374) :354-362