Pattern matching in historical data

被引:98
作者
Johannesmeyer, MC
Singhal, A
Seborg, DE
机构
[1] Procter & Gamble Mfg Co, Sacramento, CA 95826 USA
[2] Univ Calif Santa Barbara, Dept Chem Engn, Santa Barbara, CA 93106 USA
关键词
D O I
10.1002/aic.690480916
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
For many engineering and business problems, it would be very usefull to have a general strategy for pattern matching in large databases. For example, the analysis of an abnormal plant condition would benefit if previous occurrences of the abnormal condition could be located in the historical data. A new pattern-matching strategy is proposed for multivariate time series based on statistical techniques, especially principal-component analysis (PCA). The new approach is both data-driven and unsupervised because neither training data nor a process model is required. Given an arbitrary set of multivariate data, the new approach can be used to locate similar patterns in a large historical database. The proposed pattern-matching strategy is based on two similarity factors: the standard PCA similarity factor and a new similarity factor that characterizes the pattern of alarm violations. An extensive simulation study for a chemical reactor demonstrates that this strategy is more effective than existing PCA methods and can successfully distinguish between 28 different operating conditions.
引用
收藏
页码:2022 / 2038
页数:17
相关论文
共 52 条
[1]  
Agrawal R., 1995, VLDB '95. Proceedings of the 21st International Conference on Very Large Data Bases, P490
[2]  
[Anonymous], P ACM SIG MOD INT C
[3]   Data mining: An industrial research perspective [J].
Apte, C .
IEEE COMPUTATIONAL SCIENCE & ENGINEERING, 1997, 4 (02) :6-9
[4]  
Bishop C. M., 1995, NEURAL NETWORKS PATT
[5]  
Chiang L.H., 2001, ADV TK CONT SIGN PRO, P71
[6]   A combined unifrontal multifrontal method for unsymmetric sparse matrices [J].
Davis, TA ;
Duff, IS .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1999, 25 (01) :1-20
[7]  
DEHASPE L, 1998, P INT C KNOWL DISC D
[8]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[9]  
Duda R.O., 2001, Pattern Classification, V2nd
[10]  
Duda R. O., 2000, Pattern Classification and Scene Analysis, V2nd