Genetic algorithms combined with discriminant analysis for key variable identification

被引:75
作者
Chiang, LH [1 ]
Pell, RJ [1 ]
机构
[1] Dow Chem Co USA, Analyt Sci Lab, Midland, MI 48667 USA
关键词
genetic algorithms; contribution chart; principal component analysis (PCA); Fisher discriminant analysis (FDA); fault identification; variable selection;
D O I
10.1016/S0959-1524(03)00029-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many trouble-shooting problems in process industries are related to key variable identification for classifications. The contribution charts, based on principal component analysis (PCA), can be applied for this purpose. Genetic algorithms (GAs) have been proposed recently for many applications including variable selection for multivariate calibration, molecular modeling, regression analysis, model identification, curve fitting, and classification. In this paper, GAs are incorporated with Fisher discriminant analysis (FDA) for key variable identification. GAs are used as an optimization tool to determine variables that maximize the FDA classification success rate for two given data sets. GA/FDA is a proposed solution for the variable selection problem in discriminant analysis. The Tennessee. Eastman process (TEP) simulator was used to generate the data sets to evaluate the correctness of the key variable selection using GA/FDA, and the T-2 and Q statistic contribution charts. GA/FDA correctly identifies the key variables for the TEP case studies that were tested. For one case study where the correlation changes in two data sets, the contribution charts incorrectly suggest that the operating conditions are similar. On the other hand, GA/FDA not only determines that the operating conditions are different, but also identifies the key variables for the change. For another case study where many key variables are responsible for the changes in the two data sets, the contribution charts only identifies a fraction of the key variables, while GA/ FDA correctly identifies all of the key variables. GA/FDA is a promising technique for key variable identification, as is evidenced in successful applications at The Dow Chemical Company. (C) 2003 Elsevier Ltd. All rights reserved.
引用
收藏
页码:143 / 155
页数:13
相关论文
共 31 条
[1]  
Aarts E., 1989, Wiley-Interscience Series in Discrete Mathematics and Optimization
[2]  
[Anonymous], 1998, Chemometrics: A Practical Guide
[3]   CONFORMATIONAL-ANALYSIS OF A DINUCLEOTIDE PHOTODIMER WITH THE AID OF THE GENETIC ALGORITHM [J].
BLOMMERS, MJJ ;
LUCASIUS, CB ;
KATEMAN, G ;
KAPTEIN, R .
BIOPOLYMERS, 1992, 32 (01) :45-52
[4]   Discovery of operational spaces from process data for production of multiple grades of products [J].
Chen, FZ ;
Wang, XZ .
INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2000, 39 (07) :2378-2383
[5]  
Chiang L.H., 2001, ADV TK CONT SIGN PRO, P71
[6]   Fault diagnosis in chemical processes using Fisher discriminant analysis, discriminant partial least squares, and principal component analysis [J].
Chiang, LH ;
Russell, EL ;
Braatz, RD .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2000, 50 (02) :243-252
[7]   Exploring process data with the use of robust outlier detection algorithms [J].
Chiang, LH ;
Pell, RJ ;
Seasholtz, MB .
JOURNAL OF PROCESS CONTROL, 2003, 13 (05) :437-449
[8]   CURVE-FITTING USING NATURAL COMPUTATION [J].
DEWEIJER, AP ;
LUCASIUS, CB ;
BUYDENS, L ;
KATEMAN, G ;
HEUVEL, HM ;
MANNEE, H .
ANALYTICAL CHEMISTRY, 1994, 66 (01) :23-31
[9]   A PLANT-WIDE INDUSTRIAL-PROCESS CONTROL PROBLEM [J].
DOWNS, JJ ;
VOGEL, EF .
COMPUTERS & CHEMICAL ENGINEERING, 1993, 17 (03) :245-255
[10]  
Duda R. O., 1973, PATTERN CLASSIFICATI