Disjoint hard models for classification

Cited by: 8
Authors
Li, Dong [1]
Lloyd, Gavin R. [1]
Duncan, John C. [2]
Brereton, Richard G. [1]
Affiliations
[1] Univ Bristol, Ctr Chemometr, Sch Chem, Bristol BS8 1TS, Avon, England
[2] Triton Technol Ltd, Keyworth NG12 5AW, Notts, England
Keywords
classification; quadratic discriminant analysis; support vector machines; pattern recognition; disjoint PCA; polymers; LEARNING VECTOR QUANTIZATION; PATTERN-RECOGNITION; COMPONENTS; SELECTION; MACHINES;
DOI
10.1002/cem.1288
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
The paper describes a new approach for disjoint hard modelling of classes. This involves developing independent PC models for each group in the class, and calculating both the Q statistic (squared prediction error) for each sample to the class model and a separate statistic describing how well samples are classified within the projected PC space. The latter statistic can be applied to different types of classifiers; in this paper we choose to illustrate it with Quadratic Discriminant Analysis (D statistic) and one-class Support Vector Domain Description (SVDD) (f-value). The two measures (Q and the classifier-dependent statistic) are combined into a joint decision function which uniquely classifies each sample. The disjoint hard models are contrasted with conjoint models, where PCA is performed on the entire dataset, using both QDA and Support Vector Machine (SVM) classifiers. The optimum number of PCs for each model is determined using the bootstrap, and model performance is assessed on 100 test sets obtained using different iterative splits, using %PA (Predictive Ability) and %CR (Classification Rate). The method is illustrated using a dataset consisting of 293 samples from nine groups of polymers obtained using thermal profiling. The approach described in this paper has many of the advantages of one-class disjoint models (e.g. SIMCA) and of conventional hard models, and is useful if it is known that all samples must belong to one of a series of known groups but each group has a very different structure. Copyright (C) 2010 John Wiley & Sons, Ltd.
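The disjoint hard-modelling idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes NumPy, takes the D statistic to be the squared Mahalanobis distance of a sample's scores from the class centre in that class's PC space, and combines Q and D by dividing each by its training-set mean and summing. That combination rule, the function names (`fit_class_model`, `q_and_d`, `classify`) and the synthetic data are all assumptions made for illustration; the abstract does not state the form of the joint decision function.

```python
import numpy as np

def fit_class_model(X, n_pcs):
    """Fit an independent PCA model to the training samples of one class."""
    mean = X.mean(axis=0)
    Xc = X - mean
    # Rows of Vt are the principal component loadings of the centred data.
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    P = Vt[:n_pcs].T                       # loadings, variables x n_pcs
    T = Xc @ P                             # training scores
    cov_inv = np.linalg.inv(np.atleast_2d(np.cov(T, rowvar=False)))
    model = {"mean": mean, "P": P, "cov_inv": cov_inv}
    # Assumption: scale each statistic by its mean over the training samples.
    q, d = q_and_d(model, X)
    model["q_scale"] = q.mean()
    model["d_scale"] = d.mean()
    return model

def q_and_d(model, X):
    """Return Q (squared prediction error) and D (squared Mahalanobis
    distance in the class PC space) for each row of X."""
    Xc = X - model["mean"]
    T = Xc @ model["P"]
    residual = Xc - T @ model["P"].T
    q = (residual ** 2).sum(axis=1)
    d = np.einsum("ij,jk,ik->i", T, model["cov_inv"], T)
    return q, d

def classify(models, X):
    """Assign each sample to the class with the smallest joint statistic.
    The sum of scaled Q and scaled D is an assumed combination rule."""
    joint = []
    for model in models:
        q, d = q_and_d(model, X)
        joint.append(q / model["q_scale"] + d / model["d_scale"])
    return np.argmin(np.vstack(joint), axis=0)

if __name__ == "__main__":
    rng = np.random.default_rng(0)

    def make_class(centre, n):
        # Synthetic class: two latent factors plus small noise in 6 variables.
        latent = rng.normal(size=(n, 2))
        loadings = rng.normal(size=(2, 6))
        return centre + latent @ loadings + 0.1 * rng.normal(size=(n, 6))

    A, B = make_class(0.0, 40), make_class(10.0, 40)
    models = [fit_class_model(A, 2), fit_class_model(B, 2)]
    predicted = classify(models, np.vstack([A, B]))
    truth = np.repeat([0, 1], 40)
    print("fraction correct:", (predicted == truth).mean())
```

Because every sample is assigned to the class with the minimum joint value, each sample receives exactly one label, which mirrors the hard-modelling requirement that all samples belong to one of the known groups. Choosing the number of PCs per class by the bootstrap, as the abstract describes, is not covered here.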
Pages: 273-287
Number of pages: 15