WHICH PRINCIPAL COMPONENTS TO UTILIZE FOR PRINCIPAL COMPONENT REGRESSION

被引:95
作者
SUTTER, JM [1 ]
KALIVAS, JH [1 ]
LANG, PM [1 ]
机构
[1] IDAHO STATE UNIV, DEPT MATH, POCATELLO, ID 83209 USA
关键词
PRINCIPAL COMPONENT REGRESSION; CALIBRATION; OPTIMALITY; PRINCIPAL COMPONENT SELECTION; QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIP;
D O I
10.1002/cem.1180060406
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Principal components (PCs) for principal component regression (PCR) have historically been selected from the top down for a reliable predictive model. That is, the PCs are arranged in a list starting with the most informative (PC associated with the largest singular value) and proceeding to the least informative (PC associated with the smallest singular value). PCs are then chosen starting at the top of this list. This paper discusses an alternative procedure of treating PC selection as an optimization problem. Specifically, without any regard to the ordering, the optimal subset of PCs for an acceptable predictive model is desired. Five data sets are analyzed using the conventional and alternative approaches. Two data sets are spectroscopic in nature, two data sets deal with quantitative structure-activity relationships (QSARs) and one data set is concerned with modeling. All five data sets confirm that selection of a subset without consideration to order secures the best results with PCR. One data set is also compared using partial least squares 1.
引用
收藏
页码:217 / 225
页数:9
相关论文
共 21 条