A TEST OF SIGNIFICANCE FOR PARTIAL LEAST-SQUARES REGRESSION

被引:104
作者
WAKELING, IN [1 ]
MORRIS, JJ [1 ]
机构
[1] ZENECA PHARMACEUT, MACCLESFIELD SK10 4TG, CHESHIRE, ENGLAND
关键词
PARTIAL LEAST SQUARES; MONTE-CARLO METHODS; CROSS VALIDATION;
D O I
10.1002/cem.1180070407
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Partial least squares (PLS) regression is a commonly used statistical technique for performing multivariate calibration, especially in situations where there are more variables than samples. Choosing the number of factors to include in a model is a decision that all users of PLS must make, but is complicated by the large number of empirical tests available. In most instances predictive ability is the most desired property of a PLS model and so interest has centred on making this choice based on an internal validation process. A popular approach is the calculation of a cross-validated r2 to gauge how much variance in the dependent variable can be explained from leave-one-out predictions. Using Monte Carlo simulations for different sizes of data set, the influence of chance effects on the cross-validation process is investigated. The results are presented as tables of critical values which are compared against the values of cross-validated r2 obtained from the user's own data set. This gives a formal test for predictive ability of a PLS model with a given number of dimensions.
引用
收藏
页码:291 / 304
页数:14
相关论文
共 24 条