The pls package: Principal component and partial least squares regression in R

被引:1448
作者
Mevik, Bjorn-Helge
Wehrens, Ron
机构
[1] Norwegian Univ Life Sci, Dept Chem Biotechnol & Food Sci, N-1432 As, Norway
[2] Radboud Univ Nijmegen, Inst Mol & Mat, NL-6500 GL Nijmegen, Netherlands
来源
JOURNAL OF STATISTICAL SOFTWARE | 2007年 / 18卷 / 02期
关键词
principal component regression; PCR; partial least squares regression; PLSR; R; SPECTRA; MODELS; ERROR;
D O I
10.18637/jss.v018.i02
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The pls package implements principal component regression ( PCR) and partial least squares regression ( PLSR) in R ( R Development Core Team 2006b), and is freely available from the Comprehensive R Archive Network ( CRAN), licensed under the GNU General Public License ( GPL). The user interface is modelled after the traditional formula interface, as exemplified by 1m. This was done so that people used to R would not have to learn yet another interface, and also because we believe the formula interface is a good way of working interactively with models. It thus has methods for generic functions like predict, update and coef. It also has more specialised functions like scores, loadings and RMSEP, and a flexible cross-validation system. Visual inspection and assessment is important in chemometrics, and the pls package has a number of plot functions for plotting scores, loadings, predictions, coefficients and RMSEP estimates. The package implements PCR and several algorithms for PLSR. The design is modular, so that it should be easy to use the underlying algorithms in other functions. It is our hope that the package will serve well both for interactive data analysis and as a building block for other functions or packages using PLSR or PCR. We will here describe the package and how it is used for data analysis, as well as how it can be used as a part of other packages. Also included is a section about formulas and data frames, for people not used to the R modelling idioms.
引用
收藏
页码:1 / 23
页数:23
相关论文
共 22 条
[1]  
Chambers J.M., 1991, Statistical Models in S
[2]   COMPARATIVE MOLECULAR-FIELD ANALYSIS (COMFA) .1. EFFECT OF SHAPE ON BINDING OF STEROIDS TO CARRIER PROTEINS [J].
CRAMER, RD ;
PATTERSON, DE ;
BUNCE, JD .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1988, 110 (18) :5959-5967
[3]  
Dayal BS, 1997, J CHEMOMETR, V11, P73, DOI 10.1002/(SICI)1099-128X(199701)11:1<73::AID-CEM435>3.0.CO
[4]  
2-#
[5]   SIMPLS - AN ALTERNATIVE APPROACH TO PARTIAL LEAST-SQUARES REGRESSION [J].
DEJONG, S .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1993, 18 (03) :251-263
[6]   2 STRUCTURAL EQUATION MODELS - LISREL AND PLS APPLIED TO CONSUMER EXIT-VOICE THEORY [J].
FORNELL, C ;
BOOKSTEIN, FL .
JOURNAL OF MARKETING RESEARCH, 1982, 19 (04) :440-452
[7]   A STATISTICAL VIEW OF SOME CHEMOMETRICS REGRESSION TOOLS [J].
FRANK, IE ;
FRIEDMAN, JH .
TECHNOMETRICS, 1993, 35 (02) :109-135
[8]   LINEARIZATION AND SCATTER-CORRECTION FOR NEAR-INFRARED REFLECTANCE SPECTRA OF MEAT [J].
GELADI, P ;
MACDOUGALL, D ;
MARTENS, H .
APPLIED SPECTROSCOPY, 1985, 39 (03) :491-500
[9]  
Hastie T, 2009, The Elements of Statistical Learning, V2nd, DOI 10.1007/978-0-387-84858-7
[10]   Two data sets of near infrared spectra [J].
Kalivas, JH .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1997, 37 (02) :255-259