Kernel-based orthogonal projections to latent structures (K-OPLS)

被引:57
作者
Rantalainen, Mattias
Bylesjoe, Max
Cloarec, Olivier
Nicholson, Jeremy K.
Holmes, Elaine
Trygg, Johan [1 ]
机构
[1] Umea Univ, Dept Chem, Chemometr Res Grp, SE-90187 Umea, Sweden
[2] Univ London Imperial Coll Sci Technol & Med, Fac Med, SORA, Dept Biomol Med, London SW7 2AZ, England
关键词
K-OPLS; kernel methods; non-linear; OSQ; OPLS; SVM; kernel PLS;
D O I
10.1002/cem.1071
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The orthogonal projections to latent structures (OPLS) method has been successfully applied in various chemical and biological systems for modeling and interpretation of linear relationships between a descriptor matrix and response matrix. A kernel-based reformulation of the original OPLS algorithm is presented where the kernel Gram matrix is utilized as a replacement for the descriptor matrix. This enables usage of the 'kernel trick' to efficiently transform the data into a higher dimensional feature space where predictive and response-orthogonal components are calculated. This strategy has the capacity to improve predictive performance considerably in situations where strong non-linear relationships exist between descriptor and response variables while retaining the OPLS model framework. We put particular focus on describing properties of the rearranged algorithm in relation to the original OPLS algorithm. Four separate problems, two simulated and two real spectroscopic data sets, are employed to illustrate how the algorithm enables separate modeling of predictive and response-orthogonal variation in the feature space. This separation can be highly beneficial for model interpretation purposes while providing a flexible framework for supervised regression. Copyright (c) 2007 John Wiley & Sons, Ltd.
引用
收藏
页码:376 / 385
页数:10
相关论文
共 32 条
[1]  
Aizerman M., 1964, AUTOMAT REM CONTR, V25, P821, DOI DOI 10.1234/12345678
[2]  
[Anonymous], 2004, KERNEL METHODS PATTE
[3]   STANDARD NORMAL VARIATE TRANSFORMATION AND DE-TRENDING OF NEAR-INFRARED DIFFUSE REFLECTANCE SPECTRA [J].
BARNES, RJ ;
DHANOA, MS ;
LISTER, SJ .
APPLIED SPECTROSCOPY, 1989, 43 (05) :772-777
[4]   A flexible classification approach with optimal generalisation performance: support vector machines [J].
Belousov, AI ;
Verzakov, SA ;
von Frese, J .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2002, 64 (01) :15-25
[5]   Quantitative in-line monitoring of powder blending by near infrared reflection spectroscopy [J].
Berntsson, O ;
Danielsson, LG ;
Lagerholm, B ;
Folestad, S .
POWDER TECHNOLOGY, 2002, 123 (2-3) :185-193
[6]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[7]   OPLS discriminant analysis:: combining the strengths of PLS-DA and SIMCA classification [J].
Bylesjo, Max ;
Rantalainen, Mattias ;
Cloarec, Olivier ;
Nicholson, Jeremy K. ;
Holmes, Elaine ;
Trygg, Johan .
JOURNAL OF CHEMOMETRICS, 2006, 20 (8-10) :341-351
[8]   About kernel latent variable approaches and SVM [J].
Czekaj, T ;
Wu, W ;
Walczak, B .
JOURNAL OF CHEMOMETRICS, 2005, 19 (5-7) :341-354
[9]  
Eriksson L., 2006, MULTI MEGAVARIATE DA, DOI [10.1002/cem.713, DOI 10.1002/CEM.713]
[10]   Separating Y-predictive and Y-orthogonal variation in multi-block spectral data [J].
Eriksson, Lennart ;
Toft, Marianne ;
Johansson, Erik ;
Wold, Svante ;
Trygg, Johan .
JOURNAL OF CHEMOMETRICS, 2006, 20 (8-10) :352-361