K-OPLS package: Kernel-based orthogonal projections to latent structures for prediction and interpretation in feature space

被引:67
作者
Bylesjo, Max [1 ]
Rantalainen, Mattias [2 ]
Nicholson, Jeremy K. [2 ]
Holmes, Elaine [2 ]
Trygg, Johan [1 ]
机构
[1] Umea Univ, Dept Chem, Chemometr Res Grp, SE-90187 Umea, Sweden
[2] Univ London Imperial Coll Sci Technol & Med, Fac Med, Dept Biomol Med, Div Surg Oncol Reprod Biol & Anaesthet, London SW7 2AZ, England
关键词
D O I
10.1186/1471-2105-9-106
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Kernel- based classification and regression methods have been successfully applied to modelling a wide variety of biological data. The Kernel- based Orthogonal Projections to Latent Structures ( K- OPLS) method offers unique properties facilitating separate modelling of predictive variation and structured noise in the feature space. While providing prediction results similar to other kernel- based methods, K- OPLS features enhanced interpretational capabilities; allowing detection of unanticipated systematic variation in the data such as instrumental drift, batch variability or unexpected biological variation. Results: We demonstrate an implementation of the K- OPLS algorithm for MATLAB and R, licensed under the GNU GPL and available at http://www.sourceforge.net/projects/kopls/. The package includes essential functionality and documentation for model evaluation ( using cross-validation), training and prediction of future samples. Incorporated is also a set of diagnostic tools and plot functions to simplify the visualisation of data, e. g. for detecting trends or for identification of outlying samples. The utility of the software package is demonstrated by means of a metabolic profiling data set from a biological study of hybrid aspen. Conclusion: The properties of the K- OPLS method are well suited for analysis of biological data, which in conjunction with the availability of the outlined open- source package provides a comprehensive solution for kernel- based analysis in bioinformatics applications.
引用
收藏
页数:7
相关论文
共 24 条
[21]   Orthogonal projections to latent structures (O-PLS) [J].
Trygg, J ;
Wold, S .
JOURNAL OF CHEMOMETRICS, 2002, 16 (03) :119-128
[22]   O2-PLS, a two-block (X-Y) latent variable regression (LVR) method with an integral OSC filter [J].
Trygg, J ;
Wold, S .
JOURNAL OF CHEMOMETRICS, 2003, 17 (01) :53-64
[23]   A new metabonomic strategy for analysing the growth process of the poplar tree [J].
Wiklund, S ;
Karlsson, M ;
Antti, H ;
Johnels, D ;
Sjöström, M ;
Wingsle, G ;
Edlund, U .
PLANT BIOTECHNOLOGY JOURNAL, 2005, 3 (03) :353-362