Peak Aggregation as an Innovative Strategy for Improving the Predictive Power of LC-MS Metabolomic Profiles

Cited by: 10

Authors:

Fernandez-Albert, Francesc ^{[1,2,3]}

Llorach, Rafael ^{[2]}

Andres-Lacueva, Cristina ^{[2]}

Perera-Lluna, Alexandre ^{[1,3]}

Affiliations:

[1] Univ Politecn Cataluna, ESAII Dept, B2SLab, Barcelona, Spain

[2] Univ Barcelona, Sch Pharm, INGENIO CONSOLIDER Program, Dept Nutr & Food Sci, Biomarkers & Nutrimetabol La, Barcelona, Spain

[3] CIBER Bioengn Biomat & Nanomed CIBER BBN, Barcelona, Spain

Source:

ANALYTICAL CHEMISTRY | 2014 / Vol. 86 / Issue 05

Keywords:

MASS-SPECTROMETRY; INFORMATION; SPECTRA;

DOI:

10.1021/ac403702p

Chinese Library Classification (CLC) number:

O65 [Analytical Chemistry];

Discipline classification code:

070302 [Analytical Chemistry];

Abstract:

Liquid chromatography-mass spectrometry (LC-MS)-based metabolomic datasets consist of different features, including (de)protonated molecules, fragments, adducts, and isotopes, that may show high correlation values related to a high level of collinearity. Several sources of these high correlation patterns have been described for metabolomic datasets; among them, the high correlation between features coming from the same metabolite should be highlighted. It is well-known that soft ionization methods (such as electrospray) produce several mass features from a particular compound (i.e., a metabolite spectrum). Typically, the statistical methods used in metabolomics consider spectral peaks as variables. However, it has been reported that high collinearity between variables might be responsible for high uncertainty values in the predictors of a regression. In this context, this technical note proposes a new strategy based on the application of so-called peak aggregation methods (NMF Reduction, PCA Decomposition, Maximum Peak, and Spectrum Mean) to take advantage of the variable collinearity and solve the issue of high variable collinearity. A set of real samples obtained after human nutritional intervention with placebo or polyphenol-rich beverages was used to test this methodology. The results showed that applying any peak aggregation method (especially NMF and PCA) improves the statistical prediction power of class pertinence independently of the nature of the classifier (linear PLS-DA or nonlinear SVM). Overall, the introduction of this new approach resulted in a reduction of the dimensionality of the data and, in addition, in a significant increase in the overall predictive power of the data.
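This record gives no implementation details for the four aggregation methods named in the abstract. The following is a minimal sketch of how they might be expressed, assuming the peaks have already been grouped by metabolite spectrum. The function names (`aggregate_peaks`, `rank1_nmf`), the `groups` labels, and the specific choices (peak with the highest mean intensity for Maximum Peak, first principal component score for PCA, a rank-1 factorization with multiplicative updates for NMF) are illustrative assumptions, not the authors' code.

```python
import numpy as np


def rank1_nmf(block, n_iter=200, eps=1e-12):
    """Rank-1 nonnegative factorization block ~ outer(w, h) via multiplicative updates.

    Assumes block is nonnegative (e.g. peak intensities). Returns one score per sample.
    """
    w = block.mean(axis=1) + eps  # per-sample scores
    h = block.mean(axis=0) + eps  # per-peak loadings
    for _ in range(n_iter):
        h = h * (block.T @ w) / (h * (w @ w) + eps)
        w = w * (block @ h) / (w * (h @ h) + eps)
    # Move the scale into the scores so the loadings have unit norm.
    return w * np.linalg.norm(h)


def aggregate_peaks(X, groups, method="mean"):
    """Collapse the columns of X (samples x peaks) sharing a group label into one variable."""
    X = np.asarray(X, dtype=float)
    groups = np.asarray(groups)
    labels = np.unique(groups)
    out = np.empty((X.shape[0], labels.size))
    for j, g in enumerate(labels):
        block = X[:, groups == g]
        if method == "max":
            # Maximum Peak: keep the peak with the highest mean intensity.
            out[:, j] = block[:, np.argmax(block.mean(axis=0))]
        elif method == "mean":
            # Spectrum Mean: average all peaks of the group per sample.
            out[:, j] = block.mean(axis=1)
        elif method == "pca":
            # PCA: first principal component score (sign is arbitrary).
            centered = block - block.mean(axis=0)
            u, s, _ = np.linalg.svd(centered, full_matrices=False)
            out[:, j] = u[:, 0] * s[0]
        elif method == "nmf":
            out[:, j] = rank1_nmf(block)
        else:
            raise ValueError(f"unknown method: {method}")
    return out, labels


# Toy data: 3 samples, 4 peaks; peaks 0-1 and 2-3 belong to two hypothetical metabolites.
X = np.array([[1.0, 2.0, 5.0, 7.0],
              [2.0, 4.0, 6.0, 8.0],
              [3.0, 6.0, 9.0, 11.0]])
groups = [0, 0, 1, 1]
for method in ("max", "mean", "pca", "nmf"):
    reduced, _ = aggregate_peaks(X, groups, method)
    print(method, reduced.shape)
```

In each case the four original peak variables are reduced to two aggregated variables, one per group, which could then be passed to a classifier such as PLS-DA or an SVM in place of the raw peaks.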


Pages: 2320-2325

Page count: 6
