A strategy for identifying differences in large series of metabolomic samples analyzed by GC/MS

被引:264
作者
Jonsson, P
Gullberg, J
Nordström, A
Kusano, M
Kowalczyk, M
Sjöström, M
Moritz, T [1 ]
机构
[1] Swedish Univ Agr Sci, Dept Forest Genet & Plant Physiol, Umea Plant Sci, SE-90187 Umea, Sweden
[2] Umea Univ, Dept Chem, Res Grp Chemometr Organ Chem, SE-90187 Umea, Sweden
关键词
D O I
10.1021/ac0352427
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In metabolomics, the purpose is to identify and quantify all the metabolites in a biological system. Combined gas chromatography and mass spectrometry (GC/MS) is one of the most commonly used techniques in metabolomics together with H-1 NMR, and it has been shown that more than 300 compounds can be distinguished with GC/MS after deconvolution of overlapping peaks. To avoid having to deconvolute all analyzed samples prior to multivariate analysis of the data, we have developed a strategy for rapid comparison of nonprocessed MS data files. The method includes baseline correction, alignment, time window determinations, alternating regression, PLS-DA, and identification of retention time windows in the chromatograms that explain the differences between the samples. Use of alternating regression also gives interpretable loadings, which retain the information provided by m/z values that vary between the samples in each retention time window. The method has been applied to plant extracts derived from leaves of different developmental stages and plants subjected to small changes in day length. The data show that the new method can detect differences between the samples and that it gives results comparable to those obtained when deconvolution is applied prior to the multivariate analysis. We suggest that this method can be used for rapid comparison of large sets of GC/MS data, thereby applying time-consuming deconvolution only to parts of the chromatograms that contribute to explain the differences between the samples.
引用
收藏
页码:1738 / 1745
页数:8
相关论文
共 37 条
[1]   High-throughput classification of yeast mutants for functional genomics using metabolic footprinting [J].
Allen, J ;
Davey, HM ;
Broadhurst, D ;
Heald, JK ;
Rowland, JJ ;
Oliver, SG ;
Kell, DB .
NATURE BIOTECHNOLOGY, 2003, 21 (06) :692-696
[2]   Calibration of gas chromatography mass spectrometry of two-component mixtures using univariate regression and two- and three-way partial least squares [J].
Demir, C ;
Brereton, RG .
ANALYST, 1997, 122 (07) :631-638
[3]  
EFRON B, 1986, ANN STAT, V14, P1301, DOI 10.1214/aos/1176350145
[4]   Resolution of GC-MS data of complex PAC mixtures and regression modeling of mutagenicity by PLS [J].
Eide, I ;
Neverdal, G ;
Thorvaldsen, B ;
Shen, HL ;
Grung, B ;
Kvalheim, O .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2001, 35 (11) :2314-2318
[5]   Daylength and spatial expression of a gibberellin 20-oxidase isolated from hybrid aspen (Populus tremula L. x P-tremuloides Michx.) [J].
Eriksson, ME ;
Moritz, T .
PLANTA, 2002, 214 (06) :920-930
[6]   Metabolite profiling for plant functional genomics [J].
Fiehn, O ;
Kopka, J ;
Dörmann, P ;
Altmann, T ;
Trethewey, RN ;
Willmitzer, L .
NATURE BIOTECHNOLOGY, 2000, 18 (11) :1157-1161
[7]   Metabolomics - the link between genotypes and phenotypes [J].
Fiehn, O .
PLANT MOLECULAR BIOLOGY, 2002, 48 (1-2) :155-171
[8]   Objective data alignment and chemometric analysis of comprehensive two-dimensional separations with run-to-run peak shifting on both dimensions [J].
Fraga, CG ;
Prazen, BJ ;
Synovec, RE .
ANALYTICAL CHEMISTRY, 2001, 73 (24) :5833-5840
[9]   A PRIORI ESTIMATES OF THE ELUTION PROFILES OF THE PURE COMPONENTS IN OVERLAPPED LIQUID-CHROMATOGRAPHY PEAKS USING TARGET FACTOR-ANALYSIS [J].
GEMPERLINE, PJ .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1984, 24 (04) :206-212
[10]  
Halket JM, 1999, RAPID COMMUN MASS SP, V13, P279, DOI 10.1002/(SICI)1097-0231(19990228)13:4<279::AID-RCM478>3.0.CO