Application of metabolomics to plant genotype discrimination using statistics and machine learning

被引:176
作者
Taylor, J [1 ]
King, RD
Altmann, T
Fiehn, O
机构
[1] Univ Wales, Dept Comp Sci, Aberystwyth SY23 3DB, Dyfed, Wales
[2] Max Planck Inst Mol Plant Physiol, D-14424 Potsdam, Germany
关键词
metabolome; Arabidopsis; clustering;
D O I
10.1093/bioinformatics/18.suppl_2.S241
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Metabolomics is a post genomic technology which seeks to provide a comprehensive profile of all the metabolites present in a biological sample. This complements the mRNA profiles provided by microarrays, and the protein profiles provided by proteomics. To test the power of metabolome analysis we selected the problem of discrimating between related genotypes of Arabidopsis. Specifically, the problem tackled was to discrimate between two background genotypes (Col0 and C24) and, more significantly, the offspring produced by the crossbreeding of these two lines, the progeny (whose genotypes would differ only in their maternally inherited mitichondia and chloroplasts). Overview: A gas chromotography-mass spectrometry (GCMS) profiling protocol was used to identify 433 metabolites in the samples. The metabolomic profiles were compared using descriptive statistics which indicated that key primary metabolites vary more than other metabolites. We then applied neural networks to discriminate between the genotypes. This showed clearly that the two background lines can be discrimated between each other and their progeny, and indicated that the two progeny lines can also be discriminated. We applied Euclidean hierarchical and Principal Component Analysis (PCA) to help understand the basis of genotype discrimination. PCA indicated that malic acid and citrate are the two most important metabolites for discriminating between the background lines, and glucose and fructose are two most important metabolites for discriminating between the crosses. These results are consistant with genotype differences in mitochondia and chloroplasts.
引用
收藏
页码:S241 / S248
页数:8
相关论文
共 25 条
[1]   Simultaneous determination by capillary gas chromatography of organic acids, sugars, and sugar alcohols in plant tissue extracts as their trimethylsilyl derivatives [J].
Adams, MA ;
Chen, ZL ;
Landman, P ;
Colmer, TD .
ANALYTICAL BIOCHEMISTRY, 1999, 266 (01) :77-84
[2]  
ANDERSON M, 1998, ARABIDOPSIS ANN PLAN, V1
[3]   A comparison of gel-based, nylon filter and microarray techniques to detect differential RNA expression in plants [J].
Baldwin, D ;
Crane, V ;
Rice, D .
CURRENT OPINION IN PLANT BIOLOGY, 1999, 2 (02) :96-103
[4]  
BUCHANAN B, 1994, BIOSCIENCE, V34, P378
[5]  
COHEN P, 1993, CONTROL ENZYME ACTIV
[6]  
CORNISHBOWDEN A, 1995, ADV MOL CEL, V11, P21, DOI DOI 10.1016/S1569-2558(08)60247-7
[7]  
DENNIS DT, 1990, PLANT PHYSL BIOCH MO
[8]  
Everitt B, 1974, CLUSTER ANAL
[9]   Identification of uncommon plant metabolites based on calculation of elemental compositions using gas chromatography and quadrupole mass spectrometry [J].
Fiehn, O ;
Kopka, J ;
Trethewey, RN ;
Willmitzer, L .
ANALYTICAL CHEMISTRY, 2000, 72 (15) :3573-3580
[10]   Metabolite profiling for plant functional genomics [J].
Fiehn, O ;
Kopka, J ;
Dörmann, P ;
Altmann, T ;
Trethewey, RN ;
Willmitzer, L .
NATURE BIOTECHNOLOGY, 2000, 18 (11) :1157-1161