Improved genome annotation through untargeted detection of pathway-specific metabolites

Cited by: 8
Authors
Bowen, Benjamin P. [1]
Fischer, Curt R. [2]
Baran, Richard [1]
Banfield, Jillian F. [2, 3]
Northen, Trent [1]
Affiliations
[1] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Div Life Sci, Dept GTL Bioenergy & Struct Biol, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Earth & Planetary Sci, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Dept Environm Sci Policy & Management, Berkeley, CA 94720 USA
Source
BMC GENOMICS | 2011 / Vol. 12
Keywords
MASS-SPECTROMETRY; ELEMENTAL COMPOSITIONS; MOLECULAR FORMULAS; IDENTIFICATION; ASSIGNMENT; PATTERNS; DATABASE; NETWORK; ENZYMES; SYSTEM;
DOI
10.1186/1471-2164-12-S1-S6
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification code
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Background: Mass spectrometry-based metabolomics analyses have the potential to complement sequence-based methods of genome annotation, but only if raw mass spectral data can be linked to specific metabolic pathways. In untargeted metabolomics, the measured mass of a detected compound is used to define the location of the compound in chemical space, but uncertainties in mass measurements lead to "degeneracies" in chemical space, since multiple chemical formulae correspond to the same measured mass. We compare two methods to eliminate these degeneracies. One method relies on natural isotopic abundances, and the other relies on stable-isotope labeling (SIL) to directly determine C and N atom counts. Both depend on combinatorial exploration of the "chemical space" of all possible chemical formulae composed of biologically relevant chemical elements.
Results: Of 1532 metabolic pathways curated in the MetaCyc database, 412 contain a metabolite having a chemical formula unique to that metabolic pathway. Thus, chemical formulae alone can suffice to infer the presence of some metabolic pathways. Of 248,928 unique chemical formulae selected from the PubChem database, more than 95% had at least one degeneracy on the basis of accurate mass information alone. Consideration of natural isotopic abundance reduced degeneracy to 64%, but mainly for formulae less than 500 Da in molecular weight, and only if the error in the relative isotopic peak intensity was less than 10%. Exact C and N atom counts determined by SIL reduced degeneracy further, yielding a unique chemical formula for 55% of the PubChem formulae.
Conclusions: To facilitate the assignment of chemical formulae to unknown mass-spectral features, profiling can be performed on cultures uniformly labeled with stable isotopes of nitrogen (15N) or carbon (13C). This makes it possible to accurately count the number of carbon and nitrogen atoms in each molecule, providing a robust means of reducing the degeneracy of chemical space. Unique chemical formulae can thus be obtained for features measured in untargeted metabolomics even when their mass is greater than 500 Da, even when relative errors in measured isotopic peak intensity exceed 10%, and without the use of a chemical formula generator dependent on heuristic filtering. These chemical formulae can serve as indicators for the presence of particular metabolic pathways.
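The degeneracy argument in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the function names (`atom_count`, `candidate_formulae`), the restriction to C, H, N and O, the atom-count limits, and the 5 ppm tolerance are all assumptions chosen for illustration, and no valence or heuristic filtering is applied. The sketch first estimates an atom count from the mass shift between unlabeled and fully labeled forms of a compound, then enumerates every formula within the mass tolerance, optionally with C and N counts fixed.

```python
from itertools import product

# Monoisotopic masses (Da) of the most abundant isotope of each element.
MONO = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146}
# Mass gained per atom on full labeling: 12C -> 13C and 14N -> 15N.
SHIFT = {"C": 1.0033548, "N": 0.9970349}
ELEMENTS = ("C", "H", "N", "O")


def formula_mass(counts):
    """Monoisotopic mass of a formula given as {element: count}."""
    return sum(MONO[e] * counts.get(e, 0) for e in ELEMENTS)


def atom_count(unlabeled_mass, labeled_mass, element):
    """Estimate the number of C or N atoms from the labeling mass shift."""
    return round((labeled_mass - unlabeled_mass) / SHIFT[element])


def candidate_formulae(mass, ppm=5.0, max_atoms=(20, 40, 6, 10), fixed=None):
    """Brute-force list of formulae within `ppm` of `mass`.

    `fixed` pins selected element counts (e.g. C and N known from SIL);
    all other elements range from 0 to the hypothetical limit in `max_atoms`.
    """
    fixed = fixed or {}
    tol = mass * ppm * 1e-6
    ranges = [
        [fixed[e]] if e in fixed else range(limit + 1)
        for e, limit in zip(ELEMENTS, max_atoms)
    ]
    hits = []
    for counts in product(*ranges):
        formula = dict(zip(ELEMENTS, counts))
        if abs(formula_mass(formula) - mass) <= tol:
            hits.append(formula)
    return hits


if __name__ == "__main__":
    phe = {"C": 9, "H": 11, "N": 1, "O": 2}  # phenylalanine
    m = formula_mass(phe)
    n_c = atom_count(m, m + 9 * SHIFT["C"], "C")
    n_n = atom_count(m, m + 1 * SHIFT["N"], "N")
    print(len(candidate_formulae(m)), "candidates from mass alone")
    print(len(candidate_formulae(m, fixed={"C": n_c, "N": n_n})),
          "candidates with C and N counts fixed")
```

Fixing the C and N counts collapses two of the four search dimensions, which is why the labeled search can only return a subset of the mass-only candidates. A realistic search would also cover further elements such as P and S and would work with measured, noisy masses.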
Pages: 8
Related papers
22 in total
[1]   Metabolite Identification in Synechococcus sp. PCC 7002 Using Untargeted Stable Isotope Assisted Metabolite Profiling [J].
Baran, Richard ;
Bowen, Benjamin P. ;
Bouskill, Nicholas J. ;
Brodie, Eoin L. ;
Yannone, Steven M. ;
Northen, Trent R. .
ANALYTICAL CHEMISTRY, 2010, 82 (21) :9034-9042
[2]   Mass spectrometry based metabolomics and enzymatic assays for functional genomics [J].
Baran, Richard ;
Reindl, Wolfgang ;
Northen, Trent R. .
CURRENT OPINION IN MICROBIOLOGY, 2009, 12 (05) :547-552
[3]   Biochemistry's new look [J].
Blow, Nathan .
NATURE, 2008, 455 (7213) :697-700
[4]   Towards de novo identification of metabolites by analyzing tandem mass spectra [J].
Boecker, Sebastian ;
Rasche, Florian .
BIOINFORMATICS, 2008, 24 (16) :I49-I55
[5]   SIRIUS: decomposing isotope patterns for metabolite identification [J].
Boecker, Sebastian ;
Letzel, Matthias C. ;
Liptak, Zsuzsanna ;
Pervukhin, Anton .
BIOINFORMATICS, 2009, 25 (02) :218-224
[6]  
Caspi R, 2008, NUCLEIC ACIDS RES, V36, pD623, DOI [10.1093/nar/gkm900, 10.1093/nar/gkt1103]
[7]   Untargeted large-scale plant metabolomics using liquid chromatography coupled to mass spectrometry [J].
De Vos, Ric C. H. ;
Moco, Sofia ;
Lommen, Arjen ;
Keurentjes, Joost J. B. ;
Bino, Raoul J. ;
Hall, Robert D. .
NATURE PROTOCOLS, 2007, 2 (04) :778-791
[8]   13C Isotope-Labeled Metabolomes Allowing for Improved Compound Annotation and Relative Quantification in Liquid Chromatography-Mass Spectrometry-based Metabolomic Research [J].
Giavalisco, Patrick ;
Koehl, Karin ;
Hummel, Jan ;
Seiwert, Bettina ;
Willmitzer, Lothar .
ANALYTICAL CHEMISTRY, 2009, 81 (15) :6546-6551
[9]   High-Resolution Direct Infusion-Based Mass Spectrometry in Combination with Whole 13C Metabolome Isotope Labeling Allows Unambiguous Assignment of Chemical Sum Formulas [J].
Giavalisco, Patrick ;
Hummel, Jan ;
Lisec, Jan ;
Inostroza, Alvaro Cuadros ;
Catchpole, Gareth ;
Willmitzer, Lothar .
ANALYTICAL CHEMISTRY, 2008, 80 (24) :9417-9425
[10]   Stable isotope assisted assignment of elemental compositions for metabolomics [J].
Hegeman, Adrian D. ;
Schulte, Christopher F. ;
Cui, Qiu ;
Lewis, Ian A. ;
Huttlin, Edward L. ;
Eghbalnia, Hamid ;
Harms, Amy C. ;
Ulrich, Eldon L. ;
Markley, John L. ;
Sussman, Michael R. .
ANALYTICAL CHEMISTRY, 2007, 79 (18) :6912-6921