Robust Automated Mass Spectra Interpretation and Chemical Formula Calculation Using Mixed Integer Linear Programming

被引:8
作者
Baran, Richard [1 ]
Northen, Trent R. [1 ]
机构
[1] Lawrence Berkeley Natl Lab, Div Life Sci, Berkeley, CA 94720 USA
关键词
METABOLOMICS; IDENTIFICATION; SPECTROMETRY; ANNOTATION; REVEALS; RULES;
D O I
10.1021/ac402180c
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Untargeted metabolite profiling using liquid chromatography and mass spectrometry coupled via electrospray ionization is a powerful tool for the discovery of novel natural products, metabolic capabilities, and biomarkers. However, the elucidation of the identities of uncharacterized metabolites from spectral features remains challenging. A critical step in the metabolite identification workflow is the assignment of redundant spectral features (adducts, fragments, multimers) and calculation of the underlying chemical formula. Inspection of the data by experts using computational tools solving partial problems (e.g., chemical formula calculation for individual ions) can be performed to disambiguate alternative solutions and provide reliable results. However, manual curation is tedious and not readily scalable or standardized. Here we describe an automated procedure for the robust automated mass spectra interpretation and chemical formula calculation using mixed integer linear programming optimization (RAMSI). Chemical rules among related ions are expressed as linear constraints and both the spectra interpretation and chemical formula calculation are performed in a single optimization step. This approach is unbiased in that it does not require predefined sets of neutral losses and positive and negative polarity spectra can be combined in a single optimization. The procedure was evaluated with 30 experimental mass spectra and was found to effectively identify the protonated or deprotonated molecule ([M + H](+) or [M - H](-)) while being robust to the presence of background ions. RAMSI provides a much-needed standardized tool for interpreting ions for subsequent identification in untargeted metabolomics workflows.
引用
收藏
页码:9777 / 9784
页数:8
相关论文
共 27 条
[1]   Untargeted metabolic footprinting reveals a surprising breadth of metabolite uptake and release by Synechococcus sp. PCC 7002 [J].
Baran, Richard ;
Bowen, Benjamin P. ;
Northen, Trent R. .
MOLECULAR BIOSYSTEMS, 2011, 7 (12) :3200-3206
[2]   Metabolite Identification in Synechococcus sp. PCC 7002 Using Untargeted Stable Isotope Assisted Metabolite Profiling [J].
Baran, Richard ;
Bowen, Benjamin P. ;
Bouskill, Nicholas J. ;
Brodie, Eoin L. ;
Yannone, Steven M. ;
Northen, Trent R. .
ANALYTICAL CHEMISTRY, 2010, 82 (21) :9034-9042
[3]   Mass spectrometry based metabolomics and enzymatic assays for functional genomics [J].
Baran, Richard ;
Reindl, Wolfgang ;
Northen, Trent R. .
CURRENT OPINION IN MICROBIOLOGY, 2009, 12 (05) :547-552
[4]   Towards de novo identification of metabolites by analyzing tandem mass spectra [J].
Boecker, Sebastian ;
Rasche, Florian .
BIOINFORMATICS, 2008, 24 (16) :I49-I55
[5]   SIRIUS: decomposing isotope patterns for metabolite identification [J].
Boecker, Sebastian ;
Letzel, Matthias C. ;
Liptak, Zsuzsanna ;
Pervukhin, Anton .
BIOINFORMATICS, 2009, 25 (02) :218-224
[6]   Improved genome annotation through untargeted detection of pathway-specific metabolites [J].
Bowen, Benjamin P. ;
Fischer, Curt R. ;
Baran, Richard ;
Banfield, Jillian F. ;
Northen, Trent .
BMC GENOMICS, 2011, 12
[7]   Dealing with the Unknown: Metabolomics and Metabolite Atlases [J].
Bowen, Benjamin P. ;
Northen, Trent R. .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 2010, 21 (09) :1471-1476
[8]   OptKnock: A bilevel programming framework for identifying gene knockout strategies for microbial strain optimization [J].
Burgard, AP ;
Pharkya, P ;
Maranas, CD .
BIOTECHNOLOGY AND BIOENGINEERING, 2003, 84 (06) :647-657
[9]   Statistical heterospectroscopy, an approach to the integrated analysis of NMR and UPLC-MS data sets: Application in metabonomic toxicology studies [J].
Crockford, DJ ;
Holmes, E ;
Lindon, JC ;
Plumb, RS ;
Zirah, S ;
Bruce, SJ ;
Rainville, P ;
Stumpf, CL ;
Nicholson, JK .
ANALYTICAL CHEMISTRY, 2006, 78 (02) :363-371
[10]   A Mixed Integer Linear Optimization Framework for the Identification and Quantification of Targeted Post-translational Modifications of Highly Modified Proteins Using Multiplexed Electron Transfer Dissociation Tandem Mass Spectrometry [J].
DiMaggio, Peter A., Jr. ;
Young, Nicolas L. ;
Baliban, Richard C. ;
Garcia, Benjamin A. ;
Floudas, Christodoulos A. .
MOLECULAR & CELLULAR PROTEOMICS, 2009, 8 (11) :2527-2543