Deconvolution and Database Search of Complex Tandem Mass Spectra of Intact Proteins

Cited by: 133
Authors
Liu, Xiaowen [1]
Inbar, Yuval [1]
Dorrestein, Pieter C. [2]
Wynne, Colin [3]
Edwards, Nathan [4]
Souda, Puneet [5]
Whitelegge, Julian P. [5]
Bafna, Vineet [1]
Pevzner, Pavel A. [1]
Affiliations
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Dept Pharmacol Chem & Biochem, La Jolla, CA 92093 USA
[3] Univ Maryland, Dept Chem & Biochem, College Pk, MD 20742 USA
[4] Georgetown Univ, Med Ctr, Dept Biochem & Mol & Cellular Biol, Washington, DC 20007 USA
[5] Univ Calif Los Angeles, Pasarow Mass Spectrometry Lab, Neuropsychiat Inst, Semel Inst, Los Angeles, CA 90024 USA
Funding
National Institutes of Health (NIH), USA;
Keywords
POSTTRANSLATIONAL MODIFICATIONS; MONOISOTOPIC MASSES; SPECTROMETRY; IDENTIFICATION; ALGORITHM;
DOI
10.1074/mcp.M110.002766
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Subject classification codes
071010 ; 081704 ;
Abstract
Top-down proteomics studies intact proteins, enabling new opportunities for analyzing post-translational modifications. Because tandem mass spectra of intact proteins are very complex, spectral deconvolution (grouping peaks into isotopomer envelopes) is a key initial stage for their interpretation. In such spectra, isotopomer envelopes of different protein fragments span overlapping regions on the m/z axis and even share spectral peaks. This raises both pattern recognition and combinatorial challenges for spectral deconvolution. We present MS-Deconv, a combinatorial algorithm for spectral deconvolution. The algorithm first generates a large set of candidate isotopomer envelopes for a spectrum, then represents the spectrum as a graph, and finally selects its highest scoring subset of envelopes as a heaviest path in the graph. In contrast with other approaches, the algorithm scores sets of envelopes rather than individual envelopes. We demonstrate that MS-Deconv improves on Thrash and Xtract in the number of correctly recovered monoisotopic masses and speed. We applied MS-Deconv to a large set of top-down spectra from Yersinia rohdei (with a still unsequenced genome) and further matched them against the protein database of related and sequenced bacterium Yersinia enterocolitica. MS-Deconv is available at http://proteomics.ucsd.edu/Software.html. Molecular & Cellular Proteomics 9:2772-2782, 2010.
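The "heaviest path" step named in the abstract can be illustrated with a toy sketch. This is not the MS-Deconv graph construction, which the abstract does not specify and which must handle envelopes that overlap and share peaks. The sketch assumes a much simpler model: each candidate envelope is a scored m/z interval, overlapping envelopes are treated as incompatible, and an edge joins two envelopes when the first ends before the second begins. The `Envelope` type, its fields, and the `heaviest_path` function are hypothetical names introduced only for this illustration.

```python
from typing import List, NamedTuple, Tuple


class Envelope(NamedTuple):
    """Hypothetical candidate envelope: an m/z interval with a score."""
    mz_start: float
    mz_end: float
    score: float


def heaviest_path(envelopes: List[Envelope]) -> Tuple[float, List[Envelope]]:
    """Return the highest-scoring chain of non-overlapping envelopes.

    Assumption (not from the paper): nodes are envelopes sorted by m/z,
    and an edge u -> v exists when u ends strictly before v starts.
    The heaviest path in this DAG is found by dynamic programming.
    """
    order = sorted(envelopes, key=lambda e: (e.mz_start, e.mz_end))
    n = len(order)
    if n == 0:
        return 0.0, []

    best = [e.score for e in order]  # best path weight ending at node i
    prev = [-1] * n                  # predecessor of node i on that path
    for i in range(n):
        for j in range(i):
            compatible = order[j].mz_end < order[i].mz_start
            if compatible and best[j] + order[i].score > best[i]:
                best[i] = best[j] + order[i].score
                prev[i] = j

    # Trace back from the node where the heaviest path ends.
    end = max(range(n), key=lambda k: best[k])
    path = []
    k = end
    while k != -1:
        path.append(order[k])
        k = prev[k]
    path.reverse()
    return best[end], path


if __name__ == "__main__":
    candidates = [
        Envelope(500.0, 501.0, 3.0),
        Envelope(500.5, 502.0, 5.0),
        Envelope(501.5, 503.0, 4.0),
        Envelope(503.5, 504.0, 2.0),
    ]
    total, chosen = heaviest_path(candidates)
    print(total, chosen)
```

The sketch scores a whole set of envelopes jointly rather than picking each envelope on its own score, which is the contrast the abstract draws with other approaches. In the example, the single best envelope (score 5.0) is left out because a chain of three compatible envelopes scores higher in total.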
Pages: 2772-2782
Page count: 11
Related papers
44 items in total
[1]   Breen EJ, 2000, ELECTROPHORESIS, V21, P2243, DOI 10.1002/1522-2683(20000601)21:11<2243::AID-ELPS2243>3.0.CO;2-K
[3]   Automated intensity descent algorithm for interpretation of complex high-resolution mass spectra [J].
Chen, Li ;
Sze, Siu Kwan ;
Yang, He .
ANALYTICAL CHEMISTRY, 2006, 78 (14) :5006-5018
[4]   The biosynthesis of the thiazole phosphate moiety of thiamin (Vitamin B1): The early steps catalyzed by thiazole synthase [J].
Dorrestein, PC ;
Zhai, HL ;
Taylor, SV ;
McLafferty, FW ;
Begley, TP .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2004, 126 (10) :3091-3096
[5]   The bifunctional glyceryl transferase/phosphatase OzmB belonging to the HAD superfamily that diverts 1,3-bisphosphoglycerate into polyketide biosynthesis [J].
Dorrestein, Pieter C. ;
Van Lanen, Steven G. ;
Li, Wenli ;
Zhao, Chunhua ;
Deng, Zixin ;
Shen, Ben ;
Kelleher, Neil L. .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2006, 128 (32) :10386-10387
[6]   Automatic deconvolution of isotope-resolved mass spectra using variable selection and quantized peptide mass distribution [J].
Du, Peicheng ;
Angeletti, Ruth Hogue .
ANALYTICAL CHEMISTRY, 2006, 78 (10) :3385-3392
[7]   Interpreting top-down mass spectra using spectral alignment [J].
Frank, Ari M. ;
Pesavento, James J. ;
Mizzen, Craig A. ;
Kelleher, Neil L. ;
Pevzner, Pavel A. .
ANALYTICAL CHEMISTRY, 2008, 80 (07) :2499-2505
[8]   Gras R, 1999, ELECTROPHORESIS, V20, P3535, DOI 10.1002/(SICI)1522-2683(19991201)20:18<3535::AID-ELPS3535>3.0.CO;2-J
[10]   Automated reduction and interpretation of high resolution electrospray mass spectra of large molecules [J].
Horn, DM ;
Zubarev, RA ;
McLafferty, FW .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 2000, 11 (04) :320-332