Peptide Identification from Mixture Tandem Mass Spectra

被引:62
作者
Wang, Jian [3 ]
Perez-Santiago, Josue [3 ]
Katz, Jonathan E. [2 ]
Mallick, Parag [2 ]
Bandeira, Nuno [1 ]
机构
[1] Univ Calif San Diego, Ctr Computat Mass Spectrometry, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[2] Univ Calif Los Angeles, Dept Chem & Biochem, Los Angeles, CA 90095 USA
[3] Univ Calif San Diego, Bioinformat Program, La Jolla, CA 92093 USA
基金
美国国家卫生研究院;
关键词
PROTEIN IDENTIFICATION; PROTEOMICS DATA; SPECTROMETRY; MS/MS; LIBRARIES; SOFTWARE; STRATEGY; DATABASE; SEARCH; MODEL;
D O I
10.1074/mcp.M000136-MCP201
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The success of high-throughput proteomics hinges on the ability of computational methods to identify peptides from tandem mass spectra (MS/MS). However, a common limitation of most peptide identification approaches is the nearly ubiquitous assumption that each MS/MS spectrum is generated from a single peptide. We propose a new computational approach for the identification of mixture spectra generated from more than one peptide. Capitalizing on the growing availability of large libraries of single-peptide spectra (spectral libraries), our quantitative approach is able to identify up to 98% of all mixture spectra from equally abundant peptides and automatically adjust to varying abundance ratios of up to 10:1. Furthermore, we show how theoretical bounds on spectral similarity avoid the need to compare each experimental spectrum against all possible combinations of candidate peptides (achieving speedups of over five orders of magnitude) and demonstrate that mixture-spectra can be identified in a matter of seconds against proteome-scale spectral libraries. Although our approach was developed for and is demonstrated on peptide spectra, we argue that the generality of the methods allows for their direct application to other types of spectral libraries and mixture spectra. Molecular & Cellular Proteomics 9:1476-1485, 2010.
引用
收藏
页码:1476 / 1485
页数:10
相关论文
共 27 条
[1]   RELIABILITY RANKING AND SCALING IMPROVEMENTS TO THE PROBABILITY BASED MATCHING SYSTEM FOR UNKNOWN MASS-SPECTRA [J].
ATWATER, BL ;
STAUFFER, DB ;
MCLAFFERTY, FW ;
PETERSON, DW .
ANALYTICAL CHEMISTRY, 1985, 57 (04) :899-903
[2]   Shotgun protein sequencing by tandem mass spectra assembly [J].
Bandeira, N ;
Tang, HX ;
Bafna, V ;
Pevzner, P .
ANALYTICAL CHEMISTRY, 2004, 76 (24) :7221-7233
[3]   Protein identification by spectral networks analysis [J].
Bandeira, Nuno ;
Tsur, Dekel ;
Frank, Ari ;
Pevzner, Pavel A. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (15) :6140-6145
[4]   Use of an integrated MS-multiplexed MS/MS data acquisition strategy for high-coverage peptide mapping studies [J].
Chakraborty, Asish B. ;
Berger, Scott J. ;
Gebler, John C. .
RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 2007, 21 (05) :730-744
[5]   Using annotated peptide mass spectrum libraries for protein identification [J].
Craig, R. ;
Cortens, J. C. ;
Fenyo, D. ;
Beavis, R. C. .
JOURNAL OF PROTEOME RESEARCH, 2006, 5 (08) :1843-1849
[6]   TANDEM: matching proteins with tandem mass spectra [J].
Craig, R ;
Beavis, RC .
BIOINFORMATICS, 2004, 20 (09) :1466-1467
[7]   Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry [J].
Elias, Joshua E. ;
Gygi, Steven P. .
NATURE METHODS, 2007, 4 (03) :207-214
[8]   AN APPROACH TO CORRELATE TANDEM MASS-SPECTRAL DATA OF PEPTIDES WITH AMINO-ACID-SEQUENCES IN A PROTEIN DATABASE [J].
ENG, JK ;
MCCORMACK, AL ;
YATES, JR .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 1994, 5 (11) :976-989
[9]   ProteomeCommons.org IO Framework: reading and writing multiple proteomics data formats [J].
Falkner, J. A. ;
Falkner, J. W. ;
Andrews, P. C. .
BIOINFORMATICS, 2007, 23 (02) :262-263
[10]   NovoHMM: A hidden Markov model for de novo peptide sequencing [J].
Fischer, B ;
Roth, V ;
Roos, F ;
Grossmann, J ;
Baginsky, S ;
Widmayer, P ;
Gruissem, W ;
Buhmann, JM .
ANALYTICAL CHEMISTRY, 2005, 77 (22) :7265-7273