Automatic validation of phosphopeptide identifications from tandem mass spectra

被引:73
作者
Lu, Bingwen [1 ]
Ruse, Cristian [1 ]
Xu, Tao [1 ]
Park, Sung Kyu [1 ]
Yates, John, III [1 ]
机构
[1] Scripps Res Inst, Dept Cell Biol, La Jolla, CA 92037 USA
关键词
D O I
10.1021/ac061334v
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
We developed and compared two approaches for automated validation of phosphopeptide tandem mass spectra identified using database searching algorithms. Phosphopeptide identifications were obtained through SEQUEST searches of a protein database appended with its decoy (reversed sequences). Statistical evaluation and iterative searches were employed to create a high-quality data set of phosphopeptides. Automation of postsearch validation was approached by two different strategies. By using statistical multiple testing, we calculate a p value for each tentative peptide phosphorylation. In a second method, we use a support vector machine (SVM; a machine learning algorithm) binary classifier to predict whether a tentative peptide phosphorylation is true. We show good agreement (85%) between postsearch validation of phosphopeptide/spectrum matches by multiple testing and that from support vector machines. Automatic methods conform very well with manual expert validation in a blinded test. Additionally, the algorithms were tested on the identification of synthetic phosphopeptides. We show that phosphate neutral losses in tandem mass spectra can be used to assess the correctness of phosphopeptide/spectrum matches. An SVM classifier with a radial basis function provided classification accuracy from 95.7% to 96.8% of the positive data set, depending on search algorithm used. Establishing the efficacy of an identification is a necessary step for further postsearch interrogation of the spectra for complete localization of phosphorylation sites. Our current implementation performs validation of phosphoserine/phosphothreonine-containing peptides having one or two phosphorylation sites from data gathered on an ion trap mass spectrometer. The SVM-based algorithm has been implemented in the software package DeBunker. We illustrate the application of the SVM-based software DeBunker on a large phosphorylation data set.
引用
收藏
页码:1301 / 1310
页数:10
相关论文
共 35 条
[1]   A new algorithm for the evaluation of shotgun peptide sequencing in proteomics: Support vector machine classification of peptide MS/MS spectra and SEQUEST scores [J].
Anderson, DC ;
Li, WQ ;
Payan, DG ;
Noble, WS .
JOURNAL OF PROTEOME RESEARCH, 2003, 2 (02) :137-146
[2]   Large-scale characterization of HeLa cell nuclear phosphoproteins [J].
Beausoleil, SA ;
Jedrychowski, M ;
Schwartz, D ;
Elias, JE ;
Villén, J ;
Li, JX ;
Cohn, MA ;
Cantley, LC ;
Gygi, SP .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (33) :12130-12135
[3]   A probability-based approach for high-throughput protein phosphorylation analysis and site localization [J].
Beausoleil, Sean A. ;
Villen, Judit ;
Gerber, Scott A. ;
Rush, John ;
Gygi, Steven P. .
NATURE BIOTECHNOLOGY, 2006, 24 (10) :1285-1292
[4]   Automatic Quality Assessment of Peptide Tandem Mass Spectra [J].
Bern, Marshall ;
Goldberg, David ;
McDonald, W. Hayes ;
Yates, John R., III .
BIOINFORMATICS, 2004, 20 :49-54
[5]   Robust phosphoproteomic profiling of tyrosine phosphorylation sites from human T cells using immobilized metal affinity chromatography and tandem mass spectrometry [J].
Brill, LM ;
Salomon, AR ;
Ficarro, SB ;
Mukherji, M ;
Stettler-Gill, M ;
Peters, EC .
ANALYTICAL CHEMISTRY, 2004, 76 (10) :2763-2772
[6]   Strategies for shotgun identification of post-translational modifications by mass spectrometry [J].
Cantin, GT ;
Yates, JR .
JOURNAL OF CHROMATOGRAPHY A, 2004, 1053 (1-2) :7-14
[7]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[9]  
COCIORVA D, 2006, 54 ASMS C MASS SPECT
[10]   The role of electron capture dissociation in biomolecular analysis [J].
Cooper, HJ ;
Håkansson, K ;
Marshall, AG .
MASS SPECTROMETRY REVIEWS, 2005, 24 (02) :201-222