PepHMM: a hidden Markov model based scoring function for mass spectrometry database search

被引:33
作者
Wan, YH
Yang, A
Chen, T [1 ]
机构
[1] Univ So Calif, Dept Biol, Los Angeles, CA 90089 USA
[2] Univ So Calif, Dept Math, Los Angeles, CA 90089 USA
[3] Univ So Calif, Dept Pharmaceut Sci, Los Angeles, CA 90089 USA
关键词
D O I
10.1021/ac051319a
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
An accurate scoring function for database search is crucial for peptide identification using tandem mass spectrometry. Although many mathematical models have been proposed to score peptides against tandem mass spectra, our method (called PepHMM, http://msms.cmb.usc.edu) is unique in that it combines information on machine accuracy, mass peak intensity, and correlation among ions into a hidden Markov model (HMM). In addition, we develop a method to calculate statistical significance of the HMM scores. We implement the method and test them on two sets of experimental data generated by two different types of mass spectrometers and compare the results with MASCOT and SEQUEST under the same condition. One experimental results show that PepHMM has a much higher accuracy (with 6.5% error rate) than MASCOT (with 17.4% error rate), and the other experimental results show that PepHMM identifies 43 and 31% more correct spectra than SEQUEST and MASCOT, respectively.
引用
收藏
页码:432 / 437
页数:6
相关论文
共 60 条
[1]   Mass spectrometry-based proteomics [J].
Aebersold, R ;
Mann, M .
NATURE, 2003, 422 (6928) :198-207
[2]   A new algorithm for the evaluation of shotgun peptide sequencing in proteomics: Support vector machine classification of peptide MS/MS spectra and SEQUEST scores [J].
Anderson, DC ;
Li, WQ ;
Payan, DG ;
Noble, WS .
JOURNAL OF PROTEOME RESEARCH, 2003, 2 (02) :137-146
[3]  
Bafna V, 2001, Bioinformatics, V17 Suppl 1, pS13
[4]  
BAFNA V, 2003, P 7 ANN INT C COMP M
[5]   Reducing mass degeneracy in SAR by MS by stable isotopic labeling [J].
Bailey-Kellogg, C ;
Kelley, JJ ;
Stein, C ;
Donald, BR .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (01) :19-36
[6]   Shotgun protein sequencing by tandem mass spectra assembly [J].
Bandeira, N ;
Tang, HX ;
Bafna, V ;
Pevzner, P .
ANALYTICAL CHEMISTRY, 2004, 76 (24) :7221-7233
[7]  
BERN M, 2005, RECOMB
[8]   Automatic Quality Assessment of Peptide Tandem Mass Spectra [J].
Bern, Marshall ;
Goldberg, David ;
McDonald, W. Hayes ;
Yates, John R., III .
BIOINFORMATICS, 2004, 20 :49-54
[9]   A dynamic programming approach to de novo peptide sequencing via tandem mass spectrometry [J].
Chen, T ;
Kao, MY ;
Tepel, M ;
Rush, J ;
Church, GM .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (03) :325-337
[10]   Algorithms for identifying protein cross-links via tandem mass spectrometry [J].
Chen, T ;
Jaffe, JD ;
Church, GM .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (06) :571-583