Peptide charge state determination for low-resolution tandem mass spectra

被引:26
作者
Klammer, AA [1 ]
Wu, CC [1 ]
MacCoss, MJ [1 ]
Noble, WS [1 ]
机构
[1] Dept Genome Sci, Seattle, WA 98195 USA
来源
2005 IEEE COMPUTATIONAL SYSTEMS BIOINFORMATICS CONFERENCE, PROCEEDINGS | 2005年
关键词
mass spectrometry; proteomics; charge state; machine learning; support vector machine;
D O I
10.1109/CSB.2005.44
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Mass spectrometry. is a particularly useful technology for the rapid and robust identification of peptides and proteins in complex mixtures. Peptide sequences can be identified by correlating their observed tandem mass spectra (MS/MS) with theoretical spectra of-peptides from a sequence database. Unfortunately, to perform this search the change of the peptide must be known, and current charge state-determination algorithms only discriminate singly from multiply-charged spectra: distinguishing +2 from +3, for example, is unreliable. Thus, search software is forced to search multiply-charged spectra multiple times. To minimize this inefficiency, we present a support vector machine (SVM) that quickly and reliably classifies multiply charged spectra as having either a +2 or +3 precursor peptide ion. By classifying a multiply-charged spectra, we obtain a 40% reduction in search time while maintaining an average of 99% of peptide and 99% of protein identifications originally-obtained from these spectra.
引用
收藏
页码:175 / 185
页数:11
相关论文
共 16 条
[1]   Automatic Quality Assessment of Peptide Tandem Mass Spectra [J].
Bern, Marshall ;
Goldberg, David ;
McDonald, W. Hayes ;
Yates, John R., III .
BIOINFORMATICS, 2004, 20 :49-54
[2]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[3]   Improved peptide charge state assignment [J].
Colinge, J ;
Magnin, J ;
Dessingy, T ;
Giron, M ;
Masselot, A .
PROTEOMICS, 2003, 3 (08) :1434-1440
[4]  
Cristianini N., 2000, Intelligent Data Analysis: An Introduction, DOI 10.1017/CBO9780511801389
[5]   De novo peptide sequencing via tandem mass spectrometry [J].
Dancík, V ;
Addona, TA ;
Clauser, KR ;
Vath, JE ;
Pevzner, PA .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1999, 6 (3-4) :327-342
[6]  
Keller Andrew, 2002, OMICS A Journal of Integrative Biology, V6, P207, DOI 10.1089/153623102760092805
[7]  
Kinter M., 2005, PROTEIN SEQUENCING I, V9
[8]   Probability-based validation of protein identifications using a modified SEQUEST algorithm [J].
MacCoss, MJ ;
Wu, CC ;
Yates, JR .
ANALYTICAL CHEMISTRY, 2002, 74 (21) :5593-5599
[9]   Direct analysis and identification of proteins in mixtures by LC/MS/MS and database searching at the low-femtomole level [J].
McCormack, AL ;
Schieltz, DM ;
Goode, B ;
Yang, S ;
Barnes, G ;
Drubin, D ;
Yates, JR .
ANALYTICAL CHEMISTRY, 1997, 69 (04) :767-776
[10]   BASIC PRINCIPLES OF ROC ANALYSIS [J].
METZ, CE .
SEMINARS IN NUCLEAR MEDICINE, 1978, 8 (04) :283-298