Analysis of peptide MS/MS spectra from large-scale proteomics experiments using spectrum libraries

被引:188
作者
Frewen, Barbara E.
Merrihew, Gennifer E.
Wu, Christine C.
Noble, William Stafford
MacCoss, Michael J. [1 ]
机构
[1] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
[2] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA
[3] Univ Colorado, Hlth Sci Ctr, Dept Pharmacol, Aurora, CO 80045 USA
关键词
D O I
10.1021/ac060279n
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
A widespread proteomics procedure for characterizing a complex mixture of proteins combines tandem mass spectrometry and database search software to yield mass spectra with identified peptide sequences. The same peptides are often detected in multiple experiments, and once they have been identified, the respective spectra can be used for future identifications. We present a method for collecting previously identified tandem mass spectra into a reference library that is used to identify new spectra. Query spectra are compared to references in the library to find the ones that are most similar. A dot product metric is used to measure the degree of similarity. With our largest library, the search of a query set finds 91% of the spectrum identifications and 93.7% of the protein identifications that could be made with a SEQUEST database search. A second experiment demonstrates that queries acquired on an LCQ ion trap mass spectrometer can be identified with a library of references acquired on an LTQ ion trap mass spectrometer. The dot product similarity score provides good separation of correct and incorrect identifications.
引用
收藏
页码:5678 / 5684
页数:7
相关论文
共 22 条
[1]   A new algorithm for the evaluation of shotgun peptide sequencing in proteomics: Support vector machine classification of peptide MS/MS spectra and SEQUEST scores [J].
Anderson, DC ;
Li, WQ ;
Payan, DG ;
Noble, WS .
JOURNAL OF PROTEOME RESEARCH, 2003, 2 (02) :137-146
[2]   The use of proteotypic peptide libraries for protein identification [J].
Craig, R ;
Cortens, JP ;
Beavis, RC .
RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 2005, 19 (13) :1844-1850
[3]   Parallel tandem: A program for parallel processing of tandem mass spectra using PVM or MPI and X!Tandem [J].
Duncan, DT ;
Craig, R ;
Link, AJ .
JOURNAL OF PROTEOME RESEARCH, 2005, 4 (05) :1842-1847
[4]   AN APPROACH TO CORRELATE TANDEM MASS-SPECTRAL DATA OF PEPTIDES WITH AMINO-ACID-SEQUENCES IN A PROTEIN DATABASE [J].
ENG, JK ;
MCCORMACK, AL ;
YATES, JR .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 1994, 5 (11) :976-989
[5]   Direct analysis of protein complexes using mass spectrometry [J].
Link, AJ ;
Eng, J ;
Schieltz, DM ;
Carmack, E ;
Mize, GJ ;
Morris, DR ;
Garvik, BM ;
Yates, JR .
NATURE BIOTECHNOLOGY, 1999, 17 (07) :676-682
[6]   MS1, MS2, and SQT - three unified, compact, and easily parsed file formats for the storage of shotgun proteomic spectra and identifications [J].
McDonald, WH ;
Tabb, DL ;
Sadygov, RG ;
MacCoss, MJ ;
Venable, J ;
Graumann, J ;
Johnson, JR ;
Cociorva, D ;
Yates, JR .
RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 2004, 18 (18) :2162-2168
[7]   Qscore: An algorithm for evaluating SEQUEST database search results [J].
Moore, RE ;
Young, MK ;
Lee, TD .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 2002, 13 (04) :378-386
[8]   Evaluation of multidimensional chromatography coupled with tandem mass spectrometry (LC/LC-MS/MS) for large-scale protein analysis: The yeast proteome [J].
Peng, JM ;
Elias, JE ;
Thoreen, CC ;
Licklider, LJ ;
Gygi, SP .
JOURNAL OF PROTEOME RESEARCH, 2003, 2 (01) :43-50
[9]  
Perkins DN, 1999, ELECTROPHORESIS, V20, P3551, DOI 10.1002/(SICI)1522-2683(19991201)20:18<3551::AID-ELPS3551>3.0.CO
[10]  
2-2