Comparative evaluation of tandem MS search algorithms using a target-decoy search strategy

被引:105
作者
Balgley, Brian M.
Laudeman, Tom
Yang, Li
Song, Tao
Lee, Cheng S.
机构
[1] Calibrant Bio Syst, Gaithersburg, MD 20878 USA
[2] Univ Virginia, Acad Comp Hlth Sci, Charlottesville, VA 22908 USA
[3] Univ Maryland, Dept Chem & Biochem, College Pk, MD 20742 USA
关键词
D O I
10.1074/mcp.M600469-MCP200
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Peptide identification of tandem mass spectra by a variety of available search algorithms forms the foundation for much of modern day mass spectrometry- based proteomics. Despite the critical importance of proper evaluation and interpretation of the results generated by these algorithms there is still little consistency in their application or understanding of their similarities and differences. A survey was conducted of four tandem mass spectrometry peptide identification search algorithms, including Mascot, Open Mass Spectrometry Search Algorithm, Sequest, and X! Tandem. The same input data, search parameters, and sequence library were used for the searches. Comparisons were based on commonly used scoring methodologies for each algorithm and on the results of a targetdecoy approach to sequence library searching. The results indicated that there is little difference in the output of the algorithms so long as consistent scoring procedures are applied. The results showed that some commonly used scoring procedures may lead to excessive false discovery rates. Finally an alternative method for the determination of an optimal cutoff threshold is proposed.
引用
收藏
页码:1599 / 1608
页数:10
相关论文
共 30 条
[1]   Achieving in-depth proteomics profiling by mass spectrometry [J].
Ahn, Natalie G. ;
Shabb, John B. ;
Old, William M. ;
Resing, Katheryn A. .
ACS CHEMICAL BIOLOGY, 2007, 2 (01) :39-52
[2]   Gel based isoelectric focusing of peptides and the utility of isoelectric point in protein identification [J].
Cargile, BJ ;
Bundy, JL ;
Freeman, TW ;
Stephenson, JL .
JOURNAL OF PROTEOME RESEARCH, 2004, 3 (01) :112-119
[3]   Potential for false positive identifications from large databases through tandem mass spectrometry [J].
Cargile, BJ ;
Bundy, JL ;
Stephenson, JL .
JOURNAL OF PROTEOME RESEARCH, 2004, 3 (05) :1082-1085
[4]   Open source system for analyzing, validating, and storing protein identification data [J].
Craig, R ;
Cortens, JP ;
Beavis, RC .
JOURNAL OF PROTEOME RESEARCH, 2004, 3 (06) :1234-1242
[5]   TANDEM: matching proteins with tandem mass spectra [J].
Craig, R ;
Beavis, RC .
BIOINFORMATICS, 2004, 20 (09) :1466-1467
[6]  
Desiere F, 2005, GENOME BIOL, V6
[7]  
Desiere F, 2006, NUCLEIC ACIDS RES, V34, pD655, DOI [10.1093/nar/gkj040, 10.1007/978-1-60761-444-9_19]
[8]   Comparative evaluation of mass spectrometry platforms used in large-scale proteomics investigations [J].
Elias, JE ;
Haas, W ;
Faherty, BK ;
Gygi, SP .
NATURE METHODS, 2005, 2 (09) :667-675
[9]   Intensity-based protein identification by machine learning from a library of tandem mass spectra [J].
Elias, JE ;
Gibbons, FD ;
King, OD ;
Roth, FP ;
Gygi, SP .
NATURE BIOTECHNOLOGY, 2004, 22 (02) :214-219
[10]   Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry [J].
Elias, Joshua E. ;
Gygi, Steven P. .
NATURE METHODS, 2007, 4 (03) :207-214