Comparison of different search engines using validated MS/MS test datasets

被引:33
作者
Boutilier, K
Ross, M
Podtelejnikov, AV
Orsi, C
Taylor, R
Taylor, P
Figeys, D
机构
[1] Univ Ottawa, Dept Biochem Microbiol & Immunol, Ottawa, ON, Canada
[2] MDS Proteom, Toronto, ON M9W 7H4, Canada
[3] MDS Proteom, Charlottesville, VA USA
[4] MDS Inc, DK-5230 Odense, Denmark
关键词
MS/MS search engines; proteomics; mass spectrometry; MUDPIT;
D O I
10.1016/j.aca.2004.04.047
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Massive amounts of tandem mass spectra are produced in high-throughput proteomics studies. The manual interpretation of these spectra is not feasible. Instead, search engines are used to match the tandem mass spectra with sequence information contained in proteomics and genomics databases. Typically, these search engines provide a list of the best matching peptide sequences for an individual tandem mass spectrum. As well, they provide scores that are somewhat related to the confidence level in the match. Many peptide tandem mass spectra search engines have been reported. These search engines provide very different results depending on the type of mass spectrometers used and their input parameters. Here we describe a comparative analysis of different search engines using validated test sets of tandem mass spectra. We have defined test sets of MS/MS spectra derived from high throughput proteomics experiments performed by HPLC-ESI-MS/MS on ion trap (LCQ) and tandem quadrupole time-of-flight instruments with a pulsar functionality (Qstar Pulsar) mass spectrometers. We analyzed the ability of the different search engines to identify the correct peptides, and the cross-validations of the different search engines. (c) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:11 / 20
页数:10
相关论文
共 25 条
[1]   A new algorithm for the evaluation of shotgun peptide sequencing in proteomics: Support vector machine classification of peptide MS/MS spectra and SEQUEST scores [J].
Anderson, DC ;
Li, WQ ;
Payan, DG ;
Noble, WS .
JOURNAL OF PROTEOME RESEARCH, 2003, 2 (02) :137-146
[2]  
Bafna V, 2001, Bioinformatics, V17 Suppl 1, pS13
[3]   OLAV: Towards high-throughput tandem mass spectrometry data identification [J].
Colinge, J ;
Masselot, A ;
Giron, M ;
Dessingy, T ;
Magnin, J .
PROTEOMICS, 2003, 3 (08) :1454-1463
[4]   A statistical basis for testing the significance of mass spectrometric protein identification results [J].
Eriksson, J ;
Chait, BT ;
Fenyö, D .
ANALYTICAL CHEMISTRY, 2000, 72 (05) :999-1005
[5]   A method for assessing the statistical significance of mass spectrometry-based protein identifications using general scoring schemes [J].
Fenyö, D ;
Beavis, RC .
ANALYTICAL CHEMISTRY, 2003, 75 (04) :768-774
[6]  
Field HI, 2002, PROTEOMICS, V2, P36, DOI 10.1002/1615-9861(200201)2:1<36::AID-PROT36>3.3.CO
[7]  
2-N
[8]   Preprocessing of tandem mass spectrometric data to support automatic protein identification [J].
Gentzel, M ;
Köcher, T ;
Ponnusamy, S ;
Wilm, M .
PROTEOMICS, 2003, 3 (08) :1597-1610
[9]   Popitam: Towards new heuristic strategies to improve protein identification from tandem mass spectrometry data [J].
Hernandez, P ;
Gras, R ;
Frey, J ;
Appel, RD .
PROTEOMICS, 2003, 3 (06) :870-878
[10]   Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search [J].
Keller, A ;
Nesvizhskii, AI ;
Kolker, E ;
Aebersold, R .
ANALYTICAL CHEMISTRY, 2002, 74 (20) :5383-5392