Lookup peaks: A hybrid of de novo sequencing and database search for protein identification by tandem mass spectrometry

Cited by: 150
Authors
Bern, Marshall [1]
Cai, Yuhan [1]
Goldberg, David [1]
Affiliations
[1] Palo Alto Res Ctr, Palo Alto, CA 94304 USA
DOI
10.1021/ac0617013
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Discipline classification code
070302; 081704
Abstract
A powerful technique for peptide and protein identification is tandem mass spectrometry followed by database search using a program such as SEQUEST or Mascot. These programs, however, become slow and lose sensitivity when allowing nonspecific cleavages or peptide modifications. De novo sequencing and hybrid methods such as sequence tagging offer speed and robustness for wider searches, yet these approaches require better spectra with more complete and consecutive fragmentation and, hence, are less sensitive to low-abundance peptides. Here we describe a new hybrid method that retains the sensitivity of pure database search. The method uses a small amount of de novo analysis to identify likely b- and y-ion peaks ("lookup peaks") that can then be used to extract candidate peptides from the database, with the number of candidates tunable to fit a computing budget. We describe a program called ByOnic that implements this method, and we benchmark ByOnic on several data sets, including one of mouse blood plasma spiked with low concentrations of recombinant human proteins. We demonstrate that ByOnic is more sensitive than sequence tagging and, indeed, more sensitive than the three most popular pure database search tools (SEQUEST, Mascot, and X!Tandem) on both the peptide and protein levels. On the mouse plasma samples, ByOnic consistently found spiked proteins missed by the other tools.
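The candidate-extraction idea in the abstract can be illustrated with a minimal Python sketch. This is not ByOnic's implementation: the function names (`b_ion_mz`, `peptide_mass`, `extract_candidates`), the `max_peaks` and `tol` parameters, the toy database, and the brute-force scan are all assumptions made for illustration. It also assumes singly charged fragments, no modifications, only the 20 standard amino acids, and that the lookup peaks arrive already ranked by confidence. A real tool would presumably index the database rather than scan every substring.

```python
# Standard monoisotopic residue masses in daltons.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056   # mass of H2O
PROTON = 1.00728   # mass of one proton (charge carrier)


def b_ion_mz(prefix):
    """m/z of the singly charged b-ion formed by a peptide prefix."""
    return sum(RESIDUE_MASS[a] for a in prefix) + PROTON


def peptide_mass(seq):
    """Neutral monoisotopic mass of an unmodified peptide."""
    return sum(RESIDUE_MASS[a] for a in seq) + WATER


def extract_candidates(database, precursor_mass, lookup_peaks,
                       max_peaks=3, tol=0.5):
    """Return (protein, peptide) pairs whose mass matches the precursor
    and whose b- or y-ion ladder hits at least one lookup peak.

    Only the first `max_peaks` lookup peaks are used; raising that
    (hypothetical) knob admits more candidates at more compute cost.
    """
    peaks = list(lookup_peaks)[:max_peaks]
    hits = set()
    for name, seq in database.items():
        # Cumulative residue masses so any substring mass is one subtraction.
        cum = [0.0]
        for a in seq:
            cum.append(cum[-1] + RESIDUE_MASS[a])
        n = len(seq)
        for start in range(n):
            for end in range(start + 1, n + 1):
                total = cum[end] - cum[start] + WATER
                if total > precursor_mass + tol:
                    break  # longer substrings only get heavier
                if abs(total - precursor_mass) > tol:
                    continue
                # Check every internal cleavage site for a matching b or y ion.
                for cut in range(start + 1, end):
                    b = cum[cut] - cum[start] + PROTON
                    y = cum[end] - cum[cut] + WATER + PROTON
                    if any(abs(b - p) <= tol or abs(y - p) <= tol
                           for p in peaks):
                        hits.add((name, seq[start:end]))
                        break
    return hits


# Toy usage: made-up sequences, with one lookup peak placed on a b-ion.
toy_db = {"P1": "MKTAYIAKQR", "P2": "GAVLIKPEPTIDER"}
found = extract_candidates(toy_db, peptide_mass("PEPTIDER"),
                           [b_ion_mz("PEP")])
print(sorted(found))
```

In this sketch the precursor mass does the coarse filtering and the lookup peaks prune the survivors. The sketch makes no enzyme-specificity assumption, so nonspecific cleavages are covered by the substring scan.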
Pages: 1393-1400
Page count: 8