Peptide sequence tags for fast database search in mass-spectrometry

被引:100
作者
Frank, A
Tanner, S
Bafna, V
Pevzner, P
机构
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Bioinformat Program, La Jolla, CA 92093 USA
关键词
tags; tandem mass spectrometry; filtering; database search; PepNovo;
D O I
10.1021/pr050011x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Filtration techniques in the form of rapid elimination of candidate sequences while retaining the true one are key ingredients of database searches in genomics. Although SEQUEST and Mascot perform a conceptually similar task to the tool BLAST, the key algorithmic idea of BLAST (filtration) was never implemented in these tools. As a result MS/MS protein identification tools are becoming too time-consuming for many applications including search for post-translationally modified pepticles. Moreover, matching millions of spectra against all known proteins will soon make these tools too slow in the same way that "genome vs genome" comparisons instantly made BLAST too slow. We describe the development of filters for MS/MS database searches that dramatically reduce the running time and effectively remove the bottlenecks in searching the huge space of protein modifications. Our approach, based on a probability model for determining the accuracy of sequence tags, achieves superior results compared to GutenTag, a popular tag generation algorithm. Our tag generating algorithm along with our de novo sequencing algorithm PepNovo can be accessed via the URL http://peptide.ucsd.edu/.
引用
收藏
页码:1287 / 1295
页数:9
相关论文
共 49 条
[41]   Statistical characterization of ion trap tandem mass spectra from doubly charged tryptic peptides [J].
Tabb, DL ;
Smith, LL ;
Breci, LA ;
Wysocki, VH ;
Lin, D ;
Yates, JR .
ANALYTICAL CHEMISTRY, 2003, 75 (05) :1155-1163
[42]   GutenTag: High-throughput sequence tagging via an empirically derived fragmentation model [J].
Tabb, DL ;
Saraf, A ;
Yates, JR .
ANALYTICAL CHEMISTRY, 2003, 75 (23) :6415-6421
[43]  
TANNER S, 2005, UNPUB INSPECT FAST A
[44]   Implementation and uses of automated de novo peptide sequencing by tandem mass spectrometry [J].
Taylor, JA ;
Johnson, RS .
ANALYTICAL CHEMISTRY, 2001, 73 (11) :2594-2604
[45]  
Taylor JA, 1997, RAPID COMMUN MASS SP, V11, P1067, DOI 10.1002/(SICI)1097-0231(19970615)11:9<1067::AID-RCM953>3.0.CO
[46]  
2-L
[47]   A graph-theoretic approach for the separation of b and y ions in tandem mass spectra [J].
Yan, B ;
Pan, C ;
Olman, VN ;
Hettich, RL ;
Xu, Y .
BIOINFORMATICS, 2005, 21 (05) :563-574
[48]   MINING GENOMES - CORRELATING TANDEM MASS-SPECTRA OF MODIFIED AND UNMODIFIED PEPTIDES TO SEQUENCES IN NUCLEOTIDE DATABASES [J].
YATES, JR ;
ENG, JK ;
MCCORMACK, AL .
ANALYTICAL CHEMISTRY, 1995, 67 (18) :3202-3210
[49]   METHOD TO CORRELATE TANDEM MASS-SPECTRA OF MODIFIED PEPTIDES TO AMINO-ACID-SEQUENCES IN THE PROTEIN DATABASE [J].
YATES, JR ;
ENG, JK ;
MCCORMACK, AL ;
SCHIELTZ, D .
ANALYTICAL CHEMISTRY, 1995, 67 (08) :1426-1436