Robust accurate identification of peptides (RAId):: deciphering MS2 data using a structured library search with de novo based statistics

被引:15
作者
Alves, G [1 ]
Yu, YK [1 ]
机构
[1] Natl Lib Med, Natl Ctr Biotechnol Informat, NIH, Bethesda, MD 20894 USA
关键词
D O I
10.1093/bioinformatics/bti620
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The key to MS -based proteomics is peptide sequencing. The major challenge in peptide sequencing, whether library search or de novo, is to better infer statistical significance and better attain noise reduction. Since the noise in a spectrum depends on experimental conditions, the instrument used and many other factors, it cannot be predicted even if the peptide sequence is known. The characteristics of the noise can only be uncovered once a spectrum is given. We wish to overcome such issues. Results: We designed RAId to identify peptides from their associated tandem mass spectrometry data. RAId performs a novel de novo sequencing followed by a search in a peptide library that we created. Through de novo sequencing, we establish the spectrum-specific background score statistics for the library search. When the database search fails to return significant hits, the top-ranking de novo sequences become potential candidates for new peptides that are not yet in the database. The use of spectrum-specific background statistics seems to enable RAId to perform well even when the spectral quality is marginal. Other important features of RAId include its potential in de novo sequencing alone and the ease of incorporating post-translational modifications.
引用
收藏
页码:3726 / 3732
页数:7
相关论文
共 34 条
[1]  
ADERSON DC, 2003, J PROTEOME RES, V2, P137
[2]  
ALVES G, UNPUB
[3]  
[Anonymous], 2001, Bioinformatics
[4]   NOMENCLATURE FOR PEPTIDE FRAGMENT IONS (POSITIVE-IONS) [J].
BIEMANN, K .
METHODS IN ENZYMOLOGY, 1990, 193 :886-887
[5]   A dynamic programming approach to de novo peptide sequencing via tandem mass spectrometry [J].
Chen, T ;
Kao, MY ;
Tepel, M ;
Rush, J ;
Church, GM .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (03) :325-337
[6]  
CLAUSER KR, 1996, P 44 ASMS C MASS SPE, P365
[7]   FOURIER-TRANSFORM ION-CYCLOTRON RESONANCE SPECTROSCOPY [J].
COMISAROW, MB ;
MARSHALL, AG .
CHEMICAL PHYSICS LETTERS, 1974, 25 (02) :282-283
[8]   TANDEM: matching proteins with tandem mass spectra [J].
Craig, R ;
Beavis, RC .
BIOINFORMATICS, 2004, 20 (09) :1466-1467
[9]   De novo peptide sequencing via tandem mass spectrometry [J].
Dancík, V ;
Addona, TA ;
Clauser, KR ;
Vath, JE ;
Pevzner, PA .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1999, 6 (3-4) :327-342
[10]   Ranked solutions to a class of combinatorial optimizations - with applications in mass spectrometry based peptide sequencing and a variant of directed paths in random media [J].
Doerr, TP ;
Alves, G ;
Yu, YK .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2005, 354 :558-570