Implementation and uses of automated de novo peptide sequencing by tandem mass spectrometry

Cited by: 217
Authors
Taylor, JA [1]
Johnson, RS [1]
Affiliation
[1] Immunex Res & Dev Corp, Seattle, WA 98101 USA
DOI
10.1021/ac001196o
Chinese Library Classification number
O65 [Analytical Chemistry];
Discipline classification code
070302 ; 081704 ;
Abstract
There are several computer programs that can match peptide tandem mass spectrometry data to their exactly corresponding database sequences, and in most protein identification projects, these programs are utilized in the early stages of data interpretation. However, situations frequently arise where tandem mass spectral data cannot be correlated with any database sequences. In these cases, the unmatched data could be due to peptides derived from novel proteins, allelic or species-derived variants of known proteins, or posttranslational or chemical modifications. Two additional problems are frequently encountered in high-throughput protein identification. First, it is difficult to quickly sift through large amounts of data to identify those spectra that, due to poor signal or contaminants, can be ignored. Second, it is important to find incorrect database matches (false positives). We have chosen to address these difficulties by performing automatic de novo sequencing using a computer program called Lutefisk. Sequence candidates obtained are used as input in a homology-based database search program called CIDentify to identify variants of known proteins. Comparison of database-derived sequences with de novo sequences allows for electronic validation of database matches even if the latter are not completely correct. Modifications to the original Lutefisk program have been implemented to handle data obtained from triple quadrupole, ion trap, and quadrupole/time-of-flight hybrid (Qtof) mass spectrometers. For example, the linearity of mass errors due to temperature-dependent expansion of the flight tube in a Qtof was exploited such that isobaric amino acids (glutamine/lysine and oxidized methionine/phenylalanine) can be differentiated without careful attention to mass calibration.
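To make the isobaric-pair remark in the abstract concrete, the following is a minimal Python sketch of the arithmetic involved. It is not taken from Lutefisk: the function names `residue_mass` and `assign_residue`, the tolerance values, and the nearest-mass assignment rule are illustrative assumptions. The atomic masses and residue formulas are standard monoisotopic values. The sketch shows that glutamine and lysine residues differ by about 0.036 Da, and oxidized methionine and phenylalanine by about 0.033 Da, so the members of each pair can only be told apart when the mass gap between adjacent fragment ions is known to within a few hundredths of a dalton.

```python
# Standard monoisotopic atomic masses in daltons.
ATOM = {"C": 12.0, "H": 1.00782503, "N": 14.0030740,
        "O": 15.9949146, "S": 31.9720707}

# Elemental compositions of amino acid residues (amino acid minus H2O).
# "M(ox)" is methionine carrying one extra oxygen (methionine sulfoxide).
RESIDUES = {
    "Q": {"C": 5, "H": 8, "N": 2, "O": 2},
    "K": {"C": 6, "H": 12, "N": 2, "O": 1},
    "F": {"C": 9, "H": 9, "N": 1, "O": 1},
    "M(ox)": {"C": 5, "H": 9, "N": 1, "O": 2, "S": 1},
}


def residue_mass(name):
    """Monoisotopic residue mass computed from the elemental composition."""
    return sum(ATOM[element] * count
               for element, count in RESIDUES[name].items())


def assign_residue(gap, candidates, tol):
    """Hypothetical helper: pick the candidate residue whose mass is closest
    to the observed gap between two fragment ions, or return None when even
    the closest candidate lies outside the tolerance `tol` (in Da)."""
    best = min(candidates, key=lambda r: abs(residue_mass(r) - gap))
    return best if abs(residue_mass(best) - gap) <= tol else None


if __name__ == "__main__":
    print(f"K - Q     = {residue_mass('K') - residue_mass('Q'):.4f} Da")
    print(f"F - M(ox) = {residue_mass('F') - residue_mass('M(ox)'):.4f} Da")
    # With a 0.01 Da tolerance the pair is resolved.
    print(assign_residue(128.095, ["Q", "K"], tol=0.01))
```

With a tolerance much wider than the 0.03 Da spacing (for example 0.5 Da), both members of a pair fall inside the window and the nearest-mass choice is no longer meaningful, which is why mass accuracy matters for these assignments.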
Pages: 2594-2604
Page count: 11
Related papers
40 in total
[1]   FAST ALGORITHM FOR PEPTIDE SEQUENCING BY MASS-SPECTROSCOPY [J].
BARTELS, C .
BIOMEDICAL AND ENVIRONMENTAL MASS SPECTROMETRY, 1990, 19 (06) :363-368
[2]   NOMENCLATURE FOR PEPTIDE FRAGMENT IONS (POSITIVE-IONS) [J].
BIEMANN, K .
METHODS IN ENZYMOLOGY, 1990, 193 :886-887
[3]  
BIEMANN K, 1966, J AM CHEM SOC, V88, P5598
[4]   Role of accurate mass measurement (±10 ppm) in protein identification strategies employing MS or MS/MS and database searching [J].
Clauser, KR ;
Baker, P ;
Burlingame, AL .
ANALYTICAL CHEMISTRY, 1999, 71 (14) :2871-2882
[5]   A poxvirus-encoded semaphorin induces cytokine production from monocytes and binds to a novel cellular semaphorin receptor, VESPR [J].
Comeau, MR ;
Johnson, R ;
DuBose, RF ;
Petersen, M ;
Gearing, P ;
VandenBos, T ;
Park, L ;
Farrah, T ;
Buller, RM ;
Cohen, JI ;
Strockbine, LD ;
Rauch, C ;
Spriggs, MK .
IMMUNITY, 1998, 8 (04) :473-482
[6]   De novo peptide sequencing via tandem mass spectrometry [J].
Dancík, V ;
Addona, TA ;
Clauser, KR ;
Vath, JE ;
Pevzner, PA .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1999, 6 (3-4) :327-342
[7]   AN APPROACH TO CORRELATE TANDEM MASS-SPECTRAL DATA OF PEPTIDES WITH AMINO-ACID-SEQUENCES IN A PROTEIN DATABASE [J].
ENG, JK ;
MCCORMACK, AL ;
YATES, JR .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 1994, 5 (11) :976-989
[8]   Examination of micro-tip reversed-phase liquid chromatographic extraction of peptide pools for mass spectrometric analysis [J].
Erdjument-Bromage, H ;
Lui, M ;
Lacomis, L ;
Grewal, A ;
Annan, RS ;
McNulty, DE ;
Carr, SA ;
Tempst, P .
JOURNAL OF CHROMATOGRAPHY A, 1998, 826 (02) :167-181
[9]   Protein identification using mass spectrometric information [J].
Fenyö, D ;
Qin, J ;
Chait, BT .
ELECTROPHORESIS, 1998, 19 (06) :998-1005
[10]  
Fernandez-de-Cossio J, 2000, ELECTROPHORESIS, V21, P1694, DOI 10.1002/(SICI)1522-2683(20000501)21:9<1694::AID-ELPS1694>3.0.CO