Identification of bacteria using tandem mass spectrometry combined with a proteome database and statistical scoring

被引:79
作者
Dworzanski, JP [1 ]
Snyder, AP
Chen, R
Zhang, HY
Wishart, D
Li, L
机构
[1] Geocenters Inc, Aberdeen Proving Ground, MD 21010 USA
[2] USA, Edgewood Chem Biol Ctr, Aberdeen Proving Ground, MD 21010 USA
[3] Univ Alberta, Dept Chem, Edmonton, AB T6G 2G2, Canada
[4] Univ Alberta, Dept Biol Sci, Edmonton, AB T6G 2N8, Canada
关键词
D O I
10.1021/ac0349781
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Detection and identification of pathogenic bacteria and their protein toxins play a crucial role in a proper response to natural or terrorist-caused outbreaks of infectious diseases. The recent availability of whole genome sequences of priority bacterial pathogens opens new diagnostic possibilities for identification of bacteria by retrieving their genomic or proteomic information. We describe a method for identification of bacteria based on tandem mass spectrometric (MS/MS) analysis of peptides derived from bacterial proteins. This method involves bacterial cell protein extraction, trypsin digestion, liquid chromatography MS/MS analysis of the resulting peptides, and a statistical scoring algorithm to rank MS/MS spectral matching results for bacterial identification. To facilitate spectral data searching, a proteome database was constructed by translating genomes of bacteria of interest with fully or partially determined sequences. In this work, a prototype database was constructed by the automated analysis of 87 publicly available, fully sequenced bacterial genomes with the GLIMMER gene finding software. MS/MS peptide spectral matching for peptide sequence assignment against this proteome database was done by SEQUEST. To gauge the relative significance of the SEQUEST-generated matching parameters for correct peptide assignment, discriminant function (DF) analysis of these parameters was applied and DF scores were used to calculate probabilities of correct MS/MS spectra assignment to peptide sequences in the database. The peptides with DF scores exceeding a threshold value determined by the probability of correct peptide assignment were accepted and matched to the bacterial proteomes represented in the database. Sequence filtering or removal of degenerate peptides matched with multiple bacteria was then performed to further improve identification. It is demonstrated that using a preset criterion with known distributions of discriminant function scores and probabilities of correct peptide sequence assignments, a test bacterium within the 87 database microorganisms can be unambiguously identified.
引用
收藏
页码:2355 / 2366
页数:12
相关论文
共 39 条
[1]   Mass spectrometry in proteomics [J].
Aebersold, R ;
Goodlett, DR .
CHEMICAL REVIEWS, 2001, 101 (02) :269-295
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   A new algorithm for the evaluation of shotgun peptide sequencing in proteomics: Support vector machine classification of peptide MS/MS spectra and SEQUEST scores [J].
Anderson, DC ;
Li, WQ ;
Payan, DG ;
Noble, WS .
JOURNAL OF PROTEOME RESEARCH, 2003, 2 (02) :137-146
[4]   Observation of Escherichia coli ribosomal proteins and their posttranslational modifications by mass spectrometry [J].
Arnold, RJ ;
Reilly, JP .
ANALYTICAL BIOCHEMISTRY, 1999, 269 (01) :105-112
[5]   The complete genome sequence of Escherichia coli K-12 [J].
Blattner, FR ;
Plunkett, G ;
Bloch, CA ;
Perna, NT ;
Burland, V ;
Riley, M ;
ColladoVides, J ;
Glasner, JD ;
Rode, CK ;
Mayhew, GF ;
Gregor, J ;
Davis, NW ;
Kirkpatrick, HA ;
Goeden, MA ;
Rose, DJ ;
Mau, B ;
Shao, Y .
SCIENCE, 1997, 277 (5331) :1453-+
[6]   Searching the Porphyromonas gingivalis genome with peptide fragmentation mass spectra [J].
Chen, WB ;
Laidig, KE ;
Park, Y ;
Park, K ;
Yates, JR ;
Lamont, RJ ;
Hackett, M .
ANALYST, 2001, 126 (01) :52-57
[7]   Improved microbial gene identification with GLIMMER [J].
Delcher, AL ;
Harmon, D ;
Kasif, S ;
White, O ;
Salzberg, SL .
NUCLEIC ACIDS RESEARCH, 1999, 27 (23) :4636-4641
[8]   Microorganism identification by mass spectrometry and protein database searches [J].
Demirev, PA ;
Ho, YP ;
Ryzhov, V ;
Fenselau, C .
ANALYTICAL CHEMISTRY, 1999, 71 (14) :2732-2738
[9]   Biodiversity: Microbial genomes multiply [J].
Doolittle, RF .
NATURE, 2002, 416 (6882) :697-700
[10]   AN APPROACH TO CORRELATE TANDEM MASS-SPECTRAL DATA OF PEPTIDES WITH AMINO-ACID-SEQUENCES IN A PROTEIN DATABASE [J].
ENG, JK ;
MCCORMACK, AL ;
YATES, JR .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 1994, 5 (11) :976-989