Genome-based peptide fingerprint scanning

被引:39
作者
Giddings, MC [1 ]
Shah, AA
Gesteland, R
Moore, B
机构
[1] Univ N Carolina, Dept Microbiol & Immunol, Chapel Hill, NC 27599 USA
[2] Univ N Carolina, Dept Biomed Engn, Chapel Hill, NC 27599 USA
[3] Univ Utah, Dept Human Genet, Salt Lake City, UT 84112 USA
关键词
D O I
10.1073/pnas.0136893100
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We have implemented a method that identifies the genomic origins of sample proteins by scanning their peptide-mass fingerprint against the theoretical translation and proteolytic digest of an entire genome. Unlike previously reported techniques, this method requires no predefined ORF or protein annotations. Fixed-size windows along the genome sequence are scored by an equation accounting for the number of matching peptides, the number of missed enzymatic cleavages in each peptide, the number of in-frame stop codons within a window, the adjacency between peptides, and duplicate peptide matches. Statistical significance of matching regions is assessed by comparing their scores to scores from windows matching randomly generated mass data. Tests with samples from Saccharomyces cerevisiae mitochondria and Escherichia coli have demonstrated the ability to produce statistically significant identifications, agreeing with two commonly used programs, PEPTIDENT and MASCOT, in 86% of samples analyzed. This genome fingerprint scanning method has the potential to aid in genome annotation, identify proteins for which annotation is incorrect or missing, and handle cases where sequencing errors have caused framing mistakes in the databases. It might also aid in the identification of proteins in which recoding events such as frameshifting or stop-codon read-through have occurred, elucidating alternative translation mechanisms. The prototype is implemented as a client/server pair, allowing the distribution, among a set of cluster nodes, of a single or multiple genomes for concurrent analysis.
引用
收藏
页码:20 / 25
页数:6
相关论文
共 26 条
[1]   Functional genomics by mass spectrometry [J].
Andersen, JS ;
Mann, M .
FEBS LETTERS, 2000, 480 (01) :25-31
[2]  
Bafna V, 2001, Bioinformatics, V17 Suppl 1, pS13
[3]   Protein diversity from alternative splicing: A challenge for bioinformatics and post-genome biology [J].
Black, DL .
CELL, 2000, 103 (03) :367-370
[4]  
Choudhary JS, 2001, PROTEOMICS, V1, P651, DOI 10.1002/1615-9861(200104)1:5<651::AID-PROT651>3.0.CO
[5]  
2-N
[6]   A statistical basis for testing the significance of mass spectrometric protein identification results [J].
Eriksson, J ;
Chait, BT ;
Fenyö, D .
ANALYTICAL CHEMISTRY, 2000, 72 (05) :999-1005
[7]   Recoding: Dynamic reprogramming of translation [J].
Gesteland, RF ;
Atkins, JF .
ANNUAL REVIEW OF BIOCHEMISTRY, 1996, 65 :741-768
[8]   Isolation of highly purified mitochondria from Saccharomyces cerevisiae [J].
Glick, BS ;
Pon, LA .
MITOCHONDRIAL BIOGENESIS AND GENETICS, PT A, 1995, 260 :213-223
[9]   IDENTIFYING PROTEINS FROM 2-DIMENSIONAL GELS BY MOLECULAR MASS SEARCHING OF PEPTIDE-FRAGMENTS IN PROTEIN-SEQUENCE DATABASES [J].
HENZEL, WJ ;
BILLECI, TM ;
STULTS, JT ;
WONG, SC ;
GRIMLEY, C ;
WATANABE, C .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1993, 90 (11) :5011-5015
[10]   PROTEIN IDENTIFICATION BY MASS PROFILE FINGERPRINTING [J].
JAMES, P ;
QUADRONI, M ;
CARAFOLI, E ;
GONNET, G .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 1993, 195 (01) :58-64