Whole genome searching with shotgun proteomic data: Applications for genome annotation

被引:23
作者
Sevinsky, Joel R. [1 ]
Cargile, Benjamin J. [1 ]
Bunger, Maureen K. [1 ]
Meng, Fanyu [2 ]
Yates, Nathan A. [2 ]
Hendrickson, Ronald C. [2 ]
Stephenson, James L., Jr. [1 ]
机构
[1] Res Triangle Inst, Mass Spectrometry Program, Durham, NC 27709 USA
[2] Merck & Co Inc, Merck Res Labs, Rahway, NJ 08854 USA
关键词
isoelectric focusing; mass spectrometry; peptide identification; genome annotation; database searching;
D O I
10.1021/pr070198n
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
High-throughput genome sequencing continues to accelerate the rate at which complete genomes are available for biological research. Many of these new genome sequences have little or no genome annotation currently available and hence rely upon computational predictions of protein coding genes. Evidence of translation from proteomic techniques could facilitate experimental validation of protein coding genes, but the techniques for whole genome searching with MS/MS data have not been adequately developed to date. Here we describe GENQUEST, a novel method using peptide isoelectric focusing and accurate mass to greatly reduce the peptide search space, making fast, accurate, and sensitive whole human genome searching possible on common desktop computers. In an initial experiment, almost all exonic peptides identified in a protein database search were identified when searching genomic sequence. Many peptides identified exclusively in the genome searches were incorrectly identified or could not be experimentally validated, highlighting the importance of orthogonal validation. Experimentally validated peptides exclusive to the genomic searches can be used to reannotate protein coding genes. GENQUEST represents an experimental tool that can be used by the proteomics community at large for validating computational approaches to genome annotation.
引用
收藏
页码:80 / 88
页数:9
相关论文
共 25 条
[21]   EID: the Exon-Intron Database - an exhaustive database of protein-coding intron-containing genes [J].
Saxonov, S ;
Daizadeh, I ;
Fedorov, A ;
Gilbert, W .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :185-190
[22]   Advances in the Exon-Intron Database (EID) [J].
Shepelev, Valery ;
Fedorov, Alexei .
BRIEFINGS IN BIOINFORMATICS, 2006, 7 (02) :178-185
[23]   Novel linear quadrupole ion trap/FT mass spectrometer: Performance characterization and use in the comparative analysis of histone H3 post-translational modifications [J].
Syka, JEP ;
Marto, JA ;
Bai, DL ;
Horning, S ;
Senko, MW ;
Schwartz, JC ;
Ueberheide, B ;
Garcia, B ;
Busby, S ;
Muratore, T ;
Shabanowitz, J ;
Hunt, DF .
JOURNAL OF PROTEOME RESEARCH, 2004, 3 (03) :621-626
[24]   Improving gene annotation using peptide mass spectrometry [J].
Tanner, Stephen ;
Shen, Zhouxin ;
Ng, Julio ;
Florea, Liliana ;
Guigo, Roderic ;
Briggs, Steven P. ;
Bafna, Vineet .
GENOME RESEARCH, 2007, 17 (02) :231-239
[25]   MINING GENOMES - CORRELATING TANDEM MASS-SPECTRA OF MODIFIED AND UNMODIFIED PEPTIDES TO SEQUENCES IN NUCLEOTIDE DATABASES [J].
YATES, JR ;
ENG, JK ;
MCCORMACK, AL .
ANALYTICAL CHEMISTRY, 1995, 67 (18) :3202-3210