Proteogenomics for environmental microbiology

被引:41
作者
Armengaud, Jean [1 ]
Hartmann, Erica Marie [1 ]
Bland, Celine [1 ]
机构
[1] CEA, DSV, IBEB, Lab Biochim Syst Perturb, Bagnols Sur Ceze, France
关键词
Genome annotation; High-throughput proteomics; Microbiology; N-Terminomics; Proteogenomics; Translational start site; MASS-SPECTROMETRY DATA; GENOMIC ANNOTATION; PROTEIN BIOMARKERS; PROTEOMIC ANALYSIS; N-TERMINOME; SEQUENCE; GENES; DATABASES; SOFTWARE; INSIGHTS;
D O I
10.1002/pmic.201200576
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Proteogenomics sensu stricto refers to the use of proteomic data to refine the annotation of genomes from model organisms. Because of the limitations of automatic annotation pipelines, a relatively high number of errors occur during the structural annotation of genes coding for proteins. Whether putative orphan sequences or short genes encoding low-molecular-weight proteins really exist is still frequently a mystery. Whether start codons are well defined is also an open debate. These problems are exacerbated for genomes of microorganisms belonging to poorly documented genera, as related sequences are not always available for homology-guided annotation. The functional annotation of a significant proportion of genes is also another well-known issue when annotating environmental microorganisms. High-throughput shotgun proteomics has recently greatly evolved, allowing the exploration of the proteome from any microorganism at an unprecedented depth. The structural and functional annotation process may be usefully complemented with experimental data. Indeed, proteogenomic mapping has been successfully performed for a wide variety of organisms. Specific approaches devoted to systematically establishing the N-termini of a large set of proteins are being developed. N-terminomics is giving rise to datasets of experimentally proven translational start codons as well as validated peptide signals for secreted proteins. By extension, combining genomic and proteomic data is becoming routine in many research projects. The proteomic analysis of organisms with unfinished genome sequences, the so-called composite proteomics, and the search for microbial biomarkers by bottom-up and top-down combined approaches are some examples of proteogenomic-flavored studies. They illustrate the advent of a new era of environmental microbiology where proteomics and genomics are intimately integrated to answer key biological questions.
引用
收藏
页码:2731 / 2742
页数:12
相关论文
共 74 条
[1]   Experimental annotation of post-translational features and translated coding regions in the pathogen Salmonella Typhimurium [J].
Ansong, Charles ;
Tolic, Nikola ;
Purvine, Samuel O. ;
Porwollik, Steffen ;
Jones, Marcus ;
Yoon, Hyunjin ;
Payne, Samuel H. ;
Martin, Jessica L. ;
Burnet, Meagan C. ;
Monroe, Matthew E. ;
Venepally, Pratap ;
Smith, Richard D. ;
Peterson, Scott N. ;
Heffron, Fred ;
McClelland, Michael ;
Adkins, Joshua N. .
BMC GENOMICS, 2011, 12
[2]   GeneTack database: genes with frameshifts in prokaryotic genomes and eukaryotic mRNA sequences [J].
Antonov, Ivan ;
Baranov, Pavel ;
Borodovsky, Mark .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D152-D156
[3]  
Armengaud J., 2011, J BACTERIOL PARASI S, VS3, P001
[4]   Microbiology and proteomics, getting the best of both worlds! [J].
Armengaud, Jean .
ENVIRONMENTAL MICROBIOLOGY, 2013, 15 (01) :12-23
[5]   A perfect genome annotation is within reach with the proteomics and genomics alliance [J].
Armengaud, Jean .
CURRENT OPINION IN MICROBIOLOGY, 2009, 12 (03) :292-300
[6]   Proteomics-based Refinement of Deinococcus deserti Genome Annotation Reveals an Unwonted Use of Non-canonical Translation Initiation Codons [J].
Baudet, Mathieu ;
Ortet, Philippe ;
Gaillard, Jean-Charles ;
Fernandez, Bernard ;
Guerin, Philippe ;
Enjalbal, Christine ;
Subra, Gilles ;
de Groot, Arjan ;
Barakat, Mohamed ;
Dedieu, Alain ;
Armengaud, Jean .
MOLECULAR & CELLULAR PROTEOMICS, 2010, 9 (02) :415-426
[7]   Linking environmental processes to the in situ functioning of microorganisms by high-resolution secondary ion mass spectrometry (NanoSIMS) and scanning transmission X-ray microscopy (STXM) [J].
Behrens, Sebastian ;
Kappler, Andreas ;
Obst, Martin .
ENVIRONMENTAL MICROBIOLOGY, 2012, 14 (11) :2851-2869
[8]   Addressing Statistical Biases in Nucleotide-Derived Protein Databases for Proteogenomic Search Strategies [J].
Blakeley, Paul ;
Overton, Ian M. ;
Hubbard, Simon J. .
JOURNAL OF PROTEOME RESEARCH, 2012, 11 (11) :5221-5234
[9]   N-terminal Protein Processing: A Comparative Proteogenomic Analysis [J].
Bonissone, Stefano ;
Gupta, Nitin ;
Romine, Margaret ;
Bradshaw, Ralph A. ;
Pevzner, Pavel A. .
MOLECULAR & CELLULAR PROTEOMICS, 2013, 12 (01) :14-28
[10]  
Boutet Emmanuel, 2007, V406, P89