Functional phylogenomics analysis of bacteria and archaea using consistent genome annotation with UniFam

被引:17
作者
Chai, Juanjuan [1 ]
Kora, Guruprasad [1 ]
Ahn, Tae-Hyuk [1 ]
Hyatt, Doug [2 ,3 ]
Pan, Chongle [1 ,2 ]
机构
[1] Oak Ridge Natl Lab, Comp Sci & Math Div, Oak Ridge, TN 37830 USA
[2] Oak Ridge Natl Lab, BioSci Div, Oak Ridge, TN USA
[3] Univ Tennessee, Joint Inst Biol Sci, Knoxville, TN USA
关键词
Prokaryotes; Cellular function; Pathway; Genomes; Evolution; Phylogenomics; HORIZONTAL GENE-TRANSFER; METABOLIC PATHWAYS; BIOCYC COLLECTION; METACYC DATABASE; INFORMATICS; RECOGNITION; EVOLUTION; SOFTWARE; ORIGINS; ENZYMES;
D O I
10.1186/s12862-014-0207-y
中图分类号
Q [生物科学];
学科分类号
090105 [作物生产系统与生态工程];
摘要
Background: Phylogenetic studies have provided detailed knowledge on the evolutionary mechanisms of genes and species in Bacteria and Archaea. However, the evolution of cellular functions, represented by metabolic pathways and biological processes, has not been systematically characterized. Many clades in the prokaryotic tree of life have now been covered by sequenced genomes in GenBank. This enables a large-scale functional phylogenomics study of many computationally inferred cellular functions across all sequenced prokaryotes. Results: A total of 14,727 GenBank prokaryotic genomes were re-annotated using a new protein family database, UniFam, to obtain consistent functional annotations for accurate comparison. The functional profile of a genome was represented by the biological process Gene Ontology (GO) terms in its annotation. The GO term enrichment analysis differentiated the functional profiles between selected archaeal taxa. 706 prokaryotic metabolic pathways were inferred from these genomes using Pathway Tools and MetaCyc. The consistency between the distribution of metabolic pathways in the genomes and the phylogenetic tree of the genomes was measured using parsimony scores and retention indices. The ancestral functional profiles at the internal nodes of the phylogenetic tree were reconstructed to track the gains and losses of metabolic pathways in evolutionary history. Conclusions: Our functional phylogenomics analysis shows divergent functional profiles of taxa and clades. Such function-phylogeny correlation stems from a set of clade-specific cellular functions with low parsimony scores. On the other hand, many cellular functions are sparsely dispersed across many clades with high parsimony scores. These different types of cellular functions have distinct evolutionary patterns reconstructed from the prokaryotic tree.
引用
收藏
页数:13
相关论文
共 49 条
[1]
Alexa A., 2010, topGO: enrichment analysis for gene ontology
[2]
Improved scoring of functional groups from gene expression data by decorrelating GO graph structure [J].
Alexa, Adrian ;
Rahnenfuehrer, Joerg ;
Lengauer, Thomas .
BIOINFORMATICS, 2006, 22 (13) :1600-1607
[3]
A systematic comparison of the MetaCyc and KEGG pathway databases [J].
Altman, Tomer ;
Travers, Michael ;
Kothari, Anamika ;
Caspi, Ron ;
Karp, Peter D. .
BMC BIOINFORMATICS, 2013, 14
[4]
[Anonymous], CRAN PACKAGE GPLOTS
[5]
The RAST server: Rapid annotations using subsystems technology [J].
Aziz, Ramy K. ;
Bartels, Daniela ;
Best, Aaron A. ;
DeJongh, Matthew ;
Disz, Terrence ;
Edwards, Robert A. ;
Formsma, Kevin ;
Gerdes, Svetlana ;
Glass, Elizabeth M. ;
Kubal, Michael ;
Meyer, Folker ;
Olsen, Gary J. ;
Olson, Robert ;
Osterman, Andrei L. ;
Overbeek, Ross A. ;
McNeil, Leslie K. ;
Paarmann, Daniel ;
Paczian, Tobias ;
Parrello, Bruce ;
Pusch, Gordon D. ;
Reich, Claudia ;
Stevens, Rick ;
Vassieva, Olga ;
Vonstein, Veronika ;
Wilke, Andreas ;
Zagnitko, Olga .
BMC GENOMICS, 2008, 9 (1)
[6]
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
[7]
Benson DA, 2013, NUCLEIC ACIDS RES, V41, pD36, DOI [10.1093/nar/gkn723, 10.1093/nar/gkp1024, 10.1093/nar/gkw1070, 10.1093/nar/gkr1202, 10.1093/nar/gkx1094, 10.1093/nar/gkl986, 10.1093/nar/gkq1079, 10.1093/nar/gks1195, 10.1093/nar/gkg057]
[8]
Horizontal gene transfer in evolution: facts and challenges [J].
Boto, Luis .
PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2010, 277 (1683) :819-827
[9]
Lateral gene transfer and the origins of prokaryotic groups [J].
Boucher, Y ;
Douady, CJ ;
Papke, RT ;
Walsh, DA ;
Boudreau, MER ;
Nesbo, CL ;
Case, RJ ;
Doolittle, WF .
ANNUAL REVIEW OF GENETICS, 2003, 37 :283-328
[10]
Estimating divergence times in large phylogenetic trees [J].
Britton, Tom ;
Anderson, Cajsa Lisa ;
Jacquet, David ;
Lundqvist, Samuel ;
Bremer, Kare .
SYSTEMATIC BIOLOGY, 2007, 56 (05) :741-752