Computing prokaryotic gene ubiquity: Rescuing the core from extinction

被引:147
作者
Charlebois, RL
Doolittle, WF [1 ]
机构
[1] Dalhousie Univ, Genome Atlantic, Halifax, NS B3H 1X5, Canada
[2] Dalhousie Univ, Dept Biochem & Mol Biol, Halifax, NS B3H 1X5, Canada
关键词
D O I
10.1101/gr.3024704
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The genomic core concept has found several uses in comparative and evolutionary genomics. Defined as the set of all genes common to (ubiquitous among) all genomes in a phylogenetically coherent group, core size decreases as the number and phylogenetic diversity of the relevant group increases. Here, we focus on methods for defining the size and composition of the core of all genes shared by sequenced genomes of prokaryotes (Bacteria and Archaea). There are few (almost certainly less than 50) genes shared by all of the 147 genomes compared, surely insufficient to conduct all essential functions. Sequencing and annotation errors are responsible for the apparent absence of some genes, while very limited but genuine disappearances (from just one or a few genomes) can account for several others. Core size will continue to decrease as more genome sequences appear, unless the requirement for ubiquity is relaxed. Such relaxation seems consistent with any reasonable biological purpose for seeking a core, but it renders the problem of definition more problematic. We propose an alternative approach (the phylogenetically balanced core), which preserves some of the biological utility of the core concept. Cores, however delimited, preferentially contain informational rather than operational genes; we present a new hypothesis for why this might be so.
引用
收藏
页码:2469 / 2477
页数:9
相关论文
共 36 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] Lateral gene transfer and the origins of prokaryotic groups
    Boucher, Y
    Douady, CJ
    Papke, RT
    Walsh, DA
    Boudreau, MER
    Nesbo, CL
    Case, RJ
    Doolittle, WF
    [J]. ANNUAL REVIEW OF GENETICS, 2003, 37 : 283 - 328
  • [3] Eubacterial phylogeny based on translational apparatus proteins
    Brochier, C
    Bapteste, E
    Moreira, D
    Philippe, H
    [J]. TRENDS IN GENETICS, 2002, 18 (01) : 1 - 5
  • [4] Universal trees based on large combined protein sequence data sets
    Brown, JR
    Douady, CJ
    Italia, MJ
    Marshall, WE
    Stanhope, MJ
    [J]. NATURE GENETICS, 2001, 28 (03) : 281 - 285
  • [5] Characterization of species-specific genes using a flexible, web-based querying system
    Charlebois, RL
    Clarke, GDP
    Beiko, RG
    Jean, AS
    [J]. FEMS MICROBIOLOGY LETTERS, 2003, 225 (02) : 213 - 220
  • [6] Charlebois RL, 2004, SYST ASSOC SPEC VOL, P189
  • [7] Inferring genome trees by using a filter to eliminate phylogenetically discordant sequences and a distance matrix based on mean normalized BLASTP scores
    Clarke, GDP
    Beiko, RG
    Ragan, MA
    Charlebois, RL
    [J]. JOURNAL OF BACTERIOLOGY, 2002, 184 (08) : 2072 - 2080
  • [8] Phylogenetics and the cohesion of bacterial genomes
    Daubin, V
    Moran, NA
    Ochman, H
    [J]. SCIENCE, 2003, 301 (5634) : 829 - 832
  • [9] A phylogenomic approach to bacterial phylogeny:: Evidence of a core of genes sharing a common history
    Daubin, V
    Gouy, M
    Perrière, G
    [J]. GENOME RESEARCH, 2002, 12 (07) : 1080 - 1090
  • [10] GOPHNA U, 2004, IN PRESS J BACTERIOL