Using the COG database to improve gene recognition in complete genomes

Cited by: 44
Authors
Natale, DA [1]
Galperin, MY [1]
Tatusov, RL [1]
Koonin, EV [1]
Affiliations
[1] Natl Lib Med, Natl Ctr Biotechnol Informat, NIH, Bethesda, MD 20894 USA
Keywords
essential genes; microbial genomes; phylogenetic patterns; short proteins
DOI
10.1023/A:1004031323748
Chinese Library Classification number
Q3 [Genetics]
Subject classification codes
071007; 090102
Abstract
A complete understanding of the biology of an organism necessarily starts with knowledge of its genetic makeup. Proteins encoded in a genome must be identified and characterized, and the presence or absence of specific sets of proteins must be noted in order to determine the possible biochemical pathways or functional systems utilized by that organism. The COG database presents a set of tools suited to these purposes, including the ability to select protein families (COGs) that contain proteins from a specified set of species. The selection is based upon a phylogenetic pattern, which is a shorthand representation of the presence or absence of a particular species in a COG. Here we present the use of phylogenetic patterns as a means to perform targeted searches for undetected protein-coding genes in complete genomes.
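The idea of a phylogenetic pattern described in the abstract can be illustrated with a minimal sketch. It is not the COG database's own tooling: the one-letter species codes, the COG identifiers, and the function names below are all hypothetical, chosen only to show how a presence/absence pattern can drive selection of COGs and flag COGs that lack a single species, which would be candidates for a targeted search for an undetected gene.

```python
from typing import Dict, FrozenSet, List, Set

# Hypothetical one-letter species codes; their order fixes the pattern layout.
SPECIES: List[str] = ["a", "b", "c", "d", "e"]

# Hypothetical COGs, each mapped to the set of species represented in it.
COGS: Dict[str, Set[str]] = {
    "COG_X1": {"a", "b", "c", "d", "e"},
    "COG_X2": {"a", "b", "d", "e"},
    "COG_X3": {"a", "c"},
    "COG_X4": {"b", "c", "d", "e"},
}


def pattern(present: Set[str], species: List[str] = SPECIES) -> str:
    """Shorthand pattern: the species letter if present, '-' if absent."""
    return "".join(s if s in present else "-" for s in species)


def select(
    cogs: Dict[str, Set[str]],
    required: Set[str],
    forbidden: FrozenSet[str] = frozenset(),
) -> List[str]:
    """COGs that contain every required species and no forbidden species."""
    return sorted(
        cog_id
        for cog_id, members in cogs.items()
        if required <= members and not (forbidden & members)
    )


def missing_in_one(
    cogs: Dict[str, Set[str]],
    target: str,
    species: List[str] = SPECIES,
) -> List[str]:
    """COGs present in every species except the target.

    Such COGs are plausible starting points for a targeted search
    for an undetected gene in the target genome.
    """
    others = set(species) - {target}
    return sorted(
        cog_id
        for cog_id, members in cogs.items()
        if others <= members and target not in members
    )


if __name__ == "__main__":
    for cog_id in sorted(COGS):
        print(cog_id, pattern(COGS[cog_id]))
    print("Missing only in c:", missing_in_one(COGS, "c"))
```

In this toy data, a COG whose pattern shows every species but one is the kind of case the abstract has in mind: the absence may reflect a real loss, or a protein-coding gene that was missed during annotation.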
Pages: 9-17
Page count: 9