LARGE-SCALE BACTERIAL GENE DISCOVERY BY SIMILARITY SEARCH

被引:38
作者
ROBISON, K
GILBERT, W
CHURCH, GM
机构
[1] HARVARD UNIV,SCH MED,DEPT GENET,BOSTON,MA 02115
[2] HOWARD HUGHES MED INST,BOSTON,MA 02115
关键词
D O I
10.1038/ng0694-205
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
DNA sequencing efforts frequently uncover genes other than the targeted ones. We have used rapid database scanning methods to search for undescribed eubacterial and archean protein coding frames in regions flanking known genes. By searching all prokaryotic DNA sequences not marked as coding for proteins or stable RNAs against the protein databases, we have identified more than 450 new examples of bacterial proteins, as well as a smaller number of possible revisions to known proteins, at a surprisingly high rate of one new protein or revision for every 24 initial DNA sequences or 8,300 nucleotides examined. Seven proteins are members of families which have not been described in prokaryotic sequences. We also describe 49 re-interpretations of existing sequence data of particular biological significance.
引用
收藏
页码:205 / 214
页数:10
相关论文
共 42 条
  • [1] ALKSNE LE, 1993, J BIOL CHEM, V268, P10813
  • [2] AMINO-ACID SUBSTITUTION MATRICES FROM AN INFORMATION THEORETIC PERSPECTIVE
    ALTSCHUL, SF
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1991, 219 (03) : 555 - 565
  • [3] ALTSCHUL SF, 1990, J MOL BIOL, V214, P1
  • [4] ORGANIZATION AND NUCLEOTIDE-SEQUENCE OF A GENE-CLUSTER COMPRISING THE TRANSLATION ELONGATION FACTOR-1-ALPHA FROM SULFOLOBUS-ACIDOCALDARIUS
    AUER, J
    SPICKER, G
    MAYERHOFER, L
    PUHLER, G
    BOCK, A
    [J]. SYSTEMATIC AND APPLIED MICROBIOLOGY, 1991, 14 (01) : 14 - 22
  • [5] THE SWISS-PROT PROTEIN-SEQUENCE DATA-BANK
    BAIROCH, A
    BOECKMANN, B
    [J]. NUCLEIC ACIDS RESEARCH, 1991, 19 : 2247 - 2248
  • [6] THE PIR PROTEIN-SEQUENCE DATABASE
    BARKER, WC
    GEORGE, DG
    HUNT, LT
    GARAVELLI, JS
    [J]. NUCLEIC ACIDS RESEARCH, 1991, 19 : 2231 - 2236
  • [7] BELUNIS CJ, 1992, J BIOL CHEM, V267, P18702
  • [8] GENBANK
    BENSON, D
    LIPMAN, DJ
    OSTELL, J
    [J]. NUCLEIC ACIDS RESEARCH, 1993, 21 (13) : 2963 - 2965
  • [9] NUCLEOTIDE-SEQUENCE OF THE METHYL COENZYME-M REDUCTASE GENE-CLUSTER FROM METHANOSARCINA-BARKERI
    BOKRANZ, M
    KLEIN, A
    [J]. NUCLEIC ACIDS RESEARCH, 1987, 15 (10) : 4350 - 4351
  • [10] CLONING AND CHARACTERIZATION OF THE METHYL COENZYME-M REDUCTASE GENES FROM METHANOBACTERIUM-THERMOAUTOTROPHICUM
    BOKRANZ, M
    BAUMNER, G
    ALLMANSBERGER, R
    ANKELFUCHS, D
    KLEIN, A
    [J]. JOURNAL OF BACTERIOLOGY, 1988, 170 (02) : 568 - 577