Interpolated Markov models for eukaryotic gene finding

Cited by: 139
Authors
Salzberg, SL
Pertea, M
Delcher, AL
Gardner, MJ
Tettelin, H
Affiliations
[1] Inst Genom Res, Rockville, MD 20850 USA
[2] Johns Hopkins Univ, Dept Comp Sci, Baltimore, MD 21218 USA
[3] Loyola Coll, Dept Comp Sci, Baltimore, MD 21210 USA
[4] Celera Genom, Rockville, MD 20850 USA
DOI
10.1006/geno.1999.5854
Chinese Library Classification (CLC) number
Q81 [Bioengineering (biotechnology)]; Q93 [Microbiology];
Discipline classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Computational gene finding research has emphasized the development of gene finders for bacterial and human DNA. This has left genome projects for some small eukaryotes without a system that addresses their needs. This paper reports on a new system, GLIMMERM, that was developed to find genes in the malaria parasite Plasmodium falciparum. Because the gene density in P. falciparum is relatively high, the system design was based on a successful bacterial gene finder, GLIMMER. The system was augmented with specially trained modules to find splice sites and was trained on all available data from the P. falciparum genome. Although a precise evaluation of its accuracy is impossible at this time, laboratory tests (using RT-PCR) on a small selection of predicted genes confirmed all of those predictions. With the rapid progress in sequencing the genome of P. falciparum, the availability of this new gene finder will greatly facilitate the annotation process. (C) 1999 Academic Press.
Pages: 24-31
Number of pages: 8
References
23 entries in total
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [3] DETECTION OF NEW GENES IN A BACTERIAL GENOME USING MARKOV-MODELS FOR 3 GENE CLASSES
    BORODOVSKY, M
    MCININCH, JD
    KOONIN, EV
    RUDD, KE
    MEDIGUE, C
    DANCHIN, A
    [J]. NUCLEIC ACIDS RESEARCH, 1995, 23 (17) : 3554 - 3562
  • [4] GENMARK - PARALLEL GENE RECOGNITION FOR BOTH DNA STRANDS
    BORODOVSKY, M
    MCININCH, J
    [J]. COMPUTERS & CHEMISTRY, 1993, 17 (02) : 123 - 133
  • [5] Prediction of complete gene structures in human genomic DNA
    Burge, C
    Karlin, S
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1997, 268 (01) : 78 - 94
  • [6] Genomic sequence of a Lyme disease spirochaete, Borrelia burgdorferi
    Fraser, CM
    Casjens, S
    Huang, WM
    Sutton, GG
    Clayton, R
    Lathigra, R
    White, O
    Ketchum, KA
    Dodson, R
    Hickey, EK
    Gwinn, M
    Dougherty, B
    Tomb, JF
    Fleischmann, RD
    Richardson, D
    Peterson, J
    Kerlavage, AR
    Quackenbush, J
    Salzberg, S
    Hanson, M
    vanVugt, R
    Palmer, N
    Adams, MD
    Gocayne, J
    Weidman, J
    Utterback, T
    Watthey, L
    McDonald, L
    Artiach, P
    Bowman, C
    Garland, S
    Fujii, C
    Cotton, MD
    Horst, K
    Roberts, K
    Hatch, B
    Smith, HO
    Venter, JC
    [J]. NATURE, 1997, 390 (6660) : 580 - 586
  • [7] Complete genome sequence of Treponema pallidum, the syphilis spirochete
    Fraser, CM
    Norris, SJ
    Weinstock, GM
    White, O
    Sutton, GG
    Dodson, R
    Gwinn, M
    Hickey, EK
    Clayton, R
    Ketchum, KA
    Sodergren, E
    Hardham, JM
    McLeod, MP
    Salzberg, S
    Peterson, J
    Khalak, H
    Richardson, D
    Howell, JK
    Chidambaram, M
    Utterback, T
    McDonald, L
    Artiach, P
    Bowman, C
    Cotton, MD
    Fujii, C
    Garland, S
    Hatch, B
    Horst, K
    Roberts, K
    Sandusky, M
    Weidman, J
    Smith, HO
    Venter, JC
    [J]. SCIENCE, 1998, 281 (5375) : 375 - 388
  • [8] Chromosome 2 sequence of the human malaria parasite Plasmodium falciparum
    Gardner, MJ
    Tettelin, H
    Carucci, DJ
    Cummings, LM
    Aravind, L
    Koonin, EV
    Shallom, S
    Mason, T
    Yu, K
    Fujii, C
    Pederson, J
    Shen, K
    Jing, JP
    Aston, C
    Lai, ZW
    Schwartz, DC
    Pertea, M
    Salzberg, S
    Zhou, LX
    Sutton, GG
    Clayton, R
    White, O
    Smith, HO
    Fraser, CM
    Adams, MD
    Venter, JC
    Hoffman, SL
    [J]. SCIENCE, 1998, 282 (5391) : 1126 - 1132
  • [9] Finding genes in DNA with a Hidden Markov Model
    Henderson, J
    Salzberg, S
    Fasman, KH
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 1997, 4 (02) : 127 - 141
  • [10] A tool for analyzing and annotating genomic sequences
    Huang, XQ
    Adams, MD
    Zhou, H
    Kerlavage, AR
    [J]. GENOMICS, 1997, 46 (01) : 37 - 45