GlimmerM, Exonomy and Unveil: three ab initio eukaryotic genefinders

Cited by: 51
Authors
Majoros, WH [1]
Pertea, M [1]
Antonescu, C [1]
Salzberg, SL [1]
Affiliations
[1] Inst Genom Res, Rockville, MD 20850 USA
Keywords
DOI
10.1093/nar/gkg527
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010; 081704;
Abstract
We present three programs for ab initio gene prediction in eukaryotes: Exonomy, Unveil and GlimmerM. Exonomy is a 23-state Generalized Hidden Markov Model (GHMM), Unveil is a 283-state standard Hidden Markov Model (HMM) and GlimmerM is a previously described genefinder which utilizes decision trees and Interpolated Markov Models (IMMs). All three are readily re-trainable for new organisms and have been found to perform well compared to other genefinders. Results are presented for Arabidopsis thaliana. Cases have been found where each of the genefinders outperforms each of the others, demonstrating the collective value of this ensemble of genefinders. These programs are all accessible through webservers at http://www.tigr.org/software.
Pages: 3601-3604
Number of pages: 4
Related papers
16 items in total
  • [1] [Anonymous], 1998, COMPUTATIONAL METHOD
  • [2] A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS
    BAUM, LE
    PETRIE, T
    SOULES, G
    WEISS, N
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01) : 164 - &
  • [3] Prediction of complete gene structures in human genomic DNA
    Burge, C
    Karlin, S
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1997, 268 (01) : 78 - 94
  • [4] Assessment of the total number of human transcription units
    Das, M
    Burge, CB
    Park, E
    Colinas, J
    Pelletier, J
    [J]. GENOMICS, 2001, 77 (1-2) : 71 - 78
  • [5] Durbin R., 1998, BIOL SEQUENCE ANAL
  • [6] Comparison of mouse and human genomes followed by experimental verification yields an estimated 1,019 additional genes
    Guigó, R
    Dermitzakis, ET
    Agarwal, P
    Ponting, CP
    Parra, G
    Reymond, A
    Abril, JF
    Keibler, E
    Lyle, R
    Ucla, C
    Antonarakis, SE
    Brent, MR
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (03) : 1140 - 1145
  • [7] Haas BJ, 2002, GENOME BIOL, V3, DOI 10.1186/GB-2002-3-6-RESEARCH0029
  • [8] Finding genes in DNA with a Hidden Markov Model
    Henderson, J
    Salzberg, S
    Fasman, KH
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 1997, 4 (02) : 127 - 141
  • [9] Analysis of the genome sequence of the flowering plant Arabidopsis thaliana
    Kaul, S
    Koo, HL
    Jenkins, J
    Rizzo, M
    Rooney, T
    Tallon, LJ
    Feldblyum, T
    Nierman, W
    Benito, MI
    Lin, XY
    Town, CD
    Venter, JC
    Fraser, CM
    Tabata, S
    Nakamura, Y
    Kaneko, T
    Sato, S
    Asamizu, E
    Kato, T
    Kotani, H
    Sasamoto, S
    Ecker, JR
    Theologis, A
    Federspiel, NA
    Palm, CJ
    Osborne, BI
    Shinn, P
    Conway, AB
    Vysotskaia, VS
    Dewar, K
    Conn, L
    Lenz, CA
    Kim, CJ
    Hansen, NF
    Liu, SX
    Buehler, E
    Altafi, H
    Sakano, H
    Dunn, P
    Lam, B
    Pham, PK
    Chao, Q
    Nguyen, M
    Yu, GX
    Chen, HM
    Southwick, A
    Lee, JM
    Miranda, M
    Toriumi, MJ
    Davis, RW
    [J]. NATURE, 2000, 408 (6814) : 796 - 815
  • [10] Current methods of gene prediction, their strengths and weaknesses
    Mathé, C
    Sagot, MF
    Schiex, T
    Rouzé, P
    [J]. NUCLEIC ACIDS RESEARCH, 2002, 30 (19) : 4103 - 4117