Phat: a gene finding program for Plasmodium falciparum

Cited by: 31
Authors
Cawley, SE [1]
Wirth, AI
Speed, TP
Affiliations
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[2] Affymetrix, Emeryville, CA 94608 USA
[3] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
Keywords
Plasmodium falciparum; gene-finding; generalized hidden Markov model; Viterbi algorithm
DOI
10.1016/S0166-6851(01)00363-2
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
We describe and assess the performance of the gene finding program Pretty Handy Annotation Tool (Phat) on sequence from the malaria parasite Plasmodium falciparum. Phat is based on a generalized hidden Markov model (GHMM) similar to the models used in GENSCAN, Genie and HMMgene. On a test set of 44 confirmed gene structures, Phat achieves nucleotide-level sensitivity and specificity of greater than 95%, performing as well as the other P. falciparum gene finding programs Hexamer and GlimmerM. Phat is particularly useful for P. falciparum and other eukaryotes for which few gene finding programs are available, because it is distributed with code for retraining it on new organisms. Moreover, the full source code is freely available under the GNU General Public License, allowing users to further develop and customize it. (C) 2001 Elsevier Science B.V. All rights reserved.
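As an illustration of what nucleotide-level sensitivity and specificity measure, the sketch below scores a predicted set of exons against an annotated set. It assumes the convention common in gene-finding evaluations, where sensitivity is TP / (TP + FN) and specificity is TP / (TP + FP), counted over coding nucleotides. The abstract does not state the exact definition used for Phat, and the function names and coordinates here are hypothetical, not taken from the Phat distribution.

```python
from typing import Iterable, Set, Tuple

# Exon coordinates as 1-based, inclusive (start, end) pairs (an assumption
# made for this sketch; it is not Phat's file format).
Interval = Tuple[int, int]


def coding_positions(exons: Iterable[Interval]) -> Set[int]:
    """Expand exon intervals into the set of coding nucleotide positions."""
    positions: Set[int] = set()
    for start, end in exons:
        positions.update(range(start, end + 1))
    return positions


def nucleotide_accuracy(annotated: Iterable[Interval],
                        predicted: Iterable[Interval]) -> Tuple[float, float]:
    """Return (sensitivity, specificity) at the nucleotide level.

    sensitivity = TP / (TP + FN): fraction of annotated coding bases predicted.
    specificity = TP / (TP + FP): fraction of predicted coding bases annotated.
    """
    true_pos = coding_positions(annotated)
    pred_pos = coding_positions(predicted)
    tp = len(true_pos & pred_pos)
    fn = len(true_pos - pred_pos)
    fp = len(pred_pos - true_pos)
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    specificity = tp / (tp + fp) if tp + fp else 0.0
    return sensitivity, specificity


# Made-up example: two annotated exons and a slightly shifted prediction.
annotated = [(101, 200), (301, 400)]
predicted = [(111, 200), (301, 410)]
sn, sp = nucleotide_accuracy(annotated, predicted)
print(f"sensitivity={sn:.2f} specificity={sp:.2f}")
```

In this toy case 190 of the 200 annotated coding bases are recovered and 190 of the 200 predicted bases are correct, so both measures come out at 0.95.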
Pages: 167-174
Page count: 8