Applications of generalized pair hidden Markov models to alignment and gene finding problems

被引:46
作者
Pachter, L
Alexandersson, M
Cawley, S
机构
[1] Univ Calif Berkeley, Dept Math, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
关键词
hidden Markov model; alignment; gene finding; comparative genomics;
D O I
10.1089/10665270252935520
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Hidden Markov models (HMMs) have been successfully applied to a variety of problems in molecular biology, ranging from alignment problems to gene finding and annotation. Alignment problems can be solved with pair HMMs, while gene finding programs rely on generalized HMMs in order to model exon lengths. In this paper, we introduce the generalized pair HMM (GPHMM), which is an extension of both pair and generalized HMMs. We show how GPHMMs, in conjunction with approximate alignments, can be used for cross-species gene finding and describe applications to DNA-cDNA and DNA-protein alignment. GPHMMs provide a unifying and probabilistically sound theory for modeling these problems.
引用
收藏
页码:389 / 399
页数:11
相关论文
共 35 条
[1]  
ALEXANDERSSON M, 2002, UNPUB SLAM CROSS SPE
[2]  
[Anonymous], 1997, THESIS STANFORD U ST
[3]  
BAFNA V, 2000, P INT C INT SYST MOL
[4]   Human and mouse gene structure: Comparative analysis and application to exon prediction [J].
Batzoglou, S ;
Pachter, L ;
Mesirov, JP ;
Berger, B ;
Lander, ES .
GENOME RESEARCH, 2000, 10 (07) :950-958
[5]   Using GeneWise in the Drosophila annotation experiment [J].
Birney, E ;
Durbin, R .
GENOME RESEARCH, 2000, 10 (04) :547-548
[6]   Prediction of complete gene structures in human genomic DNA [J].
Burge, C ;
Karlin, S .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 268 (01) :78-94
[7]  
CAWLEY S, 2000, THESIS U CALIFORNIA
[8]  
CHURCHILL GA, 1989, B MATH BIOL, V51, P79
[9]  
Dayhoff M.O., 1978, ATLAS PROTEIN SEQ ST, V5
[10]  
Durbin R., 1998, BIOL SEQUENCE ANAL