Algorithms for incorporating prior topological information in HMMs: application to transmembrane proteins

被引:50
作者
Bagos, Pantelis G. [1 ]
Liakopoulos, Theodore D. [1 ]
Hamodrakas, Stavros J. [1 ]
机构
[1] Univ Athens, Fac Biol, Dept Cell Biol & Biophys, Athens 15701, Greece
关键词
D O I
10.1186/1471-2105-7-189
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Hidden Markov Models (HMMs) have been extensively used in computational molecular biology, for modelling protein and nucleic acid sequences. In many applications, such as transmembrane protein topology prediction, the incorporation of limited amount of information regarding the topology, arising from biochemical experiments, has been proved a very useful strategy that increased remarkably the performance of even the top-scoring methods. However, no clear and formal explanation of the algorithms that retains the probabilistic interpretation of the models has been presented so far in the literature. Results: We present here, a simple method that allows incorporation of prior topological information concerning the sequences at hand, while at the same time the HMMs retain their full probabilistic interpretation in terms of conditional probabilities. We present modifications to the standard Forward and Backward algorithms of HMMs and we also show explicitly, how reliable predictions may arise by these modifications, using all the algorithms currently available for decoding HMMs. A similar procedure may be used in the training procedure, aiming at optimizing the labels of the HMM's classes, especially in cases such as transmembrane proteins where the labels of the membrane-spanning segments are inherently misplaced. We present an application of this approach developing a method to predict the transmembrane regions of alpha-helical membrane proteins, trained on crystallographically solved data. We show that this method compares well against already established algorithms presented in the literature, and it is extremely useful in practical applications. Conclusion: The algorithms presented here, are easily implemented in any kind of a Hidden Markov Model, whereas the prediction method (HMM-TM) is freely available for academic users at http://bioinformatics.biol.uoa.gr/HMM-TM, offering the most advanced decoding options currently available.
引用
收藏
页数:17
相关论文
共 58 条
[31]  
Krogh A, 1997, ISMB-97 - FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS FOR MOLECULAR BIOLOGY, PROCEEDINGS, P179
[32]  
KROGH A, 1994, INT C PATT RECOG, P140, DOI 10.1109/ICPR.1994.576891
[33]   A HIDDEN MARKOV MODEL THAT FINDS GENES IN ESCHERICHIA-COLI DNA [J].
KROGH, A ;
MIAN, IS ;
HAUSSLER, D .
NUCLEIC ACIDS RESEARCH, 1994, 22 (22) :4768-4778
[34]   Predicting transmembrane protein topology with a hidden Markov model: Application to complete genomes [J].
Krogh, A ;
Larsson, B ;
von Heijne, G ;
Sonnhammer, ELL .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 305 (03) :567-580
[35]   The presence of signal peptide significantly affects transmembrane topology prediction [J].
Lao, DM ;
Arai, M ;
Ikeda, M ;
Shimizu, T .
BIOINFORMATICS, 2002, 18 (12) :1562-1566
[36]   Determining the structure and mechanism of the human multidrug resistance P-glycoprotein using cysteine-scanning mutagenesis and thiol-modification techniques [J].
Loo, TW ;
Clarke, DM .
BIOCHIMICA ET BIOPHYSICA ACTA-BIOMEMBRANES, 1999, 1461 (02) :315-325
[37]  
MANOIL C, 1991, METHOD CELL BIOL, V34, P61
[38]   An ENSEMBLE machine learning approach for the prediction of all-alpha membrane proteins [J].
Martelli, Pier Luigi ;
Fariselli, Piero ;
Casadio, Rita .
BIOINFORMATICS, 2003, 19 :i205-i211
[39]   Reliability measures for membrane protein topology prediction algorithms [J].
Melén, K ;
Krogh, A ;
von Heijne, G .
JOURNAL OF MOLECULAR BIOLOGY, 2003, 327 (03) :735-744
[40]   Evaluation of methods for the prediction of membrane spanning regions [J].
Möller, S ;
Croning, MDR ;
Apweiler, R .
BIOINFORMATICS, 2001, 17 (07) :646-653