MaxSubSeq: an algorithm for segment-length optimization. The case study of the transmembrane spanning segments

被引:15
作者
Fariselli, P
Finelli, M
Marchignoli, D
Martelli, PL
Rossi, I
Casadio, R
机构
[1] Univ Bologna, Dept Biol, CIRB, I-40126 Bologna, Italy
[2] BioDec, I-40127 Bologna, Italy
关键词
D O I
10.1093/bioinformatics/btg023
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: A problem in predicting the topography of transmembrane proteins is the optimal localization of the transmembrane segments along the protein sequences, provided that each residue is associated with a propensity of being or not being included in the transmembrane protein region. From previous work it is known that post-processing of propensity signals with suited algorithms can greatly improve the quality and the accuracy of the predictions. In this paper we describe a general dynamic programming-like algorithm (MaxSubSeq, Maximal SubSequence) specifically designed to optimize the number and length of segments with constrained length in a given protein sequence. Previous application of our algorithm, has proved its effectiveness in the optimization task of both neural network and hidden Markov models output, and in this paper we present the detailed description of MaxSubSeq. Results: We describe the application of MaxSubSeq to the location of both helical and beta strand transmembrane segments, optimizing the outputs derived with different predictive algorithms. For all-alpha transmembrane proteins we use both the standard Kyte-Doolittle (KD) hydropathy scale and the TMHMM predictor (http://www.cbs.dtu.dk/). Using a set of 188 well characterized membrane proteins, MaxSubSeq nearly doubles the correct location of transmembrane segments as compared to the standard KD hydrophobicity plot, reaching 51% accuracy. If MaxSubSeq is used to optimize the TMHMM method the accuracy increases from 68 to 72%. When used to regularize the prediction of beta transmembrane strands, obtained using both a neural network and a HMM based predictors, MaxSubSeq increases the accuracy per protein up to 72 and 73% respectively.
引用
收藏
页码:500 / 505
页数:6
相关论文
共 17 条
[1]   The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :45-48
[2]   A 3D model of the voltage-dependent anion channel (VDAQ) [J].
Casadio, R ;
Jacoboni, I ;
Messina, A ;
De Pinto, V .
FEBS LETTERS, 2002, 520 (1-3) :1-7
[3]  
Durbin R., 1998, BIOL SEQUENCE ANAL P
[4]   Prediction of the transmembrane regions of β-barrel membrane proteins with a neural network-based predictor [J].
Jacoboni, I ;
Martelli, PL ;
Fariselli, P ;
De Pinto, V ;
Casadio, R .
PROTEIN SCIENCE, 2001, 10 (04) :779-787
[5]   A MODEL RECOGNITION APPROACH TO THE PREDICTION OF ALL-HELICAL MEMBRANE-PROTEIN STRUCTURE AND TOPOLOGY [J].
JONES, DT ;
TAYLOR, WR ;
THORTON, JM .
BIOCHEMISTRY, 1994, 33 (10) :3038-3049
[6]   Predicting transmembrane protein topology with a hidden Markov model: Application to complete genomes [J].
Krogh, A ;
Larsson, B ;
von Heijne, G ;
Sonnhammer, ELL .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 305 (03) :567-580
[7]   A SIMPLE METHOD FOR DISPLAYING THE HYDROPATHIC CHARACTER OF A PROTEIN [J].
KYTE, J ;
DOOLITTLE, RF .
JOURNAL OF MOLECULAR BIOLOGY, 1982, 157 (01) :105-132
[8]  
Martelli Pier Luigi, 2002, Bioinformatics, V18 Suppl 1, pS46
[9]   Evaluation of methods for the prediction of membrane spanning regions [J].
Möller, S ;
Croning, MDR ;
Apweiler, R .
BIOINFORMATICS, 2001, 17 (07) :646-653
[10]   Topology prediction for helical transmembrane proteins at 86% accuracy [J].
Rost, B ;
Fariselli, P ;
Casadio, R .
PROTEIN SCIENCE, 1996, 5 (08) :1704-1718