Hidden Markov model approach for identifying the modular framework of the protein backbone

被引:64
作者
Camproux, AC
Tuffery, P
Chevrolat, JP
Boisvieux, JF
Hazout, S
机构
[1] Univ Paris 07, INSERM, U155, Equipe Bioinformat Mol, F-75251 Paris 05, France
[2] CHU Pitie Salpetriere, Dept Biomath, F-75013 Paris 13, France
来源
PROTEIN ENGINEERING | 1999年 / 12卷 / 12期
关键词
Markov chain; pattern classification; protein backbone; protein conformation; protein structure;
D O I
10.1093/protein/12.12.1063
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The hidden Markov model (HMM) was used to identify recurrent short 3D structural building blocks (SBBs) describing protein backbones, independently of any a priori knowledge. Polypeptide chains are decomposed into a series of short segments defined by their inter-a-carbon distances. Basically, the model takes into account the sequentiality of the observed segments and assumes that each one corresponds to one of several possible SBBs. Fitting the model to a database of non-redundant proteins allowed us to decode proteins in terms of 12 distinct SBBs with different roles in protein structure. Some SBBs correspond to classical regular secondary structures. Others correspond to a significant subdivision elf their bounding regions previously considered to be a single pattern. The major contribution of the HMM is that this model implicitly takes into account the sequential connections between SBBs and thus describes the most probable pathways by which the blocks are connected to form the framework of the protein structures. Validation of the SBBs code was performed by extracting SBB series repeated in recoding proteins and examining their structural similarities. Preliminary results on the sequence specificity of SBBs suggest promising perspectives for the prediction of SBBs or series of SBBs from the protein sequences.
引用
收藏
页码:1063 / 1073
页数:11
相关论文
共 50 条
[1]  
ARGOS P, 1982, INT J PEPT PROT RES, V19, P380
[2]  
ASAI K, 1993, COMPUT APPL BIOSCI, V9, P141
[3]   RULES FOR ALPHA-HELIX TERMINATION BY GLYCINE [J].
AURORA, R ;
SRINIVASAN, R ;
ROSE, GD .
SCIENCE, 1994, 264 (5162) :1126-1130
[4]   A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS [J].
BAUM, LE ;
PETRIE, T ;
SOULES, G ;
WEISS, N .
ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01) :164-&
[5]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[6]   Prediction of local structure in proteins using a library of sequence-structure motifs [J].
Bystroff, C ;
Baker, D .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 281 (03) :565-577
[7]   Analyzing patterns between regular secondary structures using short structural building blocks defined by a hidden Markov model [J].
Camproux, AC ;
Tuffery, P ;
Buffat, L ;
André, C ;
Boisvieux, JF ;
Hazout, S .
THEORETICAL CHEMISTRY ACCOUNTS, 1999, 101 (1-3) :33-40
[8]   A hidden Markov model approach to neuron firing patterns [J].
Camproux, AC ;
Saunier, F ;
Chouvet, G ;
Thalabard, JC ;
Thomas, G .
BIOPHYSICAL JOURNAL, 1996, 71 (05) :2404-2412
[9]  
CHURCHILL GA, 1989, B MATH BIOL, V51, P79
[10]   COMPARISON OF 3 ALGORITHMS FOR THE ASSIGNMENT OF SECONDARY STRUCTURE IN PROTEINS - THE ADVANTAGES OF A CONSENSUS ASSIGNMENT [J].
COLLOCH, N ;
ETCHEBEST, C ;
THOREAU, E ;
HENRISSAT, B ;
MORNON, JP .
PROTEIN ENGINEERING, 1993, 6 (04) :377-382