Identification of several types of periodicities in the collagens and their simulation

被引:8
作者
Arques, DG
Fallot, JP
Michel, CJ
机构
[1] UNIV FRANCHE COMTE,EQUIPE BIOL THEOR,INST TECHNOL BELFORT MONTBELIARD,F-90016 BELFORT,FRANCE
[2] UNIV MARNE VALLEE,EQUIPE BIOL THEOR,INST GASPARD MONGE,F-93160 NOISY LE GRAND,FRANCE
关键词
identification; modelling; autocorrelation function; automaton; collagens;
D O I
10.1016/0141-8130(96)01115-4
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The collagens constitute an important population of proteins providing the structural support in vertebrate tissues. A collagen is mainly based on a series of tripeptides of the type GX(1)X(2) (G=Glycine, X(1) and X(2) being any residues). The nine amino acids occurring with significant frequencies in the X, and X, residue sites and G form the reduced protein alphabet Q={A,D,E,G,K,L,P,Q,R,S} (A=Alanine, D=Aspartic acid, E=Glutamic acid, K=Lysine, L=Leucine, P=Proline, Q=Glutamine, R=Arginine, S=Serine). Surprisingly, the method based on the autocorrelation function w(X)(i)w' analysing the probability that an amino acid w' in Q occurs any i residues X after an amino acid w in Q (called i-motif w(X)(i)w'), identifies six types of module 3 periodicities in collagens: three basic types 0, 1 and 2 module 3 and three combined types 0, 1, 0,2 and 1,2 module 3. Furthermore, the classification of these 100 i-motifs according to the types of periodicities shows several strong relations between four sub-sets of Q {G}, {A,D,P,S}, {E,L} and {K,Q,R}. Then, these relations allow the construction of a simple automaton for the generation of model collagen sequences. Indeed, this automaton can simulate the six types of periodicities and it retrieves the types of periodicities for almost all i-motifs. Finally, the autocorrelation function based on the sub-set I K,Q,R) identifies segments of 18 amino acids in collagens which may correspond to the exons (segments of genes of 54 nucleotides) coding for those collagens.
引用
收藏
页码:131 / 138
页数:8
相关论文
共 21 条
[1]  
Arques D., 1995, Technique et Science Informatiques, V14, P197
[2]   STUDY OF A PERTURBATION IN THE CODING PERIODICITY [J].
ARQUES, DG ;
MICHEL, CJ .
MATHEMATICAL BIOSCIENCES, 1987, 86 (01) :1-14
[3]  
ARQUES DG, 1990, B MATH BIOL, V52, P741, DOI 10.1016/S0092-8240(05)80383-0
[4]   A PURINE PYRIMIDINE MOTIF VERIFYING AN IDENTICAL PRESENCE IN ALMOST ALL GENE TAXONOMIC GROUPS [J].
ARQUES, DG ;
MICHEL, CJ .
JOURNAL OF THEORETICAL BIOLOGY, 1987, 128 (04) :457-461
[5]   PERIODICITIES IN CODING AND NONCODING REGIONS OF THE GENES [J].
ARQUES, DG ;
MICHEL, CJ .
JOURNAL OF THEORETICAL BIOLOGY, 1990, 143 (03) :307-318
[6]  
ARQUES DG, 1992, COMPUT APPL BIOSCI, V8, P5
[7]  
ARQUES DG, 1993, MODEL SIMULAT, V13, P110
[8]   MAJOR TRANSCRIPT OF THE FRAMESHIFTED COXLL GENE FROM TRYPANOSOME MITOCHONDRIA CONTAINS 4 NUCLEOTIDES THAT ARE NOT ENCODED IN THE DNA [J].
BENNE, R ;
VANDENBURG, J ;
BRAKENHOFF, JPJ ;
SLOOF, P ;
VANBOOM, JH ;
TROMP, MC .
CELL, 1986, 46 (06) :819-826
[9]   ON THE EVOLUTION OF RNA EDITING [J].
COVELLO, PS ;
GRAY, MW .
TRENDS IN GENETICS, 1993, 9 (08) :265-268
[10]  
Creighton TE, 1993, PROTEINS STRUCTURES