A GRAPHIC APPROACH TO ANALYZING CODON USAGE IN 1562-ESCHERICHIA-COLI PROTEIN-CODING SEQUENCES

被引:98
作者
ZHANG, CT [1 ]
CHOU, KC [1 ]
机构
[1] UPJOHN CO,RES LABS,COMPUTAT CHEM,KALAMAZOO,MI 49001
关键词
PROTEIN CODING SEQUENCES; BASE FREQUENCY; CODON POSITIONS; COLLECTIVE PARAMETERS;
D O I
10.1006/jmbi.1994.1263
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The occurrence frequencies of the four bases (adenine, cytosine, guanine and thymine) at each of the three codon positions for 1562 Escherichia coli. protein coding sequences have been calculated. The 1562 x 4 x 3 = 18,744 data thus obtained have been analyzed by a graphic method in which the four base occurrence frequencies at each codon position for each coding sequence are represented by a point in a three-dimensional space. Thus, the 18,744 data, which would otherwise occupy several printed pages, can be intuitively displayed by a graph. The point distribution pattern for each of the three codon positions has been analyzed. The results of our analysis indicate that the patterns for the first two codon positions reflect the origin for producing native folding structures of proteins. We thus come to the conclusion that the distribution patterns for the first two codon positions should be basically species-independent, as confirmed by studies for a number of other species. However, the distribution pattern for the third codon position is species-dependent. Based on the point distribution of the third codon position, six collective parameters have been defined to describe the overall feature of the pattern concerned. These collective parameters can be generally used to classify different species, and hence would be a useful vehicle for studies in taxonomy. In addition to E. coli, the collective parameters for a number of other species have been calculated and analyzed. © 1994 Academic Press, Inc.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 14 条
[1]   A JOINT PREDICTION OF THE FOLDING TYPES OF 1490 HUMAN PROTEINS FROM THEIR GENETIC CODONS [J].
CHOU, JJW ;
ZHANG, CT .
JOURNAL OF THEORETICAL BIOLOGY, 1993, 161 (02) :251-262
[2]   A CORRELATION-COEFFICIENT METHOD TO PREDICTING PROTEIN-STRUCTURAL CLASSES FROM AMINO-ACID COMPOSITIONS [J].
CHOU, KC ;
ZHANG, CT .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 1992, 207 (02) :429-433
[3]   DIAGRAMMATIZATION OF CODON USAGE IN 339 HUMAN-IMMUNODEFICIENCY-VIRUS PROTEINS AND ITS BIOLOGICAL IMPLICATION [J].
CHOU, KC ;
ZHANG, CT .
AIDS RESEARCH AND HUMAN RETROVIRUSES, 1992, 8 (12) :1967-1976
[4]  
CHOU PY, 1989, PREDICTION PROTEIN S, P549
[5]   CODON USAGE IN BACTERIA - CORRELATION WITH GENE EXPRESSIVITY [J].
GOUY, M ;
GAUTIER, C .
NUCLEIC ACIDS RESEARCH, 1982, 10 (22) :7055-7074
[7]  
IKEMURA T, 1985, MOL BIOL EVOL, V2, P13
[8]   THE FOLDING TYPE OF A PROTEIN IS RELEVANT TO THE AMINO-ACID-COMPOSITION [J].
NAKASHIMA, H ;
NISHIKAWA, K ;
OOI, T .
JOURNAL OF BIOCHEMISTRY, 1986, 99 (01) :153-162
[9]  
Richardson J. S., 1989, PREDICTION PROTEIN S, P1
[10]  
SHARP PM, 1986, NUCLEIC ACIDS RES, V14, P7734