ENTROPIC PROFILES OF DNA-SEQUENCES THROUGH CHAOS-GAME-DERIVED IMAGES

被引:44
作者
OLIVER, JL
BERNAOLAGALVAN, P
GUERREROGARCIA, J
ROMANROLDAN, R
机构
[1] UNIV GRANADA,DEPT APPL PHYS,GRANADA,SPAIN
[2] UNIV GRANADA,FAC SCI,DEPT THEORET PHYS,GRANADA,SPAIN
关键词
D O I
10.1006/jtbi.1993.1030
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
A new method to determine entropic profiles in DNA sequences is presented. It is based on the chaos-game representation (CGR) of gene structure, a technique which produces a fractal-like picture of DNA sequences. First, the CGR image was divided into squares 4-m in size (m being the desired resolution), and the point density counted. Second, appropriate intervals were adjusted, and then a histogram of densities was prepared. Third, Shannon’s formula was applied to the probability-distribution histogram, thus obtaining a new entropic estimate for DNA sequences, the histogram entropy, a measurement that goes with the level of constraints on the DNA sequence. Lastly, the entropic profile for the sequence was drawn, by considering the entropies at each resolution level, thus providing a way to summarize the complexity of large genomic regions or even entire genomes at different resolution levels. The application of the method to DNA sequences reveals that entropic profiles obtained in this way, as opposed to previously published ones, clearly discriminate between random and natural DNA sequences. Entropic profiles also show a different degree of variability within and between genomes. The results of these analyses are discussed in relation both to the genome compartmentalization in vertebrates and to the differential action of compositional and/or functional constraints on DNA sequences. © 1993 by Academic Press.
引用
收藏
页码:457 / 470
页数:14
相关论文
共 30 条
[1]  
[Anonymous], 1987, EVOLUTION THERMODYNA
[2]   MONONUCLEOTIDE THROUGH HEXANUCLEOTIDE COMPOSITION OF THE SENSE STRAND OF YEAST DNA - A MARKOV-CHAIN ANALYSIS [J].
ARNOLD, J ;
CUTICCHIA, AJ ;
NEWSOME, DA ;
JENNINGS, WW ;
IVARIE, R .
NUCLEIC ACIDS RESEARCH, 1988, 16 (14) :7145-7158
[3]   THE MOSAIC GENOME OF WARM-BLOODED VERTEBRATES [J].
BERNARDI, G ;
OLOFSSON, B ;
FILIPSKI, J ;
ZERIAL, M ;
SALINAS, J ;
CUNY, G ;
MEUNIERROTIVAL, M ;
RODIER, F .
SCIENCE, 1985, 228 (4702) :953-958
[4]   DNA METHYLATION AND THE FREQUENCY OF CPG IN ANIMAL DNA [J].
BIRD, AP .
NUCLEIC ACIDS RESEARCH, 1980, 8 (07) :1499-1504
[5]  
BLAISDELL BE, 1984, J MOL EVOL, V21, P278
[6]   LINGUISTICS OF NUCLEOTIDE-SEQUENCES - MORPHOLOGY AND COMPARISON OF VOCABULARIES [J].
BRENDEL, V ;
BECKMANN, JS ;
TRIFONOV, EN .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 1986, 4 (01) :11-21
[7]   GENBANK [J].
BURKS, C ;
CASSIDY, M ;
CINKOSKY, MJ ;
CUMELLA, KE ;
GILNA, P ;
HAYDEN, JED ;
KEEN, GM ;
KELLEY, TA ;
KELLY, M ;
KRISTOFFERSON, D ;
RYALS, J .
NUCLEIC ACIDS RESEARCH, 1991, 19 :2221-2225
[8]  
GATLIN L, 1972, INFORMATION THEORY L
[9]  
GUIASU S, 1977, INFORMATION THEORY A
[10]   ON THE VALIDITY OF SHANNON-INFORMATION CALCULATIONS FOR MOLECULAR BIOLOGICAL SEQUENCES [J].
HARIRI, A ;
WEBER, B ;
OLMSTED, J .
JOURNAL OF THEORETICAL BIOLOGY, 1990, 147 (02) :235-254