MATHEMATICAL CHARACTERIZATION OF CHAOS GAME REPRESENTATION - NEW ALGORITHMS FOR NUCLEOTIDE-SEQUENCE ANALYSIS

被引:35
作者
DUTTA, C [1 ]
DAS, J [1 ]
机构
[1] INDIAN INST CHEM BIOL,DIV BIOPHYS,THEORET BIOL GRP,4 RAJA SC MULLICK RD,CALCUTTA 700032,W BENGAL,INDIA
关键词
CHAOS GAME; FRACTAL STRUCTURES; NUCLEOTIDE SEQUENCE; PATTERN ANALYSIS; SIMULATION;
D O I
10.1016/0022-2836(92)90857-G
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Chaos Game Representation (CGR) can recognize patterns in the nucleotide sequences, obtained from databases, of a class of genes using the techniques of fractal structures and by considering DNA sequences as strings composed of four units, G, A, T and C. Such recognition of patterns relies only on visual identification and no mathematical characterization of CGR is known. The present report describes two algorithms that can predict the presence or absence of a stretch of nucleotides in any gene family. The first algorithm can be used to generate DNA sequences represented by any point in the CGR. The second algorithm can simulate known CGR patterns for different gene families by setting the probabilities of occurrence of different di- or trinucleotides by a trial and error process using some guidelines and approximate rules-of-thumb. The validity of the second algorithm has been tested by simulating sequences that can mimic the CGRs of vertebrate non-oncogenes, proto-oncogenes and oncogenes. These algorithms can provide a mathematical basis of the CGR patterns obtained using nucleotide sequences from databases. © 1992.
引用
收藏
页码:715 / 719
页数:5
相关论文
共 12 条
[1]  
Barnsley MF., 2014, FRACTALS EVERYWHERE
[2]   COMPUTERS IN MOLECULAR-BIOLOGY - CURRENT APPLICATIONS AND EMERGING TRENDS [J].
DELISI, C .
SCIENCE, 1988, 240 (4848) :47-52
[3]  
DEVISON D, 1985, B MATH BIOL, V47, P437
[4]  
DEWDNEY AK, 1987, SCI AM, V256, P16
[5]   RIGOROUS PATTERN-RECOGNITION METHODS FOR DNA-SEQUENCES - ANALYSIS OF PROMOTER SEQUENCES FROM ESCHERICHIA-COLI [J].
GALAS, DJ ;
EGGERT, M ;
WATERMAN, MS .
JOURNAL OF MOLECULAR BIOLOGY, 1985, 186 (01) :117-128
[6]   PATTERN-RECOGNITION IN NUCLEIC-ACID SEQUENCES .1. A GENERAL-METHOD FOR FINDING LOCAL HOMOLOGIES AND SYMMETRIES [J].
GOAD, WB ;
KANEHISA, MI .
NUCLEIC ACIDS RESEARCH, 1982, 10 (01) :247-263
[7]   CHAOS GAME VISUALIZATION OF SEQUENCES [J].
JEFFREY, HJ .
COMPUTERS & GRAPHICS, 1992, 16 (01) :25-33
[8]   CHAOS GAME REPRESENTATION OF GENE STRUCTURE [J].
JEFFREY, HJ .
NUCLEIC ACIDS RESEARCH, 1990, 18 (08) :2163-2170
[9]  
SOLOVYEV VV, 1992, IN PRESS INT J GENOM
[10]  
SOLOVYEV VV, 1991, ROLE BIOCOMPUTING CH, P76