Distribution of base pair repeats in coding and noncoding DNA sequences

被引:33
作者
Dokholyan, NV [1 ]
Buldyrev, SV
Havlin, S
Stanley, HE
机构
[1] Boston Univ, Ctr Polymer Studies, Dept Phys, Boston, MA 02215 USA
[2] Bar Ilan Univ, Dept Phys, IL-52900 Ramat Gan, Israel
[3] Bar Ilan Univ, Gonda Goldschmied Ctr, IL-52900 Ramat Gan, Israel
关键词
D O I
10.1103/PhysRevLett.79.5182
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
We analyze the histograms for the lengths of the 16 possible distinct repeats of identical dimers, known as dimeric tandem repeats, in DNA sequences. For coding regions, the probability of finding a repetitive sequence of l copies of a particular dimer decreases exponentially as l increases. For the noncoding regions, the distribution functions for most of the 16 dimers have long tails and can be approximated by power-law functions. while for coding DNA, they can be well fit by a first-order Markov process, We propose a model, based on known biophysical processes, which leads to the observed probability distribution functions for noncoding DNA. We argue that this difference in the shape of the distribution functions between coding and noncoding DNA arises from the fact that noncoding DNA is more tolerant to evolutionary mutational alterations than coding DNA.
引用
收藏
页码:5182 / 5185
页数:4
相关论文
共 19 条
[1]  
Alberts B., 1994, MOL BIOL CELL
[2]   CHARACTERIZING LONG-RANGE CORRELATIONS IN DNA-SEQUENCES FROM WAVELET ANALYSIS [J].
ARNEODO, A ;
BACRY, E ;
GRAVES, PV ;
MUZY, JF .
PHYSICAL REVIEW LETTERS, 1995, 74 (16) :3293-3296
[3]   The length distribution of perfect dimer repetitive DNA is consistent with its evolution by an unbiased single-step mutation process [J].
Bell, GI ;
Jurka, J .
JOURNAL OF MOLECULAR EVOLUTION, 1997, 44 (04) :414-421
[4]   HIGH-RESOLUTION OF HUMAN EVOLUTIONARY TREES WITH POLYMORPHIC MICROSATELLITES [J].
BOWCOCK, AM ;
RUIZLINARES, A ;
TOMFOHRDE, J ;
MINCH, E ;
KIDD, JR ;
CAVALLISFORZA, LL .
NATURE, 1994, 368 (6470) :455-457
[5]   LONG-RANGE CORRELATION-PROPERTIES OF CODING AND NONCODING DNA-SEQUENCES - GENBANK ANALYSIS [J].
BULDYREV, SV ;
GOLDBERGER, AL ;
HAVLIN, S ;
MANTEGNA, RN ;
MATSA, ME ;
PENG, CK ;
SIMONS, M ;
STANLEY, HE .
PHYSICAL REVIEW E, 1995, 51 (05) :5084-5091
[6]  
BULDYREV SV, 1994, FRACTALS SCI
[7]  
FRICKETT JW, 1992, NUCLEIC ACIDS RES, V20, P6441
[8]  
Gnedenko B.V, 1968, THEORY PROBABILITY
[9]   UBIQUITOUS SOMATIC MUTATIONS IN SIMPLE REPEATED SEQUENCES REVEAL A NEW MECHANISM FOR COLONIC CARCINOGENESIS [J].
IONOV, Y ;
PEINADO, MA ;
MALKHOSYAN, S ;
SHIBATA, D ;
PERUCHO, M .
NATURE, 1993, 363 (6429) :558-561
[10]   SIMPLE REPETITIVE DNA-SEQUENCES FROM PRIMATES - COMPILATION AND ANALYSIS [J].
JURKA, J ;
PETHIYAGODA, C .
JOURNAL OF MOLECULAR EVOLUTION, 1995, 40 (02) :120-126