LONG-RANGE CORRELATION AND PARTIAL 1/F-ALPHA SPECTRUM IN A NONCODING DNA-SEQUENCE

被引:387
作者
LI, W
KANEKO, K
机构
[1] SANTA FE INST, SANTA FE, NM 87501 USA
[2] UNIV TOKYO, DEPT PURE & APPL SCI, TOKYO 153, JAPAN
来源
EUROPHYSICS LETTERS | 1992年 / 17卷 / 07期
关键词
GENERAL; THEORETICAL; MATHEMATICAL BIOPHYSICS; PROBABILITY THEORY; STOCHASTIC PROCESSES; STATISTICS;
D O I
10.1209/0295-5075/17/7/014
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Mutual information function, which is an alternative to correlation function for symbolic sequences, and a <<symbolic spectrum>> are calculated for a human DNA sequence containing mostly intron segments, those that do not code for proteins. It is observed that the mutual information function of this sequence decays very slowly, and the correlation length is extremely long (at least 800 bases). The symbolic spectrum of the sequence at very low frequencies can be approximated by 1/f(alpha), where f is the frequency and alpha ranges from 0.5 to 0.85. It is suggested that the existence of the repetitive patterns in the sequence is mainly responsible for the observed long-range correlation. A possible connection between this long-range correlation and those in music notes is also briefly discussed.
引用
收藏
页码:655 / 660
页数:6
相关论文
共 27 条
  • [1] [Anonymous], 1984, DRIPPING FAUCET MODE
  • [2] REPEATED SEQUENCES IN DNA
    BRITTEN, RJ
    KOHNE, DE
    [J]. SCIENCE, 1968, 161 (3841) : 529 - &
  • [3] REPEATED SEGMENTS OF DNA
    BRITTEN, RJ
    KOHNE, DE
    [J]. SCIENTIFIC AMERICAN, 1970, 222 (04) : 24 - &
  • [4] BURKS C, 1990, METHOD ENZYMOL, V183, P3
  • [5] GENBANK
    BURKS, C
    CASSIDY, M
    CINKOSKY, MJ
    CUMELLA, KE
    GILNA, P
    HAYDEN, JED
    KEEN, GM
    KELLEY, TA
    KELLY, M
    KRISTOFFERSON, D
    RYALS, J
    [J]. NUCLEIC ACIDS RESEARCH, 1991, 19 : 2221 - 2225
  • [6] ELECTRONIC DATA PUBLISHING AND GENBANK
    CINKOSKY, MJ
    FICKETT, JW
    GILNA, P
    BURKS, C
    [J]. SCIENCE, 1991, 252 (5010) : 1273 - 1277
  • [7] RECOGNITION OF PROTEIN CODING REGIONS IN DNA-SEQUENCES
    FICKETT, JW
    [J]. NUCLEIC ACIDS RESEARCH, 1982, 10 (17) : 5303 - 5318
  • [8] REPETITIVE SEQUENCES IN EUKARYOTIC DNA AND THEIR EXPRESSION
    JELINEK, WR
    SCHMID, CW
    [J]. ANNUAL REVIEW OF BIOCHEMISTRY, 1982, 51 : 813 - 844
  • [9] LI W, 1989, UNPUB
  • [10] LI W, 1991, SFI91002 SANT FE I