LINGUISTIC FEATURES OF NONCODING DNA-SEQUENCES

被引:229
作者
MANTEGNA, RN
BULDYREV, SV
GOLDBERGER, AL
HAVLIN, S
PENG, CK
SIMONS, M
STANLEY, HE
机构
[1] BOSTON UNIV, DEPT PHYS, BOSTON, MA 02215 USA
[2] HARVARD UNIV, BETH ISRAEL HOSP, SCH MED, DIV CARDIOVASC, BOSTON, MA 02215 USA
关键词
D O I
10.1103/PhysRevLett.73.3169
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
We extend the Zipf approach to analyzing linguistic texts to the statistical study of DNA base pair sequences and find that the noncoding regions are more similar to natural languages than the coding regions. We also adapt the Shannon approach to quantifying the "redundancy" of a linguistic text in terms of a measurable entropy function, and demonstrate that noncoding regions in eukaryotes display a smaller entropy and larger redundancy than coding regions, supporting the possibility that noncoding regions of DNA may carry biological information. © 1994 The American Physical Society.
引用
收藏
页码:3169 / 3172
页数:4
相关论文
共 20 条