Lack of biological significance in the 'linguistic features' of noncoding DNA-a quantitative analysis

被引:19
作者
ChatzidimitriouDreismann, CA
Streffer, RMF
Larhammar, D
机构
[1] UPPSALA UNIV, DEPT MED PHARMACOL, S-75124 UPPSALA, SWEDEN
[2] INST BASIC RES DEV DISABIL, I-86075 MONTERODUNI, IS, ITALY
关键词
D O I
10.1093/nar/24.9.1676
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Recently, the application of two statistical methods (related to Zipf's distribution and Shannon's redundancy), called 'linguistic' tests, to the primary structure of DNA sequences of living organisms has excited considerable interest. Of particular importance is the claim that noncoding DNA sequences in eukaryotes display specific 'linguistic' features, being reminiscent of natural languages. Furthermore, this implies that noncoding regions of DNA may carry some new, thus far unknown, biological information which is revealed by these tests. In this paper these claims are tested quantitatively. With the aid of computer simulations of natural DNA sequences, and by applying the same 'linguistic' tests to both natural and artificial sequences, we investigate in detail the reasons of the appearance of the claimed 'linguistic' features and the associated differences between coding and noncoding DNAs. The presented results show quantitatively that the 'linguistic' tests failed to reveal any new biological information in (noncoding or coding) DNA.
引用
收藏
页码:1676 / 1681
页数:6
相关论文
共 17 条
[1]  
Bonhoeffer S, 1996, SCIENCE, V271, P14
[2]  
BUCHBINDCER H, 1995, SCIENCE MAY, P8
[3]   VARIATIONS IN BASE-PAIR COMPOSITION AND ASSOCIATED LONG-RANGE CORRELATIONS IN DNA-SEQUENCES - COMPUTER-SIMULATION RESULTS [J].
CHATZIDIMITRIOUDREISMANN, CA ;
STREFFER, RMF ;
LARHAMMAR, D .
BIOCHIMICA ET BIOPHYSICA ACTA-GENE STRUCTURE AND EXPRESSION, 1994, 1217 (02) :181-187
[4]   A QUANTITATIVE TEST OF LONG-RANGE CORRELATIONS AND COMPOSITIONAL FLUCTUATIONS IN DNA-SEQUENCES [J].
CHATZIDIMITRIOUDREISMANN, CA ;
STREFFER, RMF ;
LARHAMMAR, D .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 1994, 224 (02) :365-371
[5]   HINTS OF A LANGUAGE IN JUNK DNA [J].
FLAM, F .
SCIENCE, 1994, 266 (5189) :1320-1320
[6]   PSEUDORANDOM NUMBER GENERATOR FOR MASSIVELY-PARALLEL MOLECULAR-DYNAMICS SIMULATIONS [J].
HOLIAN, BL ;
PERCUS, OE ;
WARNOCK, TT ;
WHITLOCK, PA .
PHYSICAL REVIEW E, 1994, 50 (02) :1607-1615
[7]   PATCHINESS AND CORRELATIONS IN DNA-SEQUENCES [J].
KARLIN, S ;
BRENDEL, V .
SCIENCE, 1993, 259 (5095) :677-680
[8]   NONCODING DNA, ZIPFS LAW, AND LANGUAGE [J].
KONOPKA, AK ;
MARTINDALE, C .
SCIENCE, 1995, 268 (5212) :789-789
[9]   BIOLOGICAL ORIGINS OF LONG-RANGE CORRELATIONS AND COMPOSITIONAL VARIATIONS IN DNA [J].
LARHAMMAR, D ;
CHATZIDIMITRIOUDREISMANN, CA .
NUCLEIC ACIDS RESEARCH, 1993, 21 (22) :5167-5170
[10]   LINGUISTIC FEATURES OF NONCODING DNA-SEQUENCES [J].
MANTEGNA, RN ;
BULDYREV, SV ;
GOLDBERGER, AL ;
HAVLIN, S ;
PENG, CK ;
SIMONS, M ;
STANLEY, HE .
PHYSICAL REVIEW LETTERS, 1994, 73 (23) :3169-3172