LINGUISTIC MEASURE OF TAXONOMIC AND FUNCTIONAL RELATEDNESS OF NUCLEOTIDE-SEQUENCES

被引:48
作者
PIETROKOVSKI, S [1 ]
HIRSHON, J [1 ]
TRIFONOV, EN [1 ]
机构
[1] LONG ISL UNIV,DEPT BOT,BROOKLYN,NY 11201
关键词
D O I
10.1080/07391102.1990.10508563
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The frequencies of “words”, oligonucleotides within nucleotide sequences, reflect the genetic information contained in the sequence “texts”. Nucleotide sequences are characteristically represented by their contrast word vocabularies. Comparison of the sequences by correlating their contrast vocabularies is shown to reflect well the relatedness (unrelatedness) between the sequences. A single value, the linguistic similarity between the sequences, is suggested as a measure of sequence relatedness. Sequences as short as 1000 bases can be characterized and quantitatively related to other sequences by this technique. The linguistic sequence similarity value is used for analysis of taxonomically and functionally diverse nucleotide sequences. The similarity value is shown to be very sensitive to the relatedness of the source species, thus providing a convenient tool for taxonomic classification of species by their sequence vocabularies. Functionally diverse sequences appear distinct by their linguistic similarity values. This can be a basis for a quick screening technique for functional characterization of the sequences and for mapping functionally distinct regions in long sequences. © Taylor & Francis Group, LLC.
引用
收藏
页码:1251 / 1268
页数:18
相关论文
共 39 条
[1]   INTERVENING SEQUENCES EXHIBIT DISTINCT VOCABULARY [J].
BECKMANN, JS ;
BRENDEL, V ;
TRIFONOV, EN .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 1986, 4 (03) :391-400
[2]  
Bennet W. R., 1976, SCI ENG PROBLEM SOLV
[4]   LINGUISTICS OF NUCLEOTIDE-SEQUENCES - MORPHOLOGY AND COMPARISON OF VOCABULARIES [J].
BRENDEL, V ;
BECKMANN, JS ;
TRIFONOV, EN .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 1986, 4 (01) :11-21
[5]   COMPUTERS IN MOLECULAR-BIOLOGY - CURRENT APPLICATIONS AND EMERGING TRENDS [J].
DELISI, C .
SCIENCE, 1988, 240 (4848) :47-52
[6]   A COMPREHENSIVE SET OF SEQUENCE-ANALYSIS PROGRAMS FOR THE VAX [J].
DEVEREUX, J ;
HAEBERLI, P ;
SMITHIES, O .
NUCLEIC ACIDS RESEARCH, 1984, 12 (01) :387-395
[7]   THE PHYLOGENY OF PROKARYOTES [J].
FOX, GE ;
STACKEBRANDT, E ;
HESPELL, RB ;
GIBSON, J ;
MANILOFF, J ;
DYER, TA ;
WOLFE, RS ;
BALCH, WE ;
TANNER, RS ;
MAGRUM, LJ ;
ZABLEN, LB ;
BLAKEMORE, R ;
GUPTA, R ;
BONEN, L ;
LEWIS, BJ ;
STAHL, DA ;
LUEHRSEN, KR ;
CHEN, KN ;
WOESE, CR .
SCIENCE, 1980, 209 (4455) :457-463
[8]   SEQUENCE OF SIMIAN IMMUNODEFICIENCY VIRUS AND ITS RELATIONSHIP TO THE HUMAN IMMUNODEFICIENCY VIRUSES [J].
FRANCHINI, G ;
GURGO, C ;
GUO, HG ;
GALLO, RC ;
COLLALTI, E ;
FARGNOLI, KA ;
HALL, LF ;
WONGSTAAL, F ;
REITZ, MS .
NATURE, 1987, 328 (6130) :539-543
[9]  
GOAD WB, 1986, ANNU REV BIOPHYS BIO, V15, P79
[10]   WORKINGS OF THE GENETIC-CODE [J].
GRANTHAM, R .
TRENDS IN BIOCHEMICAL SCIENCES, 1980, 5 (12) :327-331