DERIVATION OF A SCALE-INDEPENDENT PARAMETER WHICH CHARACTERIZES GENETIC SEQUENCE COMPARISONS

Cited by: 1
Authors
DEPETRILLO, PB [1]
BUTTE, AJ [1]
Affiliation
[1] ROGER WILLIAMS GEN HOSP, DIV CLIN PHARMACOL, PROVIDENCE, RI 02908 USA
Source
COMPUTERS AND BIOMEDICAL RESEARCH | 1993 / Vol. 26 / No. 06
DOI
10.1006/cbmr.1993.1037
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
A new method of quantifying the similarity between genetic sequences is presented. The method makes use of the finding that sequence comparisons expressed in binary vector form have an associated scale-independent parameter, D. This parameter is represented in the function M(S, n) = (N) (en/enD), where S is the vector, n represents the window size which is allowed to vary, N is a constant, and D is the scale-independent measure of homology. By comparing two sequences using this method, a unimodal, symmetric distribution of D values associated with the frameshifted vectors is obtained. The degree of sequence similarity is determined by the distribution of these parameters. A set of sequences of evolutionary interest coding for glyceraldehyde-3-phosphate dehydrogenases and mammalian insulins is compared using this methodology. The results confirm evolutionary tree distances calculated using different procedures. Since a z score can be calculated for each comparison, the method allows for the rapid identification of sequence homologies ranked according to the probability of occurrence. This unique scale-independent measure of similarity allows contrasts and comparisons between any two sequence fragments using all available order information. © 1993 Academic Press, Inc.
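The abstract's pipeline (binary comparison vectors, one per frameshift, followed by a z score per comparison) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not give enough detail to compute D, so the sketch substitutes the plain match fraction of each vector as a stand-in statistic. The function names `match_vector`, `frameshift_vectors`, and `z_scores`, and the `min_overlap` parameter, are hypothetical.

```python
from statistics import mean, stdev


def match_vector(a, b, shift):
    """Binary vector over the overlap: 1 where a[i] == b[i + shift], else 0."""
    start = max(0, -shift)
    stop = min(len(a), len(b) - shift)
    return [1 if a[i] == b[i + shift] else 0 for i in range(start, stop)]


def frameshift_vectors(a, b, min_overlap=1):
    """Map each frameshift to its binary comparison vector.

    min_overlap is an assumed cutoff that discards very short overlaps.
    """
    vectors = {}
    for shift in range(-(len(a) - min_overlap), len(b) - min_overlap + 1):
        v = match_vector(a, b, shift)
        if len(v) >= min_overlap:
            vectors[shift] = v
    return vectors


def z_scores(values):
    """Standardize values against their own mean and sample standard deviation."""
    mu = mean(values)
    sigma = stdev(values)
    return [(v - mu) / sigma for v in values]


if __name__ == "__main__":
    seq_a = "ATGGCCCTGTGGATGCGC"
    seq_b = "ATGGCCCTGTGGATGCGC"
    vectors = frameshift_vectors(seq_a, seq_b, min_overlap=6)
    shifts = sorted(vectors)
    # Stand-in statistic (assumption): fraction of matching positions per shift.
    fractions = [sum(vectors[s]) / len(vectors[s]) for s in shifts]
    for s, z in zip(shifts, z_scores(fractions)):
        print(f"shift {s:+d}: z = {z:.2f}")
```

Ranking frameshifts by z score mirrors the ranking step described in the abstract; replacing the match fraction with the paper's D would require the full definition of M(S, n) from the article itself.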
Pages: 517-540
Page count: 24