WORDUP - AN EFFICIENT ALGORITHM FOR DISCOVERING STATISTICALLY SIGNIFICANT PATTERNS IN DNA-SEQUENCES

被引:48
作者
PESOLE, G [1 ]
PRUNELLA, N [1 ]
LIUNI, S [1 ]
ATTIMONELLI, M [1 ]
SACCONE, C [1 ]
机构
[1] CNR, CTR STUDIO MITOCONDRI & METAB ENERGENT, I-70126 BARI, ITALY
关键词
D O I
10.1093/nar/20.11.2871
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We present here a fast and sensitive method designed to isolate short nucleotide sequences which have non-random statistical properties and may thus be biologically active. It is based on a first order Markov analysis and allows us to detect statistically significant sequence motifs from six to ten nucleotides long which are significantly shared (or avoided) in the sequences under investigation. This method has been tested on a set of 521 sequences extracted from the Eukaryotic Promoter Database (2). Our results demonstrate the accuracy and the efficiency of the method in that the sequence motifs which are known to act as eukaryotic promoters, such as the TATA-box and the CAAT-box, were clearly identified. In addition we have found other statistically significant motifs, the biological roles of which are yet to be clarified.
引用
收藏
页码:2871 / 2875
页数:5
相关论文
共 35 条
[1]  
ATTIMONELLI M, UNPUB
[2]  
BECKMANN JS, 1986, BIOMOLECULAR STRUCTU, V4, P91
[3]   EVOLUTION OF THE GENOME AND THE GENETIC-CODE - SELECTION AT THE DINUCLEOTIDE LEVEL BY METHYLATION AND POLYRIBONUCLEOTIDE CLEAVAGE [J].
BEUTLER, E ;
GELBART, T ;
HAN, JH ;
KOZIOL, JA ;
BEUTLER, B .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1989, 86 (01) :192-196
[5]   A GENERAL RULE FOR RANGED SERIES OF CODON FREQUENCIES IN DIFFERENT GENOMES [J].
BORODOVSKY, MY ;
GUSEIN-ZADE, SM .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 1989, 6 (05) :1001-1012
[6]  
BOYER RS, 1977, COMMUN ACM, V20, P62
[7]  
BREATHNACH R, 1981, ANNU REV BIOCHEM, V50, P349, DOI 10.1146/annurev.bi.50.070181.002025
[8]   LINGUISTICS OF NUCLEOTIDE-SEQUENCES - MORPHOLOGY AND COMPARISON OF VOCABULARIES [J].
BRENDEL, V ;
BECKMANN, JS ;
TRIFONOV, EN .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 1986, 4 (01) :11-21
[9]   SIGNAL SEARCH ANALYSIS - A NEW METHOD TO LOCALIZE AND CHARACTERIZE FUNCTIONALLY IMPORTANT DNA-SEQUENCES [J].
BUCHER, P ;
BRYAN, B .
NUCLEIC ACIDS RESEARCH, 1984, 12 (01) :287-305
[10]  
BUCHER P, 1991, EUKARYOTIC PROMOTER