Promoter2.0: for the recognition of PolII promoter sequences

Cited by: 276
Authors
Knudsen, S [1]
Affiliation
[1] Tech Univ Denmark, Ctr Biol Sequence Anal, DK-2800 Lyngby, Denmark
DOI
10.1093/bioinformatics/15.5.356
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Motivation: A new approach to the prediction of eukaryotic PolII promoters from DNA sequence takes advantage of a combination of elements similar to neural networks and genetic algorithms to recognize a set of discrete subpatterns with variable separation as one pattern: a promoter. The neural networks use as input a small window of DNA sequence, as well as the output of other neural networks. Through the use of genetic algorithms, the weights in the neural networks are optimized to discriminate maximally between promoters and non-promoters.

Results: After several thousand generations of optimization, the algorithm was able to discriminate between vertebrate promoter and non-promoter sequences in a test set with a correlation coefficient of 0.63. In addition, all five known transcription start sites on the plus strand of the complete adenovirus genome were within 161 bp of 35 predicted transcription start sites. On standardized test sets consisting of human genomic DNA, the performance of Promoter2.0 compares well with other software developed for the same purpose.
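
The idea in the abstract, network weights tuned by a genetic algorithm to separate promoters from non-promoters, can be illustrated with a toy sketch. This is not the Promoter2.0 implementation: the single threshold unit, the one-hot encoding, the population size, the mutation rate, and the toy sequences are all assumptions made for illustration. The chaining of networks described in the abstract is omitted, and the fitness is assumed to be the Matthews correlation coefficient.

```python
import math
import random

BASES = "ACGT"

def one_hot(window):
    """Encode a DNA window as a flat list of 0/1 values, four per base."""
    return [1.0 if base == b else 0.0 for base in window for b in BASES]

def predict(weights, window):
    """Single threshold unit over the window; the last weight is the bias."""
    x = one_hot(window)
    z = sum(w * xi for w, xi in zip(weights, x)) + weights[-1]
    return z > 0

def correlation(predicted, actual):
    """Matthews correlation coefficient for two lists of booleans."""
    pairs = list(zip(predicted, actual))
    tp = sum(1 for p, a in pairs if p and a)
    tn = sum(1 for p, a in pairs if not p and not a)
    fp = sum(1 for p, a in pairs if p and not a)
    fn = sum(1 for p, a in pairs if not p and a)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0

def evolve(windows, labels, generations=200, pop_size=20, sigma=0.5, seed=0):
    """Evolve weight vectors; return the fittest one and its fitness."""
    rng = random.Random(seed)
    n = 4 * len(windows[0]) + 1  # one weight per one-hot input, plus bias

    def fitness(w):
        return correlation([predict(w, s) for s in windows], labels)

    population = [[rng.gauss(0, 1) for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        # Keep the fitter half unchanged (elitism), refill with offspring.
        population.sort(key=fitness, reverse=True)
        parents = population[: pop_size // 2]
        children = []
        for _ in range(pop_size - len(parents)):
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n)  # one-point crossover
            child = a[:cut] + b[cut:]
            # Mutate each weight with probability 0.1 (assumed rate).
            child = [w + rng.gauss(0, sigma) if rng.random() < 0.1 else w
                     for w in child]
            children.append(child)
        population = parents + children
    best = max(population, key=fitness)
    return best, fitness(best)

# Hypothetical toy data: windows containing "TATA" are labelled as promoters.
toy_windows = ["GTATAA", "CTATAT", "TATAAG", "ATATAC",
               "GGCGCC", "CCGGGC", "GCGCGA", "CGCCGG"]
toy_labels = [True, True, True, True, False, False, False, False]

best_weights, best_fitness = evolve(toy_windows, toy_labels)
```

Because the fitter half of each generation survives unchanged, the best fitness never decreases from one generation to the next. That makes the correlation coefficient usable directly as the optimization target even though it is not differentiable.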
Pages: 356-361
Page count: 6