Computer model for recognition of functional transcription start sites in RNA polymerase II promoters of vertebrates

被引:43
作者
Bajic, VB [1 ]
Seah, SH [1 ]
Chong, A [1 ]
Krishnan, SPT [1 ]
Koh, JLY [1 ]
Brusic, V [1 ]
机构
[1] BIC, Labs Informat Technol, Computat Immunol Grp, Singapore 119613, Singapore
关键词
promoter modelling; promoter recognition; transcription start site; eukaryotic promoters;
D O I
10.1016/S1093-3263(02)00179-1
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
This paper introduces a new computer system for recognition of functional transcription start sites (TSSs) in RNA polymerase II promoter regions of vertebrates. This system allows scanning complete vertebrate genomes for promoters with significantly reduced number of false positive predictions. It can be used in the context of gene finding through its recognition of the 5' end of genes. The implemented recognition model uses a composite-hierarchical approach, artificial intelligence, statistics, and signal processing techniques. It also exploits the separation of promoter sequences into those that are C + G-rich or C + G-poor. The system was evaluated on a large and diverse human sequence-set and exhibited several times higher accuracy than several publicly available TSS-finding programs. Results obtained using human chromosome 22 data showed even greater specificity than the evaluation set results. The system has been implemented in the Dragon Promoter Finder package, which can be accessed at http://sdmc.krdl.org.sg:8080/promoter/. (C) 2002 Elsevier Science Inc. All rights reserved.
引用
收藏
页码:323 / 332
页数:10
相关论文
共 35 条
  • [21] Ohler U, 2001, Bioinformatics, V17 Suppl 1, pS199
  • [22] The biology of eukaryotic promoter prediction - a review
    Pedersen, AG
    Baldi, P
    Chauvin, Y
    Brunak, S
    [J]. COMPUTERS & CHEMISTRY, 1999, 23 (3-4): : 191 - 207
  • [23] HUMAN DNA TATA BOXES AND TRANSCRIPTION INITIATION SITES - A STATISTICAL STUDY
    PENOTTI, FE
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 213 (01) : 37 - 52
  • [24] The Eukaryotic Promoter Database (EPD)
    Périer, RC
    Praz, V
    Junier, T
    Bonnard, C
    Bucher, P
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 302 - 303
  • [25] UTRdb and UTRsite:: specialized databases of sequences and functional elements of 5′ and 3′ untranslated regions of eukaryotic mRNAs
    Pesole, G
    Liuni, S
    Grillo, G
    Licciulli, F
    Larizza, A
    Makalowski, W
    Saccone, C
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 193 - 196
  • [26] PRESTRIDGE DS, 1999, COMPUTER SOFTWARE EU
  • [27] Genome annotation assessment in Drosophila melanogaster
    Reese, MG
    Hartzell, G
    Harris, NL
    Ohler, U
    Abril, JF
    Lewis, SE
    [J]. GENOME RESEARCH, 2000, 10 (04) : 483 - 501
  • [28] REESE MG, 1999, UNPUB TIME DELAY NEU
  • [29] REESE MG, 1996, P 1996 PAC S BIOC
  • [30] First pass annotation of promoters on human chromosome 22
    Scherf, M
    Klingenhoff, A
    Frech, K
    Quandt, K
    Schneider, R
    Grote, K
    Frisch, M
    Gailus-Durner, V
    Seidel, A
    Brack-Werner, R
    Werner, T
    [J]. GENOME RESEARCH, 2001, 11 (03) : 333 - 340