Toward a gold standard for promoter prediction evaluation

被引:196
作者
Abeel, Thomas [1 ,2 ]
Van de Peer, Yves [1 ,2 ]
Saeys, Yvan [1 ,2 ]
机构
[1] Univ Ghent VIB, Dept Plant Syst Biol, B-9052 Ghent, Belgium
[2] Univ Ghent, Dept Plant Biotechnol & Genet, B-9052 Ghent, Belgium
关键词
TRANSCRIPTION START SITES; COMPUTATIONAL DETECTION; IDENTIFICATION; RECOGNITION;
D O I
10.1093/bioinformatics/btp191
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Promoter prediction is an important task in genome annotation projects, and during the past years many new promoter prediction programs (PPPs) have emerged. However, many of these programs are compared inadequately to other programs. In most cases, only a small portion of the genome is used to evaluate the program, which is not a realistic setting for whole genome annotation projects. In addition, a common evaluation design to properly compare PPPs is still lacking. Results: We present a large-scale benchmarking study of 17 state-of-the-art PPPs. A multi-faceted evaluation strategy is proposed that can be used as a gold standard for promoter prediction evaluation, allowing authors of promoter prediction software to compare their method to existing methods in a proper way. This evaluation strategy is subsequently used to compare the chosen promoter predictors, and an in-depth analysis on predictive performance, promoter class specificity, overlap between predictors and positional bias of the predictions is conducted.
引用
收藏
页码:I313 / I320
页数:8
相关论文
共 24 条
  • [1] ProSOM:: core promoter prediction based on unsupervised clustering of DNA physical profiles
    Abeel, Thomas
    Saeys, Yvan
    Rouze, Pierre
    Van de Peer, Yves
    [J]. BIOINFORMATICS, 2008, 24 (13) : I24 - I31
  • [2] Generic eukaryotic core promoter prediction using structural features of DNA
    Abeel, Thomas
    Saeys, Yvan
    Bonnet, Eric
    Rouze, Pierre
    Van de Peer, Yves
    [J]. GENOME RESEARCH, 2008, 18 (02) : 310 - 323
  • [3] Promoter prediction analysis on the whole human genome
    Bajic, VB
    Tan, SL
    Suzuki, Y
    Sugano, S
    [J]. NATURE BIOTECHNOLOGY, 2004, 22 (11) : 1467 - 1473
  • [4] Bajic VB, 2003, METHOD ENZYMOL, V370, P237
  • [5] Dragon Promoter Finder: recognition of vertebrate RNA polymerase II promoters
    Bajic, VB
    Seah, SH
    Chong, A
    Zhang, GL
    Koh, JLY
    Brusic, V
    [J]. BIOINFORMATICS, 2002, 18 (01) : 198 - 199
  • [6] Performance assessment of promoter predictions on ENCODE regions in the EGASP experiment
    Bajic, Vladimir B.
    Brent, Michael R.
    Brown, Randall H.
    Frankish, Adam
    Harrow, Jennifer
    Ohler, Uwe
    Solovyev, Victor V.
    Tan, Sin Lam
    [J]. GENOME BIOLOGY, 2006, 7 (Suppl 1)
  • [7] Computational identification of promoters and first exons in the human genome
    Davuluri, RV
    Grosse, I
    Zhang, MQ
    [J]. NATURE GENETICS, 2001, 29 (04) : 412 - 417
  • [8] Computational detection and location of transcription start sites in mammalian genomic DNA
    Down, TA
    Hubbard, TJP
    [J]. GENOME RESEARCH, 2002, 12 (03) : 458 - 461
  • [9] Eukaryotic promoter recognition
    Fickett, JW
    Hatzigeorgiou, AC
    [J]. GENOME RESEARCH, 1997, 7 (09) : 861 - 878
  • [10] A code for transcription initiation in mammalian genomes
    Frith, Martin C.
    Valen, Eivind
    Krogh, Anders
    Hayashizaki, Yoshihide
    Carninci, Piero
    Sandelin, Albin
    [J]. GENOME RESEARCH, 2008, 18 (01) : 1 - 12