Promoter prediction analysis on the whole human genome

被引:113
作者
Bajic, VB
Tan, SL
Suzuki, Y
Sugano, S
机构
[1] Inst Infocomm Res, Singapore 119613, Singapore
[2] Univ Tokyo, Ctr Human Genome, Minato Ku, Tokyo 1088639, Japan
关键词
D O I
10.1038/nbt1032
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Promoter prediction programs (PPPs) are important for in silico gene discovery without support from expressed sequence tag (EST)/cDNA/mRNA sequences, in the analysis of gene regulation and in genome annotation. Contrary to previous expectations, a comprehensive analysis of PPPs reveals that no program simultaneously achieves sensitivity and a positive predictive value >65%. PPP performances deduced from a limited number of chromosomes or smaller data sets do not hold when evaluated at the level of the whole genome, with serious inaccuracy of predictions for non-CpG-island-related promoters. Some PPPs even perform worse than, or close to, pure random guessing.
引用
收藏
页码:1467 / 1473
页数:7
相关论文
共 29 条
  • [1] Bajic V B, 2000, Brief Bioinform, V1, P214, DOI 10.1093/bib/1.3.214
  • [2] Dragon Gene Start Finder: An advanced system for finding approximate locations of the start of gene transcriptional units
    Bajic, VB
    Seah, SH
    [J]. GENOME RESEARCH, 2003, 13 (08) : 1923 - 1929
  • [3] Dragon Gene Start Finder identifies approximate locations of the 5′ ends of genes
    Bajic, VB
    Seah, SH
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (13) : 3560 - 3563
  • [4] Computer model for recognition of functional transcription start sites in RNA polymerase II promoters of vertebrates
    Bajic, VB
    Seah, SH
    Chong, A
    Krishnan, SPT
    Koh, JLY
    Brusic, V
    [J]. JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2003, 21 (05) : 323 - 332
  • [5] Computational identification of promoters and first exons in the human genome
    Davuluri, RV
    Grosse, I
    Zhang, MQ
    [J]. NATURE GENETICS, 2001, 29 (04) : 412 - 417
  • [6] Computational detection and location of transcription start sites in mammalian genomic DNA
    Down, TA
    Hubbard, TJP
    [J]. GENOME RESEARCH, 2002, 12 (03) : 458 - 461
  • [7] Eukaryotic promoter recognition
    Fickett, JW
    Hatzigeorgiou, AC
    [J]. GENOME RESEARCH, 1997, 7 (09) : 861 - 878
  • [8] Large-scale human promoter mapping using CpG islands
    Ioshikhes, IP
    Zhang, MQ
    [J]. NATURE GENETICS, 2000, 26 (01) : 61 - 63
  • [9] Promoter2.0: for the recognition of PolII promoter sequences
    Knudsen, S
    [J]. BIOINFORMATICS, 1999, 15 (05) : 356 - 361
  • [10] Initial sequencing and analysis of the human genome
    Lander, ES
    Int Human Genome Sequencing Consortium
    Linton, LM
    Birren, B
    Nusbaum, C
    Zody, MC
    Baldwin, J
    Devon, K
    Dewar, K
    Doyle, M
    FitzHugh, W
    Funke, R
    Gage, D
    Harris, K
    Heaford, A
    Howland, J
    Kann, L
    Lehoczky, J
    LeVine, R
    McEwan, P
    McKernan, K
    Meldrim, J
    Mesirov, JP
    Miranda, C
    Morris, W
    Naylor, J
    Raymond, C
    Rosetti, M
    Santos, R
    Sheridan, A
    Sougnez, C
    Stange-Thomann, N
    Stojanovic, N
    Subramanian, A
    Wyman, D
    Rogers, J
    Sulston, J
    Ainscough, R
    Beck, S
    Bentley, D
    Burton, J
    Clee, C
    Carter, N
    Coulson, A
    Deadman, R
    Deloukas, P
    Dunham, A
    Dunham, I
    Durbin, R
    French, L
    [J]. NATURE, 2001, 409 (6822) : 860 - 921