The biology of eukaryotic promoter prediction - a review

Cited by: 147
Authors
Pedersen, AG
Baldi, P
Chauvin, Y
Brunak, S
Affiliations
[1] Tech Univ Denmark, Dept Biotechnol, Ctr Biol Sequence Anal, DK-2800 Lyngby, Denmark
[2] Net ID Inc, Los Angeles, CA 90042 USA
Source
COMPUTERS & CHEMISTRY | 1999 / Vol. 23 / Issue 3-4
Keywords
TATA-box; initiator; nucleosomes; transcriptional initiation; genome analysis
DOI
10.1016/S0097-8485(99)00015-7
Chinese Library Classification
O6 [Chemistry]
Discipline classification code
0703
Abstract
Computational prediction of eukaryotic promoters from the nucleotide sequence is one of the most attractive problems in sequence analysis today, but it is also a very difficult one. Thus, current methods predict in the order of one promoter per kilobase in human DNA, while the average distance between functional promoters has been estimated to be in the range of 30-40 kilobases. Although it is conceivable that some of these predicted promoters correspond to cryptic initiation sites that are used in vivo, it is likely that most are false positives. This suggests that it is important to carefully reconsider the biological data that forms the basis of current algorithms, and we here present a review of data that may be useful in this regard. The review covers the following topics: (1) basal transcription and core promoters, (2) activated transcription and transcription factor binding sites, (3) CpG islands and DNA methylation, (4) chromosomal structure and nucleosome modification, and (5) chromosomal domains and domain boundaries. We discuss the possible lessons that may be learned, especially with respect to the wealth of information about epigenetic regulation of transcription that has been appearing in recent years. (C) 1999 Elsevier Science Ltd. All rights reserved.
Pages: 191-207
Page count: 17