DNA sequence and structural properties as predictors of human and mouse promoters

被引:41
作者
Akan, Pelin [1 ]
Deloukas, Panos [1 ]
机构
[1] Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England
基金
英国惠康基金;
关键词
promoter; human; mouse; genome-wide computational analysis; CpG island; TATA-box; DNA bendability; propeller twist; nucleosome positioning preference; ATG desert;
D O I
10.1016/j.gene.2007.12.011
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Promoters play a central role in gene regulation, yet our power to discriminate them from non-promoter sequences in higher eukaryotes is mainly restricted to those associated with CpG islands. Here, we examined in silico the promoters of 30,954 human and 18,083 mouse transcripts in the DBTSS database, to assess the impact of particular sequence and structural features (propeller twist, bendability and nucleosome positioning preference) on promoter classification and prediction. Our analysis showed that a stricter-than-traditional definition of CpG islands captures low and high CpG count promoter classes more accurately than the traditional one. We observed that both human and mouse promoter sequences are flexible with the exception of the TATA box and TSS, which are rigid regions irrespective of association with a CpG island. Therefore varying levels of structural flexibility in promoters may affect their accessibility to proteins, and hence their specificity. For all features investigated, averaged values across core promoters discriminated CpG island associated promoters from background, whereas the same did not hold for promoters without a CpG island. However, local changes around -34 to -23 (expected position of TATA box) and the TSS were informative in discriminating promoters (both classes) from non-promoter sequences. Additionally, we investigated ATG deserts and observed that they occur in all promoter sets except those with a TATA-box and without a CpG island in human. Interestingly, all mouse promoter sets showed ATG codon depletion irrespective of the presence of a TATA-box, possibly reflecting a weaker contribution to TSS specificity in mouse. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:165 / 176
页数:12
相关论文
共 68 条
[21]   Sequence-dependent DNA structure: A database of octamer structural parameters [J].
Gardiner, EJ ;
Hunter, CA ;
Packer, MJ ;
Palmer, DS ;
Willett, P .
JOURNAL OF MOLECULAR BIOLOGY, 2003, 332 (05) :1025-1035
[22]   CPG ISLANDS IN VERTEBRATE GENOMES [J].
GARDINERGARDEN, M ;
FROMMER, M .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 196 (02) :261-282
[23]   Synergy of human Pol II core promoter elements revealed by statistical sequence analysis [J].
Gershenzon, NI ;
Ioshikhes, IP .
BIOINFORMATICS, 2005, 21 (08) :1295-1300
[24]   YEAST TATA-BINDING PROTEIN TFIID BINDS TO TATA ELEMENTS WITH BOTH CONSENSUS AND NONCONSENSUS DNA-SEQUENCES [J].
HAHN, S ;
BURATOWSKI, S ;
SHARP, PA ;
GUARENTE, L .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1989, 86 (15) :5718-5722
[25]  
Hannenhalli S, 2001, Bioinformatics, V17 Suppl 1, pS90
[26]   5′-end SAGE for the analysis of transcriptional start sites [J].
Hashimoto, S ;
Suzuki, Y ;
Kasai, Y ;
Morohoshi, K ;
Yamada, T ;
Sese, J ;
Morishita, S ;
Sugano, S ;
Matsushima, K .
NATURE BIOTECHNOLOGY, 2004, 22 (09) :1146-1149
[27]   FACILITATED BINDING OF TATA-BINDING PROTEIN TO NUCLEOSOMAL DNA [J].
IMBALZANO, AN ;
KWON, H ;
GREEN, MR ;
KINGSTON, RE .
NATURE, 1994, 370 (6489) :481-485
[28]   Structure and function of a human TAFII250 double bromodomain module [J].
Jacobson, RH ;
Ladurner, AG ;
King, DS ;
Tjian, R .
SCIENCE, 2000, 288 (5470) :1422-1425
[29]   Structural properties of promoters: similarities and differences between prokaryotes and eukaryotes [J].
Kanhere, A ;
Bansal, M .
NUCLEIC ACIDS RESEARCH, 2005, 33 (10) :3165-3175
[30]   Evidence for widespread degradation of gene control regions in hominid genomes [J].
Keightley, PD ;
Lercher, MJ ;
Eyre-Walker, A .
PLOS BIOLOGY, 2005, 3 (02) :282-288