DNA sequence and structural properties as predictors of human and mouse promoters

被引:41
作者
Akan, Pelin [1 ]
Deloukas, Panos [1 ]
机构
[1] Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England
基金
英国惠康基金;
关键词
promoter; human; mouse; genome-wide computational analysis; CpG island; TATA-box; DNA bendability; propeller twist; nucleosome positioning preference; ATG desert;
D O I
10.1016/j.gene.2007.12.011
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Promoters play a central role in gene regulation, yet our power to discriminate them from non-promoter sequences in higher eukaryotes is mainly restricted to those associated with CpG islands. Here, we examined in silico the promoters of 30,954 human and 18,083 mouse transcripts in the DBTSS database, to assess the impact of particular sequence and structural features (propeller twist, bendability and nucleosome positioning preference) on promoter classification and prediction. Our analysis showed that a stricter-than-traditional definition of CpG islands captures low and high CpG count promoter classes more accurately than the traditional one. We observed that both human and mouse promoter sequences are flexible with the exception of the TATA box and TSS, which are rigid regions irrespective of association with a CpG island. Therefore varying levels of structural flexibility in promoters may affect their accessibility to proteins, and hence their specificity. For all features investigated, averaged values across core promoters discriminated CpG island associated promoters from background, whereas the same did not hold for promoters without a CpG island. However, local changes around -34 to -23 (expected position of TATA box) and the TSS were informative in discriminating promoters (both classes) from non-promoter sequences. Additionally, we investigated ATG deserts and observed that they occur in all promoter sets except those with a TATA-box and without a CpG island in human. Interestingly, all mouse promoter sets showed ATG codon depletion irrespective of the presence of a TATA-box, possibly reflecting a weaker contribution to TSS specificity in mouse. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:165 / 176
页数:12
相关论文
共 68 条
[1]   Comprehensive analysis of the base composition around the transcription start site in Metazoa [J].
Aerts, S ;
Thijs, G ;
Dabrowski, M ;
Moreau, Y ;
De Moor, B .
BMC GENOMICS, 2004, 5 (1)
[2]   Structure, function and evolution of CpG island promoters [J].
Antequera, F .
CELLULAR AND MOLECULAR LIFE SCIENCES, 2003, 60 (08) :1647-1658
[3]   Investigating extended regulatory regions of genomic DNA sequences [J].
Babenko, VN ;
Kosarev, PS ;
Vishnevsky, OV ;
Levitsky, VG ;
Basin, VV ;
Frolov, AS .
BIOINFORMATICS, 1999, 15 (7-8) :644-653
[4]  
BAJIC VB, 2004, IN SILICO BIOL, V4, P109
[5]   CAP BINDING-SITES REVEAL PYRIMIDINE-PURINE PATTERN CHARACTERISTIC OF DNA BENDING [J].
BARBER, AM ;
ZHURKIN, VB .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 1990, 8 (02) :213-232
[6]   SEQUENCE-DEPENDENT BENDING PROPENSITY OF DNA AS REVEALED BY DNASE-I - PARAMETERS FOR TRINUCLEOTIDES [J].
BRUKNER, I ;
SANCHEZ, R ;
SUCK, D ;
PONGOR, S .
EMBO JOURNAL, 1995, 14 (08) :1812-1818
[7]   TRINUCLEOTIDE MODELS FOR DNA BENDING PROPENSITY - COMPARISON OF MODELS BASED ON DNASEI DIGESTION AND NUCLEOSOME PACKAGING DATA [J].
BRUKNER, I ;
SANCHEZ, R ;
SUCK, D ;
PONGOR, S .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 1995, 13 (02) :309-317
[8]   DNA binding site selection by RNA polymerase II TAFs:: a TAFII250-TAFII150 complex recognizes the Initiator [J].
Chalkley, GE ;
Verrijzer, CP .
EMBO JOURNAL, 1999, 18 (17) :4835-4845
[9]   Nucleosomes are translationally positioned on the active allele and rotationally positioned on the inactive allele of the HPRT promoter [J].
Chen, C ;
Yang, TP .
MOLECULAR AND CELLULAR BIOLOGY, 2001, 21 (22) :7682-7695
[10]   Comprehensive analysis of transcriptional promoter structure and function in 1% of the human genome [J].
Cooper, SJ ;
Trinklein, ND ;
Anton, ED ;
Nguyen, L ;
Myers, RM .
GENOME RESEARCH, 2006, 16 (01) :1-10