Prevalence of quadruplexes in the human genome

Cited by: 1400
Authors
Huppert, JL [1]
Balasubramanian, S [1]
Affiliations
[1] Univ Cambridge, Chem Lab, Cambridge CB2 1EW, England
DOI
10.1093/nar/gki609
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Guanine-rich DNA sequences of a particular form have the ability to fold into four-stranded structures called G-quadruplexes. In this paper, we present a working rule to predict which primary sequences can form this structure, and describe a search algorithm to identify such sequences in genomic DNA. We count the number of quadruplexes found in the human genome and compare that with the figure predicted by modelling DNA as a Bernoulli stream or as a Markov chain, using windows of various sizes. We demonstrate that the distribution of loop lengths is significantly different from what would be expected in a random case, providing an indication of the number of potentially relevant quadruplex-forming sequences. In particular, we show that there is a significant repression of quadruplexes in the coding strand of exonic regions, which suggests that quadruplex-forming patterns are disfavoured in sequences that will form RNA.
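The abstract does not spell out the working rule or the search algorithm. As a rough illustration only, the sketch below assumes the pattern commonly attributed to this paper: four runs of at least three guanines separated by three loops of 1 to 7 bases. The names `G4_PATTERN`, `reverse_complement`, and `find_candidates` are hypothetical, and the non-overlapping regex scan is a simplification, not the authors' counting procedure.

```python
import re

# Assumed folding rule (not stated in the abstract): G3+ N1-7 G3+ N1-7 G3+ N1-7 G3+
G4_PATTERN = re.compile(r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}")

# Translation table for complementing A/C/G/T.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an A/C/G/T sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_candidates(seq: str) -> list[tuple[int, str]]:
    """Return (start, matched_sequence) for non-overlapping matches on one strand."""
    seq = seq.upper()
    return [(m.start(), m.group(0)) for m in G4_PATTERN.finditer(seq)]


if __name__ == "__main__":
    forward = "TTGGGAGGGAGGGAGGGTT"
    print(find_candidates(forward))
    # A C-rich forward strand implies a G-rich pattern on the opposite strand.
    print(find_candidates(reverse_complement("CCCTCCCTCCCTCCC")))
```

Scanning both the sequence and its reverse complement covers both strands. Because `finditer` reports non-overlapping matches, overlapping candidates within a long G-rich tract are not enumerated separately here.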
Pages: 2908-2916
Number of pages: 9