Mutations of Different Molecular Origins Exhibit Contrasting Patterns of Regional Substitution Rate Variation

被引:69
作者
Elango, Navin [1 ]
Kim, Seong-Ho [1 ]
Vigoda, Eric [3 ]
Yi, Soojin V. [1 ]
机构
[1] Georgia Inst Technol, Sch Biol, Atlanta, GA 30332 USA
[2] NHGRI, NIH Intramural Sequencing Ctr, NIH, Bethesda, MD 20892 USA
[3] Georgia Inst Technol, Coll Comp, Atlanta, GA 30332 USA
基金
美国国家卫生研究院;
关键词
D O I
10.1371/journal.pcbi.1000015
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Transitions at CpG dinucleotides, referred to as "CpG substitutions", are a major mutational input into vertebrate genomes and a leading cause of human genetic disease. The prevalence of CpG substitutions is due to their mutational origin, which is dependent on DNA methylation. In comparison, other single nucleotide substitutions (for example those occurring at GpC dinucleotides) mainly arise from errors during DNA replication. Here we analyzed high quality BAC-based data from human, chimpanzee, and baboon to investigate regional variation of CpG substitution rates. We show that CpG substitutions occur approximately 15 times more frequently than other single nucleotide substitutions in primate genomes, and that they exhibit substantial regional variation. Patterns of CpG rate variation are consistent with differences in methylation level and susceptibility to subsequent deamination. In particular, we propose a "distance-decaying" hypothesis, positing that due to the molecular mechanism of a CpG substitution, rates are correlated with the stability of double-stranded DNA surrounding each CpG dinucleotide, and the effect of local DNA stability may decrease with distance from the CpG dinucleotide. Consistent with our "distance-decaying" hypothesis, rates of CpG substitution are strongly (negatively) correlated with regional G+C content. The influence of G+C content decays as the distance from the target CpG site increases. We estimate that the influence of local G+C content extends up to 1,500,2,000 bps centered on each CpG site. We also show that the distance-decaying relationship persisted when we controlled for the effect of long-range homogeneity of nucleotide composition. GpC sites, in contrast, do not exhibit such "distance-decaying" relationship. Our results highlight an example of the distinctive properties of methylation-dependent substitutions versus substitutions mostly arising from errors during DNA replication. Furthermore, the negative relationship between G+C content and CpG rates may provide an explanation for the observation that GC-rich SINEs show lower CpG rates than other repetitive elements.
引用
收藏
页数:10
相关论文
共 47 条
[1]  
ARNDT P, 2005, J MOL EVOL, V60, P1
[2]   Distinct changes of genomic biases in nucleotide substitution at the time of mammalian radiation [J].
Arndt, PF ;
Petrov, DA ;
Hwa, T .
MOLECULAR BIOLOGY AND EVOLUTION, 2003, 20 (11) :1887-1896
[3]   GenBank [J].
Benson, Dennis A. ;
Karsch-Mizrachi, Ilene ;
Lipman, David J. ;
Ostell, James ;
Wheeler, David L. .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D16-D20
[4]   The compositional evolution of vertebrate genomes [J].
Bernardi, G .
GENE, 2000, 259 (1-2) :31-43
[5]   DNA METHYLATION AND THE FREQUENCY OF CPG IN ANIMAL DNA [J].
BIRD, AP .
NUCLEIC ACIDS RESEARCH, 1980, 8 (07) :1499-1504
[6]   CPG-RICH ISLANDS AND THE FUNCTION OF DNA METHYLATION [J].
BIRD, AP .
NATURE, 1986, 321 (6067) :209-213
[7]   Reconstructing large regions of an ancestral mammalian genome in silico [J].
Blanchette, M ;
Green, ED ;
Miller, W ;
Haussler, D .
GENOME RESEARCH, 2004, 14 (12) :2412-2423
[8]   Mutation pattern variation among regions of the primate genome [J].
Casane, D ;
Boissinot, S ;
Chang, BHJ ;
Shimmin, LC ;
Li, WH .
JOURNAL OF MOLECULAR EVOLUTION, 1997, 45 (03) :216-226
[9]   SPECIFIC ALU BINDING-PROTEIN FROM HUMAN SPERM CHROMATIN PREVENTS DNA METHYLATION [J].
CHESNOKOV, IN ;
SCHMID, CW .
JOURNAL OF BIOLOGICAL CHEMISTRY, 1995, 270 (31) :18539-18542
[10]   The GC content of primates and rodents genomes is not at equilibrium: A reply to Antezana [J].
Duret, Laurent .
JOURNAL OF MOLECULAR EVOLUTION, 2006, 62 (06) :803-806