Sequence context at human single nucleotide polymorphisms: Overrepresentation of CpG dinucleotide at polymorphic sites and suppression of variation in CpG islands

Cited by: 40
Authors
Tomso, DJ [1]
Bell, DA [1]
Affiliations
[1] NIEHS, Lab Computat Biol & Risk Anal, Res Triangle Pk, NC 27709 USA
Keywords
CpG islands; single nucleotide polymorphisms; methylation; genomics; bioinformatics;
DOI
10.1016/S0022-2836(03)00120-7
Chinese Library Classification (CLC)
Q5 [Biochemistry]; Q7 [Molecular Biology];
Subject classification codes
071010; 081704;
Abstract
Human polymorphisms originate as mutations, and the influence of context on mutagenesis should be reflected in the distribution of sequences surrounding single nucleotide polymorphisms (SNPs). We have performed a computational survey of nearly two million human SNPs to determine if sequence-dependent hotspots for polymorphism exist in the human genome. Here we show that sequences containing CpG dinucleotides, which occur at low frequencies in the human genome, are 6.7-fold more abundant at polymorphic sites than expected. In contrast, polymorphisms in CpG sequences located within CpG islands, important regulatory regions that modulate gene expression, are 6.8-fold less prevalent than expected. The distribution of polymorphic alleles at CpGs in CpG islands is also significantly different from that in non-island regions. These data strongly support a role for 5-methylcytosine deamination in the generation of human variation, and suggest that variation at CpGs in islands is suppressed. (C) 2003 Elsevier Science Ltd. All rights reserved.
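The fold figures in the abstract compare how often a sequence context is observed at polymorphic sites with how often it would be expected from its background frequency. The record does not describe the authors' actual pipeline, so the following is only a minimal sketch of that kind of observed/expected ratio. The function names `dinucleotide_freqs` and `fold_enrichment` and the toy sequences are hypothetical and are not taken from the paper.

```python
from collections import Counter


def dinucleotide_freqs(seq):
    """Return the relative frequency of each overlapping dinucleotide in seq.

    Pairs containing anything other than A, C, G or T (for example N) are skipped.
    """
    seq = seq.upper()
    pairs = (seq[i:i + 2] for i in range(len(seq) - 1))
    counts = Counter(p for p in pairs if set(p) <= set("ACGT"))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


def fold_enrichment(observed_fraction, expected_fraction):
    """Ratio of the observed to the expected fraction (values > 1 mean overrepresentation)."""
    if expected_fraction <= 0:
        raise ValueError("expected_fraction must be positive")
    return observed_fraction / expected_fraction


if __name__ == "__main__":
    # Toy data only: a made-up background sequence and made-up SNP-site contexts.
    background = "ATATTTAACGATTTAATTAA"
    snp_contexts = ["CG", "CG", "TA", "CG", "AT"]

    expected = dinucleotide_freqs(background).get("CG", 0.0)
    observed = snp_contexts.count("CG") / len(snp_contexts)
    print(f"CpG fold enrichment at SNP sites: {fold_enrichment(observed, expected):.1f}")
```

A ratio below 1 computed the same way would correspond to the kind of underrepresentation the abstract reports for CpGs inside CpG islands.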
Pages: 303-308
Page count: 6