Investigating single nucleotide polymorphism (SNP) density in the human genome and its implications for molecular evolution

被引:133
作者
Zhao, ZM
Fu, YX
Hewett-Emmett, D
Boerwinkle, E
机构
[1] Univ Texas, Hlth Sci Ctr, Ctr Human Genet, Houston, TX 77030 USA
[2] Univ Texas, Hlth Sci Ctr, Inst Mol Med, Houston, TX 77030 USA
关键词
Celera; RefSNP; CgsSNP; nonsense mutation; missense mutation; silent mutation; natural selection;
D O I
10.1016/S0378-1119(03)00670-X
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We investigated the single nucleotide polymorphism (SNP) density across the human genome and in different genic categories using two SNP databases: Celera's CgsSNP, which includes SNPs identified by comparing genomic sequences, and Celera's RefSNP, which includes SNPs from a variety of sources and is biased toward disease-associated genes. Based on CgsSNP, the average numbers of SNPs per 10 kb was 8.33, 8.44. and 8.09 in the human genome, in intergenic regions, and in genic regions, respectively. In genic regions, the SNP density in intronic, exonic and adjoining untranslated regions was 8.21, 5.28, and 7.51 SNPs per 10 kb, respectively. The pattern of SNP density based on RefSNP was different from that based on CgsSNP, emphasizing its utility for genotype-phenotype association studies but not for most population genetic studies. The number of SNPs per chromosome was correlated with chromosome length, but the density of SNPs estimated by CgsSNP was not significantly correlated with the GC content of the chromosome. Based on CgsSNP, the ratio of nonsense to missense mutations (0.027), the ratio of missense to silent mutations (1.15), and the ratio of non-synonymous to synonymous mutations (1.18) was less than half of that expected in a human protein coding sequence under the neutral mutation theory, reflecting a role for natural selection, especially purifying selection. (C) 2003 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:207 / 213
页数:7
相关论文
共 25 条
[1]  
[Anonymous], 1985, NEW SCI
[2]   Characterization of single-nucleotide polymorphisms in coding regions of human genes [J].
Cargill, M ;
Altshuler, D ;
Ireland, J ;
Sklar, P ;
Ardlie, K ;
Patil, N ;
Lane, CR ;
Lim, EP ;
Kalyanaraman, N ;
Nemesh, J ;
Ziaugra, L ;
Friedland, L ;
Rolfe, A ;
Warrington, J ;
Lipshutz, R ;
Daley, GQ ;
Lander, ES .
NATURE GENETICS, 1999, 22 (03) :231-238
[3]   High-resolution haplotype structure in the human genome [J].
Daly, MJ ;
Rioux, JD ;
Schaffner, SE ;
Hudson, TJ ;
Lander, ES .
NATURE GENETICS, 2001, 29 (02) :229-232
[4]   Assessment of the total number of human transcription units [J].
Das, M ;
Burge, CB ;
Park, E ;
Colinas, J ;
Pelletier, J .
GENOMICS, 2001, 77 (1-2) :71-78
[5]   High genomic deleterious mutation rates in hominids [J].
Eyre-Walker, A ;
Keightley, PD .
NATURE, 1999, 397 (6717) :344-347
[6]  
Fay JC, 2001, GENETICS, V158, P1227
[7]   Patterns of single-nucleotide polymorphisms in candidate genes for blood-pressure homeostasis [J].
Halushka, MK ;
Fan, JB ;
Bentley, K ;
Hsie, L ;
Shen, NP ;
Weder, A ;
Cooper, R ;
Lipshutz, R ;
Chakravarti, A .
NATURE GENETICS, 1999, 22 (03) :239-247
[8]   Population genomics: a bridge from evolutionary history to genetic medicine [J].
Jorde, LB ;
Watkins, WS ;
Bamshad, MJ .
HUMAN MOLECULAR GENETICS, 2001, 10 (20) :2199-2207
[9]   DNA sequence variation in a non-coding region of low recombination on the human X chromosome [J].
Kaessmann, H ;
Heissig, F ;
von Haeseler, A ;
Pääbo, S .
NATURE GENETICS, 1999, 22 (01) :78-81
[10]   The Celera Discovery System™ [J].
Kerlavage, A ;
Bonazzi, V ;
di Tommaso, M ;
Lawrence, C ;
Li, P ;
Mayberry, F ;
Mural, R ;
Nodell, M ;
Yandell, M ;
Zhang, JH ;
Thomas, P .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :129-136