Correcting Estimators of θ and Tajima's D for Ascertainment Biases Caused by the Single-Nucleotide Polymorphism Discovery Process

Times cited: 25
Authors
Ramirez-Soriano, Anna [1]
Nielsen, Rasmus [2, 3, 4]
Affiliations
[1] Univ Pompeu Fabra, Dept Ciencies Salut & Vida, Barcelona 08003, Catalonia, Spain
[2] Univ Calif Berkeley, Dept Integrat Biol, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[4] Univ Copenhagen, Dept Biol, DK-2100 Copenhagen O, Denmark
Funding
National Institutes of Health (USA)
Keywords
AUTOSOMAL RECESSIVE HYPERCHOLESTEROLEMIA; RECENT POSITIVE SELECTION; HUMAN GENOME; NATURAL-SELECTION; LINKAGE-DISEQUILIBRIUM; STATISTICAL PROPERTIES; DEMOGRAPHIC HISTORY; BALANCING SELECTION; FREQUENCY-SPECTRUM; POPULATION-GROWTH;
DOI
10.1534/genetics.108.094060
Chinese Library Classification
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
Most single-nucleotide polymorphism (SNP) data suffer from an ascertainment bias caused by the process of SNP discovery followed by SNP genotyping. The final genotyped data are biased toward an excess of common alleles compared to directly sequenced data, making standard genetic methods of analysis inapplicable to this type of data. We here derive corrected estimators of the fundamental population genetic parameter θ = 4N_eμ (N_e, effective population size; μ, mutation rate) on the basis of the average number of pairwise differences and on the basis of the number of segregating sites. We also derive the variances and covariances of these estimators and provide a corrected version of Tajima's D statistic. We reanalyze a human genomewide SNP data set and find substantial differences in the results with or without ascertainment bias correction.
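For orientation, the sketch below computes the standard, uncorrected quantities the abstract refers to: Watterson's estimator of θ from the number of segregating sites, the estimator from the average number of pairwise differences, and the classical Tajima's D. It does not implement the ascertainment-corrected estimators derived in the paper, which are not given in this record. The function names and the input format (a list of minor- or derived-allele counts, one per segregating site, in a sample of n chromosomes) are assumptions made for illustration.

```python
from math import sqrt

def watterson_theta(counts, n):
    """Uncorrected Watterson estimator: S / a1, with a1 = sum_{i=1}^{n-1} 1/i."""
    a1 = sum(1.0 / i for i in range(1, n))
    return len(counts) / a1

def pairwise_theta(counts, n):
    """Uncorrected average number of pairwise differences (pi).

    A site where k of n chromosomes carry one allele contributes
    k * (n - k) differing pairs out of n * (n - 1) / 2 pairs.
    """
    return sum(2.0 * k * (n - k) / (n * (n - 1)) for k in counts)

def tajimas_d(counts, n):
    """Classical Tajima's D (no ascertainment correction)."""
    s = len(counts)
    if s == 0 or n < 4:
        return float("nan")  # undefined without segregating sites or for tiny samples
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i**2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)
    # Difference between the two estimators, scaled by its estimated standard deviation
    return (pairwise_theta(counts, n) - s / a1) / sqrt(e1 * s + e2 * s * (s - 1))

# Hypothetical toy data: 3 segregating sites in a sample of 4 chromosomes
counts = [1, 1, 2]
print(watterson_theta(counts, 4), pairwise_theta(counts, 4), tajimas_d(counts, 4))
```

Because SNP discovery in a small panel preferentially retains common alleles, applying these uncorrected functions to genotyped SNP data would tend to inflate the pairwise-difference estimate relative to the segregating-sites estimate, which is the bias the abstract describes.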
Pages: 701-710
Page count: 10