The impact of missing and erroneous genotypes on tagging SNP selection and power of subsequent association tests

被引:13
作者
Liu, WL [1 ]
Zhao, W [1 ]
Chase, GA [1 ]
机构
[1] Penn State Coll Med, Div Biostat, Dept Hlth Evaluat Sci, Hershey, PA 17033 USA
关键词
tagging SNPs; missing genotype; genotyping error; haplotype; power;
D O I
10.1159/000092141
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Objective: Single nucleotide polymorphisms (SNPs) serve as effective markers for localizing disease susceptibility genes, but current genotyping technologies are inadequate for genotyping all available SNP markers in a typical linkage/association study. Much attention has recently been paid to methods for selecting the minimal informative subset of SNPs in identifying haplotypes, but there has been little investigation of the effect of missing or erroneous genotypes on the performance of these SNP selection algorithms and subsequent association tests using the selected tagging SNPs. The purpose of this study is to explore the effect of missing genotype or genotyping error on tagging SNP selection and subsequent single marker and haplotype association tests using the selected tagging SNPs. Methods: Through two sets of simulations, we evaluated the performance of three tagging SNP selection programs in the presence of missing or erroneous genotypes: Clayton's diversity based program htstep, Carlson's linkage disequilibrium (LID) based program IdSelect, and Stram's coefficient of determination based program tagsnp.exe. Results: When randomly selected known loci were relabeled as 'missing', we found that the average number of tagging SNPs selected by all three algorithms changed very little and the power of subsequent single marker and haplotype association tests using the selected tagging SNPs remained close to the power of these tests in the absence of missing genotype. When random genotyping errors were introduced, we found that the average number of tagging SNPs selected by all three algorithms increased. In data sets simulated according to the haplotype frequecies in the CYP19 region, Stram's program had larger increase than Carlson's and Clayton's programs. In data sets simulated under the coalescent model, Carlson's program had the largest increase and Clayton's program had the smallest increase. In both sets of simulations, with the presence of genotyping errors, the power of the haplotypetests from all three programs decreased quickly, but there was not much reduction in power of the single marker tests. Conclusions: Missing genotypes do not seem to have much impact on tagging SNP selection and subsequent single marker and haplotype association tests. In contrast, genotyping errors could have severe impact on tagging SNP selection and haplotype tests, but not on single marker tests. Copyright (c) 2006 S. Karger AG, Basel.
引用
收藏
页码:31 / 44
页数:14
相关论文
共 31 条
[1]   The impact of genotyping error on family-based analysis of quantitative traits [J].
Abecasis, GR ;
Cherny, SS ;
Cardon, LR .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2001, 9 (02) :130-134
[2]   The effect that genotyping errors have on the robustness of common linkage-disequilibrium measures [J].
Akey, JM ;
Zhang, K ;
Xiong, MM ;
Doris, P ;
Jin, L .
AMERICAN JOURNAL OF HUMAN GENETICS, 2001, 68 (06) :1447-1456
[3]   A haplotype map of the human genome [J].
Altshuler, D ;
Brooks, LD ;
Chakravarti, A ;
Collins, FS ;
Daly, MJ ;
Donnelly, P ;
Gibbs, RA ;
Belmont, JW ;
Boudreau, A ;
Leal, SM ;
Hardenbol, P ;
Pasternak, S ;
Wheeler, DA ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Zeng, CQ ;
Gao, Y ;
Hu, HR ;
Hu, WT ;
Li, CH ;
Lin, W ;
Liu, SQ ;
Pan, H ;
Tang, XL ;
Wang, J ;
Wang, W ;
Yu, J ;
Zhang, B ;
Zhang, QR ;
Zhao, HB ;
Zhao, H ;
Zhou, J ;
Gabriel, SB ;
Barry, R ;
Blumenstiel, B ;
Camargo, A ;
Defelice, M ;
Faggart, M ;
Goyette, M ;
Gupta, S ;
Moore, J ;
Nguyen, H ;
Onofrio, RC ;
Parkin, M ;
Roy, J ;
Stahl, E ;
Winchester, E ;
Ziaugra, L ;
Shen, Y .
NATURE, 2005, 437 (7063) :1299-1320
[4]   Beyond Mendel: An evolving view of human genetic disease transmission [J].
Badano, JL ;
Katsanis, N .
NATURE REVIEWS GENETICS, 2002, 3 (10) :779-789
[5]   Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium [J].
Carlson, CS ;
Eberle, MA ;
Rieder, MJ ;
Yi, Q ;
Kruglyak, L ;
Nickerson, DA .
AMERICAN JOURNAL OF HUMAN GENETICS, 2004, 74 (01) :106-120
[6]   A multipoint method for detecting genotyping errors and mutations in sibling-pair linkage data [J].
Douglas, JA ;
Boehnke, M ;
Lange, K .
AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 66 (04) :1287-1297
[7]  
EXCOFFIER L, 1995, MOL BIOL EVOL, V12, P921
[8]   The structure of haplotype blocks in the human genome [J].
Gabriel, SB ;
Schaffner, SF ;
Nguyen, H ;
Moore, JM ;
Roy, J ;
Blumenstiel, B ;
Higgins, J ;
DeFelice, M ;
Lochner, A ;
Faggart, M ;
Liu-Cordero, SN ;
Rotimi, C ;
Adeyemo, A ;
Cooper, R ;
Ward, R ;
Lander, ES ;
Daly, MJ ;
Altshuler, D .
SCIENCE, 2002, 296 (5576) :2225-2229
[9]   The International HapMap Project [J].
Gibbs, RA ;
Belmont, JW ;
Hardenbol, P ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Ch'ang, LY ;
Huang, W ;
Liu, B ;
Shen, Y ;
Tam, PKH ;
Tsui, LC ;
Waye, MMY ;
Wong, JTF ;
Zeng, CQ ;
Zhang, QR ;
Chee, MS ;
Galver, LM ;
Kruglyak, S ;
Murray, SS ;
Oliphant, AR ;
Montpetit, A ;
Hudson, TJ ;
Chagnon, F ;
Ferretti, V ;
Leboeuf, M ;
Phillips, MS ;
Verner, A ;
Kwok, PY ;
Duan, SH ;
Lind, DL ;
Miller, RD ;
Rice, JP ;
Saccone, NL ;
Taillon-Miller, P ;
Xiao, M ;
Nakamura, Y ;
Sekine, A ;
Sorimachi, K ;
Tanaka, T ;
Tanaka, Y ;
Tsunoda, T ;
Yoshino, E ;
Bentley, DR ;
Deloukas, P ;
Hunt, S ;
Powell, D ;
Altshuler, D ;
Gabriel, SB ;
Qiu, RZ .
NATURE, 2003, 426 (6968) :789-796
[10]   Power and sample size calculations for case-control genetic association tests when errors are present: Application to single nucleotide polymorphisms [J].
Gordon, D ;
Finch, SJ ;
Nothnagel, M ;
Ott, J .
HUMAN HEREDITY, 2002, 54 (01) :22-33