Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium

被引:1308
作者
Carlson, CS
Eberle, MA
Rieder, MJ
Yi, Q
Kruglyak, L
Nickerson, DA
机构
[1] Univ Washington, Med Ctr, Dept Genome Sci, Seattle, WA 98195 USA
[2] Fred Hutchinson Canc Res Ctr, Div Human Biol, Seattle, WA 98104 USA
[3] Howard Hughes Med Inst, Seattle, WA USA
关键词
D O I
10.1086/381000
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Common genetic polymorphisms may explain a portion of the heritable risk for common diseases. Within candidate genes, the number of common polymorphisms is finite, but direct assay of all existing common polymorphism is inefficient, because genotypes at many of these sites are strongly correlated. Thus, it is not necessary to assay all common variants if the patterns of allelic association between common variants can be described. We have developed an algorithm to select the maximally informative set of common single-nucleotide polymorphisms (tagSNPs) to assay in candidate-gene association studies, such that all known common polymorphisms either are directly assayed or exceed a threshold level of association with a tagSNP. The algorithm is based on the r(2) linkage disequilibrium (LD) statistic, because r(2) is directly related to statistical power to detect disease associations with unassayed sites. We show that, at a relatively stringent r(2) threshold (r(2) > 0.8), the LD-selected tagSNPs resolve >80% of all haplotypes across a set of 100 candidate genes, regardless of recombination, and tag specific haplotypes and clades of related haplotypes in nonrecombinant regions. Thus, if the patterns of common variation are described for a candidate gene, analysis of the tagSNP set can comprehensively interrogate for main effects from common functional variation. We demonstrate that, although common variation tends to be shared between populations, tagSNPs should be selected separately for populations with different ancestries.
引用
收藏
页码:106 / 120
页数:15
相关论文
共 41 条
[21]   Variation is the spice of life [J].
Kruglyak, L ;
Nickerson, DA .
NATURE GENETICS, 2001, 27 (03) :234-236
[22]   Initial sequencing and analysis of the human genome [J].
Lander, ES ;
Int Human Genome Sequencing Consortium ;
Linton, LM ;
Birren, B ;
Nusbaum, C ;
Zody, MC ;
Baldwin, J ;
Devon, K ;
Dewar, K ;
Doyle, M ;
FitzHugh, W ;
Funke, R ;
Gage, D ;
Harris, K ;
Heaford, A ;
Howland, J ;
Kann, L ;
Lehoczky, J ;
LeVine, R ;
McEwan, P ;
McKernan, K ;
Meldrim, J ;
Mesirov, JP ;
Miranda, C ;
Morris, W ;
Naylor, J ;
Raymond, C ;
Rosetti, M ;
Santos, R ;
Sheridan, A ;
Sougnez, C ;
Stange-Thomann, N ;
Stojanovic, N ;
Subramanian, A ;
Wyman, D ;
Rogers, J ;
Sulston, J ;
Ainscough, R ;
Beck, S ;
Bentley, D ;
Burton, J ;
Clee, C ;
Carter, N ;
Coulson, A ;
Deadman, R ;
Deloukas, P ;
Dunham, A ;
Dunham, I ;
Durbin, R ;
French, L .
NATURE, 2001, 409 (6822) :860-921
[23]   Allelic discrimination using fluorogenic probes and the 5′ nuclease assay [J].
Livak, KJ .
GENETIC ANALYSIS-BIOMOLECULAR ENGINEERING, 1999, 14 (5-6) :143-149
[24]   Selection of genetic markers for association analyses, using linkage disequilibrium and haplotypes [J].
Meng, ZL ;
Zaykin, DV ;
Xu, CF ;
Wagner, M ;
Ehm, MG .
AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (01) :115-130
[25]   Sequence diversity and large-scale typing of SNPs in the human apolipoprotein E gene [J].
Nickerson, DA ;
Taylor, SL ;
Fullerton, SM ;
Weiss, KM ;
Clark, AG ;
Stengård, JH ;
Salomaa, V ;
Boerwinkle, E ;
Sing, CF .
GENOME RESEARCH, 2000, 10 (10) :1532-1545
[26]   PolyPhred: Automating the detection and genotyping of single nucleotide substitutions using fluorescence-based resequencing [J].
Nickerson, DA ;
Tobe, VO ;
Taylor, SL .
NUCLEIC ACIDS RESEARCH, 1997, 25 (14) :2745-2751
[27]   Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21 [J].
Patil, N ;
Berno, AJ ;
Hinds, DA ;
Barrett, WA ;
Doshi, JM ;
Hacker, CR ;
Kautzer, CR ;
Lee, DH ;
Marjoribanks, C ;
McDonough, DP ;
Nguyen, BTN ;
Norris, MC ;
Sheehan, JB ;
Shen, NP ;
Stern, D ;
Stokowski, RP ;
Thomas, DJ ;
Trulson, MO ;
Vyas, KR ;
Frazer, KA ;
Fodor, SPA ;
Cox, DR .
SCIENCE, 2001, 294 (5547) :1719-1723
[28]   Linkage disequilibrium in humans: Models and data [J].
Pritchard, JK ;
Przeworski, M .
AMERICAN JOURNAL OF HUMAN GENETICS, 2001, 69 (01) :1-14
[29]   Sequence variation in the human angiotensin converting enzyme [J].
Rieder, MJ ;
Taylor, SL ;
Clark, AG ;
Nickerson, DA .
NATURE GENETICS, 1999, 22 (01) :59-62
[30]   A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms [J].
Sachidanandam, R ;
Weissman, D ;
Schmidt, SC ;
Kakol, JM ;
Stein, LD ;
Marth, G ;
Sherry, S ;
Mullikin, JC ;
Mortimore, BJ ;
Willey, DL ;
Hunt, SE ;
Cole, CG ;
Coggill, PC ;
Rice, CM ;
Ning, ZM ;
Rogers, J ;
Bentley, DR ;
Kwok, PY ;
Mardis, ER ;
Yeh, RT ;
Schultz, B ;
Cook, L ;
Davenport, R ;
Dante, M ;
Fulton, L ;
Hillier, L ;
Waterston, RH ;
McPherson, JD ;
Gilman, B ;
Schaffner, S ;
Van Etten, WJ ;
Reich, D ;
Higgins, J ;
Daly, MJ ;
Blumenstiel, B ;
Baldwin, J ;
Stange-Thomann, NS ;
Zody, MC ;
Linton, L ;
Lander, ES ;
Altshuler, D .
NATURE, 2001, 409 (6822) :928-933