Selection of genetic markers for association analyses, using linkage disequilibrium and haplotypes

被引:105
作者
Meng, ZL
Zaykin, DV
Xu, CF
Wagner, M
Ehm, MG
机构
[1] N Carolina State Univ, Bioinformat Res Ctr, Raleigh, NC 27695 USA
[2] GlaxoSmithKline, Dept Populat Genet, Res Triangle Pk, NC USA
关键词
D O I
10.1086/376561
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The genotyping of closely spaced single-nucleotide polymorphism ( SNP) markers frequently yields highly correlated data, owing to extensive linkage disequilibrium (LD) between markers. The extent of LD varies widely across the genome and drives the number of frequent haplotypes observed in small regions. Several studies have illustrated the possibility that LD or haplotype data could be used to select a subset of SNPs that optimize the information retained in a genomic region while reducing the genotyping effort and simplifying the analysis. We propose a method based on the spectral decomposition of the matrices of pairwise LD between markers, and we select markers on the basis of their contributions to the total genetic variation. We also modify Clayton's "haplotype tagging SNP" selection method, which utilizes haplotype information. For both methods, we propose sliding window - based algorithms that allow the methods to be applied to large chromosomal regions. Our procedures require genotype information about a small number of individuals for an initial set of SNPs and selection of an optimum subset of SNPs that could be efficiently genotyped on larger numbers of samples while retaining most of the genetic variation in samples. We identify suitable parameter combinations for the procedures, and we show that a sample size of 50 - 100 individuals achieves consistent results in studies of simulated data sets in linkage equilibrium and LD. When applied to experimental data sets, both procedures were similarly effective at reducing the genotyping requirement while maintaining the genetic information content throughout the regions. We also show that haplotype-association results that Hosking et al. obtained near CYP2D6 were almost identical before and after marker selection.
引用
收藏
页码:115 / 130
页数:16
相关论文
共 16 条
[1]  
BENNETT JH, 1954, ANN EUGENIC, V18, P311
[2]  
CLAYTON D, 2001, CHOOSING SET HAPLOTY
[3]   Genomics - New mapping project splits the community [J].
Couzin, J .
SCIENCE, 2002, 296 (5572) :1391-+
[4]   High-resolution haplotype structure in the human genome [J].
Daly, MJ ;
Rioux, JD ;
Schaffner, SE ;
Hudson, TJ ;
Lander, ES .
NATURE GENETICS, 2001, 29 (02) :229-232
[5]   A first-generation linkage disequilibrium map of human chromosome 22 [J].
Dawson, E ;
Abecasis, GR ;
Bumpstead, S ;
Chen, Y ;
Hunt, S ;
Beare, DM ;
Pabial, J ;
Dibling, T ;
Tinsley, E ;
Kirby, S ;
Carter, D ;
Papaspyridonos, M ;
Livingstone, S ;
Ganske, R ;
Lohmmussaar, E ;
Zernant, J ;
Tonisson, N ;
Remm, M ;
Mägi, R ;
Puurand, T ;
Vilo, J ;
Kurg, A ;
Rice, K ;
Deloukas, P ;
Mott, R ;
Metspalu, A ;
Bentley, DR ;
Cardon, LR ;
Dunham, I .
NATURE, 2002, 418 (6897) :544-548
[6]   Genomewide search for type 2 diabetes susceptibility genes in four American populations [J].
Ehm, MG ;
Karnoub, MC ;
Sakul, H ;
Gottschalk, K ;
Holt, DC ;
Weber, JL ;
Vaske, D ;
Briley, D ;
Briley, L ;
McMillen, P ;
Nguyen, Q ;
Reisman, M ;
Lai, EH ;
Joslyn, G ;
Shepherd, NS ;
Bell, C ;
Wagner, MJ ;
Burns, DK .
AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 66 (06) :1871-1881
[7]   The structure of haplotype blocks in the human genome [J].
Gabriel, SB ;
Schaffner, SF ;
Nguyen, H ;
Moore, JM ;
Roy, J ;
Blumenstiel, B ;
Higgins, J ;
DeFelice, M ;
Lochner, A ;
Faggart, M ;
Liu-Cordero, SN ;
Rotimi, C ;
Adeyemo, A ;
Cooper, R ;
Ward, R ;
Lander, ES ;
Daly, MJ ;
Altshuler, D .
SCIENCE, 2002, 296 (5576) :2225-2229
[8]   Linkage disequilibrium mapping identifies a 390 kb region associated with CYP2D6 poor drug metabolising activity [J].
Hosking L.K. ;
Boyd P.R. ;
Xu C.F. ;
Nissum M. ;
Cantone K. ;
Purvis I.J. ;
Khakhar R. ;
Barnes M.R. ;
Liberwirth U. ;
Hagen-Mann K. ;
Ehm M.G. ;
Riley J.H. .
The Pharmacogenomics Journal, 2002, 2 (3) :165-175
[9]  
Jackson JE, 1991, A user's guide to principal components
[10]   Haplotype tagging for the identification of common disease genes [J].
Johnson, GCL ;
Esposito, L ;
Barratt, BJ ;
Smith, AN ;
Heward, J ;
Di Genova, G ;
Ueda, H ;
Cordell, HJ ;
Eaves, IA ;
Dudbridge, F ;
Twells, RCJ ;
Payne, F ;
Hughes, W ;
Nutland, S ;
Stevens, H ;
Carr, P ;
Tuomilehto-Wolf, E ;
Tuomilehto, J ;
Gough, SCL ;
Clayton, DG ;
Todd, JA .
NATURE GENETICS, 2001, 29 (02) :233-237