Efficiency and consistency of haplotype tagging of dense SNP maps in multiple samples

Cited by: 42
Authors
Ke, XY
Durrant, C
Morris, AP
Hunt, S
Bentley, DR
Deloukas, P
Cardon, LR
Affiliations
[1] Univ Oxford, Wellcome Trust Ctr Human Genet, Oxford OX3 7BN, England
[2] Wellcome Trust Sanger Inst, Hinxton, England
DOI
10.1093/hmg/ddh294
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Haplotype tagging is a means of retaining most of the information in high-density marker maps while reducing genotyping requirements. Estimates of the numbers of tagging SNPs required to cover the human genome have varied widely, ranging from 100 000 to 1 000 000. Tagging has been applied to a number of gene-based datasets but has not been evaluated in contexts reflecting those of genome-wide association studies: large chromosome regions and multiple samples drawn from the same population. We analysed 5000 common markers across a 10 Mb segment of human chromosome 20 in three samples (UK Caucasian, CEPH Caucasian, African American) to evaluate tagging efficiency and consistency. Overall, the results indicate a high degree of efficiency, yielding 3-5-fold savings in Caucasians and 2-3-fold savings in African Americans. These levels varied according to linkage disequilibrium (LD) levels, tagging thresholds and allele frequencies, but in high LD regions they did not vary markedly due to marker density. However, a strong positive relationship between marker density and tagging was observed, relating to the fact that increasing marker density yields greater sequence coverage in high LD, thus requiring more tag SNPs to cover a greater fraction of the genome. Encouragingly, whatever the density employed, a high level of robustness was observed between UK and CEPH samples, as most of the htSNPs selected in one sample were also appropriate as tags in the other.
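To make "tagging threshold" and "fold savings" concrete, the following is a minimal sketch of greedy pairwise r² tagging on phased 0/1 haplotype data. It is an illustrative stand-in, not necessarily the htSNP procedure used in the paper. The function names, the 0.8 threshold and the toy data are all assumptions introduced here.

```python
from itertools import combinations


def r_squared(a, b):
    """Pairwise LD (r^2) between two biallelic SNPs coded 0/1 per haplotype."""
    n = len(a)
    pa = sum(a) / n
    pb = sum(b) / n
    if pa in (0.0, 1.0) or pb in (0.0, 1.0):
        return 0.0  # a monomorphic marker carries no LD information
    pab = sum(x & y for x, y in zip(a, b)) / n
    d = pab - pa * pb
    return d * d / (pa * (1 - pa) * pb * (1 - pb))


def greedy_tags(snps, threshold=0.8):
    """Pick tag SNPs so every SNP has r^2 >= threshold with some tag.

    snps: list of SNP columns, each a list of 0/1 alleles (hypothetical layout).
    The threshold default of 0.8 is an assumed value, not taken from the paper.
    """
    m = len(snps)
    covers = {i: {i} for i in range(m)}  # SNPs each candidate would tag
    for i, j in combinations(range(m), 2):
        if r_squared(snps[i], snps[j]) >= threshold:
            covers[i].add(j)
            covers[j].add(i)
    untagged = set(range(m))
    tags = []
    while untagged:
        # Take the untagged SNP covering the most untagged SNPs;
        # ties go to the lowest index.
        best = max(sorted(untagged), key=lambda i: len(covers[i] & untagged))
        tags.append(best)
        untagged -= covers[best]
    return tags


def fold_savings(n_markers, n_tags):
    """Genotyping saving expressed as markers typed per tag SNP."""
    return n_markers / n_tags


# Toy data: 5 SNPs observed on 8 haplotypes (made up for illustration).
example_snps = [
    [0, 0, 0, 0, 1, 1, 1, 1],
    [0, 0, 0, 0, 1, 1, 1, 1],
    [0, 0, 0, 1, 1, 1, 1, 1],
    [0, 1, 0, 1, 0, 1, 0, 1],
    [1, 0, 1, 0, 1, 0, 1, 0],
]

if __name__ == "__main__":
    tags = greedy_tags(example_snps)
    print(tags, fold_savings(len(example_snps), len(tags)))
```

Raising the threshold makes each tag cover fewer neighbours, so more tags are needed and the fold savings fall. This is one way to see why the reported efficiency depends on both LD levels and the tagging threshold.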
Pages: 2557-2565
Page count: 9