A comparison of tagging methods and their tagging space

Cited by: 24
Authors
Ke, XY
Miretti, MM
Broxholme, J
Hunt, S
Beck, S
Bentley, DR
Deloukas, P
Cardon, LR
Affiliations
[1] Univ Oxford, Wellcome Trust Ctr Human Genet, Oxford OX3 7BN, England
[2] Wellcome Trust Sanger Inst, Hinxton, England
DOI
10.1093/hmg/ddi309
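Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 [Biochemistry and Molecular Biology]; 081704 [Applied Chemistry];
Abstract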
Single-nucleotide polymorphism (SNP) tagging is widely used to reduce genotyping costs in association studies. A number of tagging methods have been developed to reduce the number of markers to be genotyped while maintaining power to detect effects at non-assayed SNPs. How these methods perform in different settings, how far they overlap and share common tags, and how they differ are important questions. We investigated these questions by comparing three widely used tagging methods/algorithms: one haplotype r²-based method, one pair-wise r²-based method, and one method based on haplotype diversity but focused on major haplotypes. Tagging efficiency was defined as the number of genotyped markers divided by the number of tagging SNPs. Tagging effectiveness was defined as the proportion of un-genotyped or 'hidden' SNPs detected (those having a pair-wise or haplotype r² with a set of tagging SNPs over a threshold, e.g. haplotype r² ≥ 0.80). The ENCODE regions genotyped in the HapMap CEPH individuals were examined in this study. Tagging effectiveness was generally poorer for rare SNPs than for common SNPs for all three tagging methods. Including rare SNPs in the initial HapMap scheme could enhance the performance of tags on rare hidden SNPs, at the expense of increased genotyping cost. At a moderate tagging efficiency, more than 90% of hidden SNPs detected by tagging SNPs selected by one method were also detected by tagging SNPs selected by another method, and this figure could be increased to 100% if tagging efficiency was allowed to drop. These results indicate that the tagging space is highly concordant between different tagging methods, even though the methods often select different sets of tagging SNPs.
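The two metrics defined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it covers only the pair-wise r² case (not haplotype r²), assumes phased haplotypes with each SNP coded 0/1, and uses hypothetical function names and toy data chosen purely for illustration.

```python
def pairwise_r2(a, b):
    """Squared correlation (r^2) between two biallelic SNPs.

    a, b: equal-length lists of 0/1 alleles across phased haplotypes
    (an assumed input format, not taken from the paper).
    """
    n = len(a)
    pa = sum(a) / n                                  # allele frequency at SNP a
    pb = sum(b) / n                                  # allele frequency at SNP b
    pab = sum(x * y for x, y in zip(a, b)) / n       # frequency of the 1-1 haplotype
    denom = pa * (1 - pa) * pb * (1 - pb)
    if denom == 0:
        return 0.0  # monomorphic SNP: r^2 is undefined, treated as 0 in this sketch
    d = pab - pa * pb                                # linkage disequilibrium coefficient D
    return d * d / denom


def tagging_efficiency(n_genotyped, n_tags):
    """Number of genotyped markers divided by the number of tagging SNPs."""
    return n_genotyped / n_tags


def tagging_effectiveness(hidden, tags, threshold=0.8):
    """Proportion of hidden SNPs with pair-wise r^2 >= threshold to at least one tag."""
    detected = sum(
        1 for h in hidden
        if any(pairwise_r2(h, t) >= threshold for t in tags)
    )
    return detected / len(hidden)


# Toy data: 8 phased haplotypes (hypothetical values).
tag = [0, 0, 0, 0, 1, 1, 1, 1]
hidden_linked = [0, 0, 0, 0, 1, 1, 1, 1]    # in complete LD with the tag
hidden_unlinked = [0, 1, 0, 1, 0, 1, 0, 1]  # independent of the tag

effectiveness = tagging_effectiveness([hidden_linked, hidden_unlinked], [tag])
efficiency = tagging_efficiency(n_genotyped=10, n_tags=2)
```

In this toy case one of the two hidden SNPs reaches the 0.80 threshold, so effectiveness is 0.5, and 10 genotyped markers represented by 2 tags gives an efficiency of 5.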
Pages: 2757-2767
Page count: 11