Genetic association mapping via evolution-based clustering of haplotypes

被引:29
作者
Tachmazidou, Ioanna [1 ]
Verzilli, Claudio J.
De Iorio, Maria
机构
[1] ImperialColl Med, Dept Epidemiol & Publ Hlth, London, England
[2] London Sch Hyg & Trop Med, Dept Epidemiol & Populat Hlth, London, England
基金
英国惠康基金;
关键词
D O I
10.1371/journal.pgen.0030111
中图分类号
Q3 [遗传学];
学科分类号
071007 [遗传学]; 090102 [作物遗传育种];
摘要
Multilocus analysis of single nucleotide polymorphism haplotypes is a promising approach to dissecting the genetic basis of complex diseases. We propose a coalescent-based model for association mapping that potentially increases the power to detect disease-susceptibility variants in genetic association studies. The approach uses Bayesian partition modelling to cluster haplotypes with similar disease risks by exploiting evolutionary information. We focus on candidate gene regions with densely spaced markers and model chromosomal segments in high linkage disequilibrium therein assuming a perfect phylogeny. To make this assumption more realistic, we split the chromosomal region of interest into sub-regions or windows of high linkage disequilibrium. The haplotype space is then partitioned into disjoint clusters, within which the phenotype-haplotype association is assumed to be the same. For example, in case-control studies, we expect chromosomal segments bearing the causal variant on a common ancestral background to be more frequent among cases than controls, giving rise to two separate haplotype clusters. The novelty of our approach arises from the fact that the distance used for clustering haplotypes has an evolutionary interpretation, as haplotypes are clustered according to the time to their most recent common ancestor. Our approach is fully Bayesian and we develop a Markov Chain Monte Carlo algorithm to sample efficiently over the space of possible partitions. We compare the proposed approach to both single-marker analyses and recently proposed multi-marker methods and show that the Bayesian partition modelling performs similarly in localizing the causal allele while yielding lower false-positive rates. Also, the method is computationally quicker than other multi-marker approaches. We present an application to real genotype data from the CYP2D6 gene region, which has a confirmed role in drug metabolism, where we succeed in mapping the location of the susceptibility variant within a small error.
引用
收藏
页码:1163 / 1177
页数:15
相关论文
共 37 条
[1]
Use of unphased multilocus genotype data in indirect association studies [J].
Clayton, D ;
Chapman, J ;
Cooper, J .
GENETIC EPIDEMIOLOGY, 2004, 27 (04) :415-428
[2]
High-resolution haplotype structure in the human genome [J].
Daly, MJ ;
Rioux, JD ;
Schaffner, SE ;
Hudson, TJ ;
Lander, ES .
NATURE GENETICS, 2001, 29 (02) :229-232
[3]
Bayesian partitioning for estimating disease risk [J].
Denison, DGT ;
Holmes, CC .
BIOMETRICS, 2001, 57 (01) :143-149
[4]
Linkage disequilibrium mapping via cladistic analysis of single-nucleotide polymorphism haplotypes [J].
Durrant, C ;
Zondervan, KT ;
Cardon, LR ;
Hunt, S ;
Deloukas, P ;
Morris, AP .
AMERICAN JOURNAL OF HUMAN GENETICS, 2004, 75 (01) :35-43
[5]
The structure of haplotype blocks in the human genome [J].
Gabriel, SB ;
Schaffner, SF ;
Nguyen, H ;
Moore, JM ;
Roy, J ;
Blumenstiel, B ;
Higgins, J ;
DeFelice, M ;
Lochner, A ;
Faggart, M ;
Liu-Cordero, SN ;
Rotimi, C ;
Adeyemo, A ;
Cooper, R ;
Ward, R ;
Lander, ES ;
Daly, MJ ;
Altshuler, D .
SCIENCE, 2002, 296 (5576) :2225-2229
[6]
Griffiths RC, 2001, NATO SCI S A LIF SCI, V310, P137
[7]
EFFICIENT ALGORITHMS FOR INFERRING EVOLUTIONARY TREES [J].
GUSFIELD, D .
NETWORKS, 1991, 21 (01) :19-28
[8]
HOGGART C, 2005, FREGENE SOFTWARE SIM
[9]
Linkage disequilibrium mapping identifies a 390 kb region associated with CYP2D6 poor drug metabolising activity [J].
Hosking L.K. ;
Boyd P.R. ;
Xu C.F. ;
Nissum M. ;
Cantone K. ;
Purvis I.J. ;
Khakhar R. ;
Barnes M.R. ;
Liberwirth U. ;
Hagen-Mann K. ;
Ehm M.G. ;
Riley J.H. .
The Pharmacogenomics Journal, 2002, 2 (3) :165-175
[10]
HUDSON RR, 1985, GENETICS, V111, P147