SNPs, haplotypes, and model selection in a candidate gene region: The SIMPle analysis for multilocus data

被引:33
作者
Conti, DV [1 ]
Gauderman, WJ [1 ]
机构
[1] Univ So Calif, Dept Prevent Med, Los Angeles, CA 90033 USA
关键词
genotypes; haplotypes; SNPs; Bayes model averaging; association analysis;
D O I
10.1002/gepi.20039
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Modern molecular techniques make discovery of numerous single nucleotide polymorphims (SNPs) in candidate gene regions feasible. Conventional analysis relies on either independent tests with each variant or the use of haplotypes in association analysis. The first technique ignores the dependencies between SNPs. The second, though it may increase power, often introduces uncertainty by estimating haplotypes from population data. Additionally, as the number of loci expands for a haplotype, ambiguity in interpretation increases for determining the underlying genetic components driving a detected association. Here, we present a genotype-level analysis to jointly model the SNPs via a SNP interaction model with phase information (SIMPle) to capture the underlying haplotype structure. This analysis estimates both the risk associated with each variant and the importance of phase between pairwise combinations of SNPs. Thus, rather than selecting between genotype- or haplotype-level approaches, the SIMPle method frames the analysis of multilocus data in a model selection paradigm, the aim to determine which SNPs, phase terms, and linear combinations best describe the relation between genetic variation and a trait of interest. To avoid unstable estimation due to sparse data and to incorporate both the dependencies among terms and the uncertainty in model selection, we propose a Bayes model averaging procedure. This highlights key SNPs and phase terms and yields a set of best representative models. Using simulations, we demonstrate the utility of the SIMPle model to identify crucial SNPs and underlying haplotype structures across a variety of causal models and genetic architectures. Genet. Epidemiol. (C) 2004 Wiley-Liss, Inc.
引用
收藏
页码:429 / 441
页数:13
相关论文
共 33 条
[1]  
[Anonymous], ARTITICIAL INTELLIGE
[2]  
Ayres KL, 2001, GENETICS, V157, P413
[3]   The relative power of SNPs and haplotype as genetic markers for association tests [J].
Bader, JS .
PHARMACOGENOMICS, 2001, 2 (01) :11-24
[4]   Detecting disease associations due to linkage disequilibrium using haplotype tags: A class of tests and the determinants of statistical power [J].
Chapman, JM ;
Cooper, JD ;
Todd, JA ;
Clayton, DG .
HUMAN HEREDITY, 2003, 56 (1-3) :18-31
[5]   Bayesian variable selection with related predictors [J].
Chipman, H .
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 1996, 24 (01) :17-36
[6]   Bayesian modeling of complex metabolic pathways [J].
Conti, DV ;
Cortessis, V ;
Molitor, J ;
Thomas, DC .
HUMAN HEREDITY, 2003, 56 (1-3) :83-93
[7]   Hierarchical modeling of linkage disequilibrum: Genetic structure and spatial relations [J].
Conti, DV ;
Witte, JS .
AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 72 (02) :351-363
[8]   A unified stepwise regression procedure for evaluating the relative effects of polymorphisms within a gene using case/control or family data:: Application to HLA in type 1 diabetes [J].
Cordell, HJ ;
Clayton, DG .
AMERICAN JOURNAL OF HUMAN GENETICS, 2002, 70 (01) :124-141
[9]   Analysis of multilocus models of association [J].
Devlin, B ;
Roeder, K ;
Wasserman, L .
GENETIC EPIDEMIOLOGY, 2003, 25 (01) :36-47
[10]  
EXCOFFIER L, 1995, MOL BIOL EVOL, V12, P921