Estimating haplotype-disease associations with pooled genotype data

被引:25
作者
Zeng, D [1 ]
Lin, DY [1 ]
机构
[1] Univ N Carolina, Dept Biostat, Chapel Hill, NC 27599 USA
关键词
association tests; case-control studies; cohort studies; DNA pooling; EM algorithm; gene-environment interactions; haplotype analysis; Hardy-Weinberg equilibrium; linkage disequilibrium; maximum likelihood; pooled DNA samples; SNPs;
D O I
10.1002/gepi.20040
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The genetic dissection of complex human diseases requires large-scale association studies which explore the population associations between genetic variants and disease phenotypes. DNA pooling can substantially reduce the cost of genotyping assays in these studies, and thus enables one to examine a large number of genetic variants on a large number of subjects. The availability of pooled genotype data instead of individual data poses considerable challenges in the statistical inference, especially in the haplotype-based analysis because of increased phase uncertainty. Here we present a general likelihood-based approach to making inferences about haplotype-disease associations based on possibly pooled DNA data. We consider cohort and case-control studies of unrelated subjects, and allow arbitrary and unequal pool sizes. The phenotype can be discrete or continuous, univariate or multivariate. The effects of haplotypes on disease phenotypes are formulated through flexible regression models, which allow a variety of genetic hypotheses and gene-environment interactions. We construct appropriate likelihood functions for various designs and phenotypes, accommodating Hardy-Weinberg disequilibrium. The corresponding maximum likelihood estimators are approximately unbiased, normally distributed, and statistically efficient. We develop simple and efficient numerical algorithms for calculating the maximum likelihood estimators and their variances, and implement these algorithms in a freely available computer program. We assess the performance of the proposed methods through simulation studies, and provide an application to the Finland-United States Investigation of NIDDM Genetics Study. The results show that DNA pooling is highly efficient in studying haplotype-disease associations. As a by-product, this work provides valid and efficient methods for estimating haplotype-disease associations with unpooled DNA samples. (C) 2004 Wiley-Liss, Inc.
引用
收藏
页码:70 / 82
页数:13
相关论文
共 31 条
[1]  
Akaike H., 1998, A Celebration ofStatistics, P387, DOI DOI 10.1007/978-1-4613-8560-8_1
[2]   Haplotypes vs single marker linkage disequilibrium tests:: what do we gain? (Reprinted European Journal of Human Genetics, Vol 4, pg 291-300, 2001) [J].
Akey, Joshua ;
Jin, Li ;
Xiong, Momiao .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2017, 25 :S51-S58
[3]   DNA pooling in mutation detection with reference to sequence analysis [J].
Amos, CI ;
Frazier, ML ;
Wang, WF .
AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 66 (05) :1689-1692
[4]   USE OF POOLED DNA SAMPLES TO DETECT LINKAGE DISEQUILIBRIUM OF POLYMORPHIC RESTRICTION FRAGMENTS AND HUMAN-DISEASE - STUDIES OF THE HLA CLASS-II LOCI [J].
ARNHEIM, N ;
STRANGE, C ;
ERLICH, H .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1985, 82 (20) :6970-6974
[5]   Association testing by DNA pooling: An effective initial screen [J].
Bansal, A ;
van den Boom, D ;
Kammerer, S ;
Honisch, C ;
Adam, G ;
Cantor, CR ;
Kleyn, P ;
Braun, A .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (26) :16871-16874
[6]   Association mapping of disease loci, by use of a pooled DNA genomic screen [J].
Barcellos, LF ;
Klitz, W ;
Field, LL ;
Tobias, R ;
Bowcock, AM ;
Wilson, R ;
Nelson, MP ;
Nagatomi, J ;
Thomson, G .
AMERICAN JOURNAL OF HUMAN GENETICS, 1997, 61 (03) :734-747
[7]  
Barratt BJ, 2002, ANN HUM GENET, V66, P393, DOI [10.1046/j.1469-1809.2002.00125.x, 10.1017/S0003480002001252]
[8]  
Bickel Peter J, 1993, Efficient and adaptive estimation for semiparametric models, V4
[9]   Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease [J].
Botstein, D ;
Risch, N .
NATURE GENETICS, 2003, 33 (Suppl 3) :228-237
[10]   APPROXIMATE INFERENCE IN GENERALIZED LINEAR MIXED MODELS [J].
BRESLOW, NE ;
CLAYTON, DG .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (421) :9-25