A Hidden Markov Model Combining Linkage and Linkage Disequilibrium Information for Haplotype Reconstruction and Quantitative Trait Locus Fine Mapping

被引:170
作者
Druet, Tom [1 ]
Georges, Michel
机构
[1] Univ Liege, Fac Vet Med, GIGA Res, Unit Anim Genom, B-4000 Liege, Belgium
关键词
WHOLE-GENOME ASSOCIATION; MILK-YIELD; DAIRY-CATTLE; DESCENT; PREDICTION; IDENTITY; NUCLEOTIDE; IMPUTATION; PEDIGREES; INFERENCE;
D O I
10.1534/genetics.109.108431
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Faithful reconstruction of haplotypes from diploid marker data (phasing) is important for many kinds of genetic analyses, including mapping of trait loci, prediction of genomic breeding values, and identification of signatures of selection. In human genetics, phasing most often exploits population information (linkage disequilibrium), while in animal genetics the primary source of information is familial (Mendelian segregation and linkage). We herein develop and evaluate a method that simultaneously exploits both sources of information. It builds on hidden Markov models that were initially developed to exploit population information only. We demonstrate that the approach improves the accuracy of allele phasing as well as imputation of missing genotypes. Reconstructed haplotypes are assigned to hidden states that are shown to correspond to clusters of genealogically related chromosomes. We show that these cluster states can directly be used to fine map QTL. The method is computationally effective at handling large data sets based on high-density SNP panels.
引用
收藏
页码:789 / U237
页数:12
相关论文
共 31 条
[21]  
Meuwissen THE, 2000, GENETICS, V155, P421
[22]  
Meuwissen THE, 2001, GENET SEL EVOL, V33, P605, DOI 10.1051/gse:2001134
[23]  
Meuwissen THE, 2001, GENETICS, V157, P1819
[24]   Multipoint identity-by-descent prediction using dense markers to map quantitative trait loci and estimate effective population size [J].
Meuwissen, Theo H. E. ;
Goddard, Mike E. .
GENETICS, 2007, 176 (04) :2551-2560
[25]   A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION [J].
RABINER, LR .
PROCEEDINGS OF THE IEEE, 1989, 77 (02) :257-286
[26]   A fast and flexible statistical model for large-scale population genotype data: Applications to inferring missing genotypes and haplotypic phase [J].
Scheet, P ;
Stephens, M .
AMERICAN JOURNAL OF HUMAN GENETICS, 2006, 78 (04) :629-644
[27]   SNP discovery and allele frequency estimation by deep sequencing of reduced representation libraries [J].
Van Tassell, Curtis P. ;
Smith, Timothy P. L. ;
Matukumalli, Lakshmi K. ;
Taylor, Jeremy F. ;
Schnabel, Robert D. ;
Lawley, Cynthia Taylor ;
Haudenschild, Christian D. ;
Moore, Stephen S. ;
Warren, Wesley C. ;
Sonstegard, Tad S. .
NATURE METHODS, 2008, 5 (03) :247-252
[28]   DERIVATION, CALCULATION, AND USE OF NATIONAL ANIMAL-MODEL INFORMATION [J].
VANRADEN, PM ;
WIGGANS, GR .
JOURNAL OF DAIRY SCIENCE, 1991, 74 (08) :2737-2746
[29]  
Voight BF, 2006, PLOS BIOL, V4, P446, DOI 10.1371/journal.pbio.0040072
[30]   Rapid haplotype reconstruction in pedigrees with dense marker maps [J].
Windig, JJ ;
Meuwissen, THE .
JOURNAL OF ANIMAL BREEDING AND GENETICS, 2004, 121 (01) :26-39