Imputation of genotypes with low-density chips and its effect on reliability of direct genomic values in Dutch Holstein cattle

被引:84
作者
Mulder, H. A. [1 ]
Calus, M. P. L. [1 ]
Druet, T. [2 ,3 ]
Schrooten, C. [4 ]
机构
[1] Wageningen UR Livestock Res, Anim Breeding & Genom Ctr, NL-8200 AB Lelystad, Netherlands
[2] Univ Liege, Unit Anim Genom, Fac Vet Med, B-4000 Liege, Belgium
[3] Univ Liege, Ctr Biomed Integrat Genoprot, B-4000 Liege, Belgium
[4] CRV, NL-6800 AL Arnhem, Netherlands
关键词
imputation; single nucleotide polymorphism; low-density chip; genomic selection; NUCLEOTIDE POLYMORPHISM GENOTYPES; HAPLOTYPE-PHASE INFERENCE; DAIRY-CATTLE; UNRELATED INDIVIDUALS; WIDE ASSOCIATION; BREEDING VALUES; JERSEY CATTLE; MARKER PANELS; GENE CONTENT; ACCURACY;
D O I
10.3168/jds.2011-4490
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
Genomic selection using 50,000 single nucleotide polymorphism (50k SNP) chips has been implemented in many dairy cattle breeding programs. Cheap, low-density chips make genotyping of a larger number of animals cost effective. A commonly proposed strategy is to impute low-density genotypes up to 50,000 genotypes before predicting direct genomic values (DGV). The objectives of this study were to investigate the accuracy of imputation for animals genotyped with a low-density chip and to investigate the effect of imputation on reliability of DGV. Low-density chips contained 384, 3,000, or 6,000 SNP. The SNP were selected based either on the highest minor allele frequency in a bin or the middle SNP in a bin, and DAGPHASE, CHROMIBD, and multivariate BLUP were used for imputation. Genotypes of 9,378 animals were used, from which approximately 2,350 animals had deregressed proofs. Bayesian stochastic search variable selection was used for estimating SNP effects of the 50k chip. Imputation accuracies and imputation error rates were poor for low-density chips with 384 SNP. Imputation accuracies were higher with 3,000 and 6,000 SNP. Performance of DAGPHASE and CHROMIBD was very similar and much better than that of multivariate BLUP for both imputation accuracy and reliability of DGV. With 3,000 SNP and using CHROMIBD or DAGPHASE for imputation, 84 to 90% of the increase in DGV reliability using the 50k chip, compared with a pedigree index, was obtained. With multivariate BLUP, the increase in reliability was only 40%. With 384 SNP, the reliability of DGV was lower than for a pedigree index, whereas with 6,000 SNP, about 93% of the increase in reliability of DGV based on the 50k chip was obtained when using DAGPHASE for imputation. Using genotype probabilities to predict gene content increased imputation accuracy and the reliability of DGV and is therefore recommended for applications of imputation for genomic prediction. A deterministic equation was derived to predict accuracy of DGV based on imputation accuracy, which fitted closely with the observed relationship. The deterministic equation can be used to evaluate the effect of differences in imputation accuracy on accuracy and reliability of DGV.
引用
收藏
页码:876 / 889
页数:14
相关论文
共 38 条
[1]   Imputation of genotypes from low- to high-density genotyping platforms and implications for genomic selection [J].
Berry, D. P. ;
Kearney, J. F. .
ANIMAL, 2011, 5 (08) :1162-1169
[2]   A Unified Approach to Genotype Imputation and Haplotype-Phase Inference for Large Data Sets of Trios and Unrelated Individuals [J].
Browning, Brian L. ;
Browning, Sharon R. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2009, 84 (02) :210-223
[3]   Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering [J].
Browning, Sharon R. ;
Browning, Brian L. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (05) :1084-1097
[4]   Missing data imputation and haplotype phase inference for genome-wide association studies [J].
Browning, Sharon R. .
HUMAN GENETICS, 2008, 124 (05) :439-450
[5]   Accuracy of genomic selection using different methods to define haplotypes [J].
Calus, M. P. L. ;
Meuwissen, T. H. E. ;
de Roos, A. P. W. ;
Veerkamp, R. F. .
GENETICS, 2008, 178 (01) :553-561
[6]   Imputation of missing single nucleotide polymorphism genotypes using a multivariate mixed model framework [J].
Calus, M. P. L. ;
Veerkamp, R. F. ;
Mulder, H. A. .
JOURNAL OF ANIMAL SCIENCE, 2011, 89 (07) :2042-2049
[7]   Genomic breeding value prediction: methods and procedures [J].
Calus, M. P. L. .
ANIMAL, 2010, 4 (02) :157-164
[8]   Highly effective SNP-based association mapping and management of recessive defects in livestock [J].
Charlier, Carole ;
Coppieters, Wouter ;
Rollin, Frederic ;
Desmecht, Daniel ;
Agerholm, Jorgen S. ;
Cambisano, Nadine ;
Carta, Eloisa ;
Dardano, Sabrina ;
Dive, Marc ;
Fasquelle, Corinne ;
Frennet, Jean-Claude ;
Hanset, Roger ;
Hubin, Xavier ;
Jorgensen, Claus ;
Karim, Latifa ;
Kent, Matthew ;
Harvey, Kirsten ;
Pearce, Brian R. ;
Simon, Patricia ;
Tama, Nico ;
Nie, Haisheng ;
Vandeputte, Sebastien ;
Lien, Sigbjorn ;
Longeri, Maria ;
Fredholm, Merete ;
Harvey, Robert J. ;
Georges, Michel .
NATURE GENETICS, 2008, 40 (04) :449-454
[9]   Imputation of genotypes from different single nucleotide polymorphism panels in dairy cattle [J].
Druet, T. ;
Schrooten, C. ;
de Roos, A. P. W. .
JOURNAL OF DAIRY SCIENCE, 2010, 93 (11) :5443-5454
[10]   Modeling of Identity-by-Descent Processes Along a Chromosome Between Haplotypes and Their Genotyped Ancestors [J].
Druet, Tom ;
Farnir, Frederic Paul .
GENETICS, 2011, 188 (02) :409-U279