Improving accuracy of genomic predictions within and between dairy cattle breeds with imputed high-density single nucleotide polymorphism panels

被引:470
作者
Erbe, M. [2 ]
Hayes, B. J. [1 ,3 ,4 ]
Matukumalli, L. K. [5 ]
Goswami, S. [6 ]
Bowman, P. J. [1 ,3 ]
Reich, C. M. [1 ,3 ]
Mason, B. A. [1 ,3 ]
Goddard, M. E. [1 ,7 ]
机构
[1] Dept Primary Ind, Biosci Res Div, Bundoora, Vic 3083, Australia
[2] Univ Gottingen, Dept Anim Sci, Anim Breeding & Genet Grp, D-37075 Gottingen, Germany
[3] Dairy Futures Cooperat Res Ctr, Bundoora, Vic 3083, Australia
[4] La Trobe Univ, Bundoora, Vic 3086, Australia
[5] USDA, Bovine Funct Genom Lab, Beltsville, MD 20705 USA
[6] George Mason Univ, Manassas, VA 20110 USA
[7] Univ Melbourne, Fac Land & Food Resources, Parkville, Vic 3010, Australia
关键词
genomic selection; multiple breeds; GENETIC-RELATIONSHIP INFORMATION; LINKAGE DISEQUILIBRIUM; GENOTYPE IMPUTATION; RELATIONSHIP MATRIX; WIDE ASSOCIATION; MULTI-BREED; SELECTION; VALUES; POPULATIONS; INFERENCE;
D O I
10.3168/jds.2011-5019
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
Achieving accurate genomic estimated breeding values for dairy cattle requires a very large reference population of genotyped and phenotyped individuals. Assembling such reference populations has been achieved for breeds such as Holstein, but is challenging for breeds with fewer individuals. An alternative is to use a multibreed reference population, such that smaller breeds gain some advantage in accuracy of genomic estimated breeding values (GEBV) from information from larger breeds. However, this requires that marker-quantitative trait loci associations persist across breeds. Here, we assessed the gain in accuracy of GEBV in Jersey cattle as a result of using a combined Holstein and Jersey reference population, with either 39,745 or 624,213 single nucleotide polymorphism (SNP) markers. The surrogate used for accuracy was the correlation of GEBV with daughter trait deviations in a validation population. Two methods were used to predict breeding values, either a genomic BLUP (GBLIJP_mod), or a new method, BayesR, which used a mixture of normal distributions as the prior for SNP effects, including one distribution that set SNP effects to zero. The GBLUP_mod method scaled both the genomic relationship matrix and the additive relationship matrix to a base at the time the breeds diverged, and regressed the genomic relationship matrix to account for sampling errors in estimating relationship coefficients due to a finite number of markers, before combining the 2 matrices. Although these modifications did result in less biased breeding values for Jerseys compared with an unmodified genomic relationship matrix, BayesR gave the highest accuracies of GEBV for the 3 traits investigated (milk yield, fat yield, and protein yield), with an average increase in accuracy compared with GBLUP_mod across the 3 traits of 0.05 for both Jerseys and Holsteins. The advantage was limited for either Jerseys or Holsteins in using 624,213 SNP rather than 39,745 SNP (0.01 for Holsteins and 0.03 for Jerseys, averaged across traits). Even this limited and nonsignificant advantage was only observed when BayesR was used. An alternative panel, which extracted the SNP in the transcribed part of the bovine genome from the 624,213 SNP panel (to give 58,532 SNP), performed better, with an increase in accuracy of 0.03 for Jerseys across traits. This panel captures much of the increased genomic content of the 624,213 SNP panel, with the advantage of a greatly reduced number of SNP effects to estimate. Taken together, using this panel, a combined breed reference and using BayesR rather than GBLUP_mod increased the accuracy of GEBV in Jerseys from 0.43 to 0.52, averaged across the 3 traits.
引用
收藏
页码:4114 / 4129
页数:16
相关论文
共 33 条
[1]   Reliabilities of genomic prediction using combined reference data of the Nordic Red dairy cattle populations [J].
Brondum, R. F. ;
Rius-Vilarrasa, E. ;
Stranden, I. ;
Su, G. ;
Guldbrandtsen, B. ;
Fikse, W. F. ;
Lund, M. S. .
JOURNAL OF DAIRY SCIENCE, 2011, 94 (09) :4700-4707
[2]   A Unified Approach to Genotype Imputation and Haplotype-Phase Inference for Large Data Sets of Trios and Unrelated Individuals [J].
Browning, Brian L. ;
Browning, Sharon R. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2009, 84 (02) :210-223
[3]   Linkage disequilibrium and persistence of phase in Holstein-Friesian, Jersey and Angus cattle [J].
de Roos, A. P. W. ;
Hayes, B. J. ;
Spelman, R. J. ;
Goddard, M. E. .
GENETICS, 2008, 179 (03) :1503-1512
[4]   Reliability of Genomic Predictions Across Multiple Populations [J].
de Roos, A. P. W. ;
Hayes, B. J. ;
Goddard, M. E. .
GENETICS, 2009, 183 (04) :1545-1553
[5]   Genome-Wide Survey of SNP Variation Uncovers the Genetic Structure of Cattle Breeds [J].
Gibbs, Richard A. ;
Taylor, Jeremy F. ;
Van Tassell, Curtis P. ;
Barendse, William ;
Eversoie, Kallye A. ;
Gill, Clare A. ;
Green, Ronnie D. ;
Hamernik, Debora L. ;
Kappes, Steven M. ;
Lien, Sigbjorn ;
Matukumalli, Lakshmi K. ;
McEwan, John C. ;
Nazareth, Lynne V. ;
Schnabel, Robert D. ;
Taylor, Jeremy F. ;
Weinstock, George M. ;
Wheeler, David A. ;
Ajmone-Marsan, Paolo ;
Barendse, William ;
Boettcher, Paul J. ;
Caetano, Alexandre R. ;
Garcia, Jose Fernando ;
Hanotte, Olivier ;
Mariani, Paola ;
Skow, Loren C. ;
Williams, John L. ;
Caetano, Alexandre R. ;
Diallo, Boubacar ;
Green, Ronnie D. ;
Hailemariam, Lemecha ;
Hanotte, Olivier ;
Martinez, Mario L. ;
Morris, Chris A. ;
Silva, Luiz O. C. ;
Spelman, Richard J. ;
Taylor, Jeremy F. ;
Mulatu, Woudyalew ;
Zhao, Keyan ;
Abbey, Colette A. ;
Agaba, Morris ;
Araujo, Flabio R. ;
Bunch, Rowan J. ;
Burton, James ;
Gill, Clare A. ;
Gorni, Chiara ;
Olivier, Hanotte ;
Harrison, Blair E. ;
Luff, Bill ;
Machado, Marco A. ;
Mariani, Paola .
SCIENCE, 2009, 324 (5926) :528-532
[6]  
Gilmour A.R., 2002, ASREML USER GUIDE RE
[7]   Using the genomic relationship matrix to predict the accuracy of genomic selection [J].
Goddard, M. E. ;
Hayes, B. J. ;
Meuwissen, T. H. E. .
JOURNAL OF ANIMAL BREEDING AND GENETICS, 2011, 128 (06) :409-421
[8]   Genomic selection: prediction of accuracy and maximisation of long term response [J].
Goddard, Mike .
GENETICA, 2009, 136 (02) :245-257
[9]   The impact of genetic relationship information on genome-assisted breeding values [J].
Habier, D. ;
Fernando, R. L. ;
Dekkers, J. C. M. .
GENETICS, 2007, 177 (04) :2389-2397
[10]   Extension of the bayesian alphabet for genomic selection [J].
Habier, David ;
Fernando, Rohan L. ;
Kizilkaya, Kadir ;
Garrick, Dorian J. .
BMC BIOINFORMATICS, 2011, 12