A Common Dataset for Genomic Analysis of Livestock Populations

Cited by: 86
Authors
Cleveland, Matthew A. [1]
Hickey, John M. [2]
Forni, Selma [1]
Affiliations
[1] Genus Plc, Hendersonville, TN 37075 USA
[2] Univ New England, Sch Environm & Rural Sci, Armidale, NSW 2351, Australia
Source
G3-GENES GENOMES GENETICS | 2012 / Vol. 2 / Issue 04
Funding
Australian Research Council;
Keywords
pig; genomic relationships; GenPred; cross-validation; shared data resources; BREEDING VALUES; FULL PEDIGREE; INFORMATION; PREDICTION; TRAITS;
DOI
10.1534/g3.111.001453
Chinese Library Classification number
Q3 [Genetics];
Discipline classification code
071007 ; 090102 ;
Abstract
Although common datasets are an important resource for the scientific community and can be used to address important questions, genomic datasets of a meaningful size have not generally been available in livestock species. We describe a pig dataset that PIC (a Genus company) has made available for comparing genomic prediction methods. We also describe genomic evaluation of the data using methods that PIC considers best practice for predicting and validating genomic breeding values, and we discuss the impact of data structure on accuracy. The dataset contains 3534 individuals with high-density genotypes, phenotypes, and estimated breeding values for five traits. Genomic breeding values were calculated using BayesB, with phenotypes and de-regressed breeding values, and using a single-step genomic BLUP approach that combines information from genotyped and un-genotyped animals. The genomic breeding value accuracy increased with increased trait heritability and with increased relationship between training and validation. In nearly all cases, BayesB using de-regressed breeding values outperformed the other approaches, but the single-step evaluation performed only slightly worse. This dataset was useful for comparing methods for genomic prediction using real data. Our results indicate that validation approaches accounting for relatedness between populations can correct for potential overestimation of genomic breeding value accuracies, with implications for genotyping strategies to carry out genomic selection programs.
Pages: 429-435
Number of pages: 7