Semi-parametric genomic-enabled prediction of genetic values using reproducing kernel Hilbert spaces methods

被引:275
作者
de los Campos, Gustavo [1 ,2 ]
Gianola, Daniel [1 ]
Rosa, Guilherme J. M. [1 ]
Weigel, Kent A. [1 ]
Crossa, Jose [2 ]
机构
[1] Univ Wisconsin, Madison, WI 53706 USA
[2] CIMMYT, Int Maize & Wheat Improvement Ctr, Mexico City 06600, DF, Mexico
关键词
QUANTITATIVE TRAITS; PAIRWISE RELATEDNESS; ASSISTED PREDICTION; RIDGE REGRESSION; BREEDING VALUES; SELECTION; INFORMATION;
D O I
10.1017/S0016672310000285
中图分类号
Q3 [遗传学];
学科分类号
071007 [遗传学];
摘要
Prediction of genetic values is a central problem in quantitative genetics. Over many decades, such predictions have been successfully accomplished using information on phenotypic records and family structure usually represented with a pedigree. Dense molecular markers are now available in the genome of humans, plants and animals, and this information can be used to enhance the prediction of genetic values. However, the incorporation of dense molecular marker data into models poses many statistical and computational challenges, such as how models can cope with the genetic complexity of multi-factorial traits and with the curse of dimensionality that arises when the number of markers exceeds the number of data points. Reproducing kernel Hilbert spaces regressions can be used to address some of these challenges. The methodology allows regressions on almost any type of prediction sets (covariates, graphs, strings, images, etc.) and has important computational advantages relative to many parametric approaches. Moreover, some parametric models appear as special cases. This article provides an overview of the methodology, a discussion of the problem of kernel choice with a focus on genetic applications, algorithms for kernel selection and an assessment of the proposed methods using a collection of 599 wheat lines evaluated for grain yield in four mega environments.
引用
收藏
页码:295 / 308
页数:14
相关论文
共 44 条
[1]
[Anonymous], 2002, P 19 INT C MACH LEAR
[2]
THEORY OF REPRODUCING KERNELS [J].
ARONSZAJN, N .
TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY, 1950, 68 (MAY) :337-404
[3]
Prospects for genomewide selection for quantitative traits in maize [J].
Bernardo, Rex ;
Yu, Jianming .
CROP SCIENCE, 2007, 47 (03) :1082-1090
[4]
Examining the relative influence of familial, genetic, and environmental covariate information in flexible risk models [J].
Bravo, Hector Corrada ;
Lee, Kristine E. ;
Klein, Barbara E. K. ;
Klein, Ronald ;
Iyengar, Sudha K. ;
Wahba, Grace .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (20) :8128-8133
[5]
Accuracy of breeding values when using and ignoring the polygenic effect in genomic breeding value estimation with a marker density of one SNP per cM [J].
Calus, M. P. L. ;
Veerkamp, R. F. .
JOURNAL OF ANIMAL BREEDING AND GENETICS, 2007, 124 (06) :362-368
[6]
COCKERHAM CC, 1954, GENETICS, V39, P859
[7]
Cressie N., 1993, Statistics for Spatial Data, DOI [10.1002/9781119115151, DOI 10.1002/9781119115151]
[8]
Reproducing kernel Hilbert spaces regression: A general framework for genetic evaluation [J].
de los Campos, G. ;
Gianola, D. ;
Rosa, G. J. M. .
JOURNAL OF ANIMAL SCIENCE, 2009, 87 (06) :1883-1887
[9]
Predicting Quantitative Traits With Regression Models for Dense Molecular Markers and Pedigree [J].
de los Campos, Gustavo ;
Naya, Hugo ;
Gianola, Daniel ;
Crossa, Jose ;
Legarra, Andres ;
Manfredi, Eduardo ;
Weigel, Kent ;
Cotes, Jose Miguel .
GENETICS, 2009, 182 (01) :375-385
[10]
DELOSCAMPOS G, 2010, P 9 WORLD C IN PRESS