Beyond Missing Heritability: Prediction of Complex Traits

被引:197
作者
Makowsky, Robert [1 ]
Pajewski, Nicholas M. [1 ]
Klimentidis, Yann C. [1 ]
Vazquez, Ana I. [1 ]
Duarte, Christine W. [1 ]
Allison, David B. [1 ]
de los Campos, Gustavo [1 ]
机构
[1] Univ Alabama Birmingham, Dept Biostat, Birmingham, AL 35294 USA
关键词
COMMON SNPS EXPLAIN; QUANTITATIVE TRAITS; LARGE PROPORTION; GENETIC VALUES; LOCI; ASSOCIATION; HEIGHT; STRATIFICATION; SELECTION; VARIANTS;
D O I
10.1371/journal.pgen.1002051
中图分类号
Q3 [遗传学];
学科分类号
071007 [遗传学];
摘要
Despite rapid advances in genomic technology, our ability to account for phenotypic variation using genetic information remains limited for many traits. This has unfortunately resulted in limited application of genetic data towards preventive and personalized medicine, one of the primary impetuses of genome-wide association studies. Recently, a large proportion of the "missing heritability" for human height was statistically explained by modeling thousands of single nucleotide polymorphisms concurrently. However, it is currently unclear how gains in explained genetic variance will translate to the prediction of yet-to-be observed phenotypes. Using data from the Framingham Heart Study, we explore the genomic prediction of human height in training and validation samples while varying the statistical approach used, the number of SNPs included in the model, the validation scheme, and the number of subjects used to train the model. In our training datasets, we are able to explain a large proportion of the variation in height (h(2) up to 0.83, R-2 up to 0.96). However, the proportion of variance accounted for in validation samples is much smaller (ranging from 0.15 to 0.36 depending on the degree of familial information used in the training dataset). While such R-2 values vastly exceed what has been previously reported using a reduced number of pre-selected markers (< 0.10), given the heritability of the trait (similar to 0.80), substantial room for improvement remains.
引用
收藏
页数:9
相关论文
共 55 条
[1]
Hundreds of variants clustered in genomic loci and biological pathways affect human height [J].
Allen, Hana Lango ;
Estrada, Karol ;
Lettre, Guillaume ;
Berndt, Sonja I. ;
Weedon, Michael N. ;
Rivadeneira, Fernando ;
Willer, Cristen J. ;
Jackson, Anne U. ;
Vedantam, Sailaja ;
Raychaudhuri, Soumya ;
Ferreira, Teresa ;
Wood, Andrew R. ;
Weyant, Robert J. ;
Segre, Ayellet V. ;
Speliotes, Elizabeth K. ;
Wheeler, Eleanor ;
Soranzo, Nicole ;
Park, Ju-Hyun ;
Yang, Jian ;
Gudbjartsson, Daniel ;
Heard-Costa, Nancy L. ;
Randall, Joshua C. ;
Qi, Lu ;
Smith, Albert Vernon ;
Maegi, Reedik ;
Pastinen, Tomi ;
Liang, Liming ;
Heid, Iris M. ;
Luan, Jian'an ;
Thorleifsson, Gudmar ;
Winkler, Thomas W. ;
Goddard, Michael E. ;
Lo, Ken Sin ;
Palmer, Cameron ;
Workalemahu, Tsegaselassie ;
Aulchenko, Yurii S. ;
Johansson, Asa ;
Zillikens, M. Carola ;
Feitosa, Mary F. ;
Esko, Tonu ;
Johnson, Toby ;
Ketkar, Shamika ;
Kraft, Peter ;
Mangino, Massimo ;
Prokopenko, Inga ;
Absher, Devin ;
Albrecht, Eva ;
Ernst, Florian ;
Glazer, Nicole L. ;
Hayward, Caroline .
NATURE, 2010, 467 (7317) :832-838
[2]
Rare genetic variants and the risk of cancer [J].
Bodmer, Walter ;
Tomlinson, Ian .
CURRENT OPINION IN GENETICS & DEVELOPMENT, 2010, 20 (03) :262-267
[3]
Demonstrating stratification in a European American population [J].
Campbell, CD ;
Ogburn, EL ;
Lunetta, KL ;
Lyon, HN ;
Freedman, ML ;
Groop, LC ;
Altshuler, D ;
Ardlie, KG ;
Hirschhorn, JN .
NATURE GENETICS, 2005, 37 (08) :868-872
[4]
GWAS: heritability missing in action? [J].
Clarke, Angus J. ;
Cooper, David N. .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2010, 18 (08) :859-861
[5]
Prediction of Genetic Values of Quantitative Traits in Plant Breeding Using Pedigree and Molecular Markers [J].
Crossa, Jose ;
de los Campos, Gustavo ;
Perez, Paulino ;
Gianola, Daniel ;
Burgueno, Juan ;
Luis Araus, Jose ;
Makumbi, Dan ;
Singh, Ravi P. ;
Dreisigacker, Susanne ;
Yan, Jianbing ;
Arief, Vivi ;
Banziger, Marianne ;
Braun, Hans-Joachim .
GENETICS, 2010, 186 (02) :713-U406
[6]
DAWBER TR, 1951, AM J PUBLIC HEALTH, V41, P279
[7]
AN APPROACH TO LONGITUDINAL STUDIES IN A COMMUNITY - FRAMINGHAM STUDY [J].
DAWBER, TR ;
KANNEL, WB ;
LYELL, LP .
ANNALS OF THE NEW YORK ACADEMY OF SCIENCES, 1963, 107 (02) :539-&
[8]
de los Campos G., 2010, BLR: Bayesian linear regression
[9]
Predicting genetic predisposition in humans: the promise of whole-genome markers [J].
de los Campos, Gustavo ;
Gianola, Daniel ;
Allison, David B. .
NATURE REVIEWS GENETICS, 2010, 11 (12) :880-886
[10]
Semi-parametric genomic-enabled prediction of genetic values using reproducing kernel Hilbert spaces methods [J].
de los Campos, Gustavo ;
Gianola, Daniel ;
Rosa, Guilherme J. M. ;
Weigel, Kent A. ;
Crossa, Jose .
GENETICS RESEARCH, 2010, 92 (04) :295-308