Comparison of marker-based pairwise relatedness estimators on a pedigreed plant population

Cited by: 31
Authors
Bink, Marco C. A. M. [1]
Anderson, Amy D. [2]
van de Weg, W. Eric [3]
Thompson, Elizabeth A. [4]
Affiliations
[1] Univ Wageningen & Res Ctr, NL-6700 Wageningen, Netherlands
[2] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[3] Univ Wageningen & Res Ctr, Dept Plant Breeding, Wageningen, Netherlands
[4] Univ Washington, Dept Stat, Seattle, WA 98195 USA
DOI
10.1007/s00122-008-0824-1
Chinese Library Classification number
S3 [Agricultural science (agronomy)];
Discipline classification code
0901;
Abstract
Several estimators have been proposed that use molecular marker data to infer the degree of relatedness for pairs of individuals. The objective of this study was to evaluate the performance of seven estimators when applied to marker data of a set of 33 key individuals from a large complex apple pedigree. The evaluation considered different scenarios of allele frequencies and different numbers of marker loci. The method of moments estimators were Similarity, Queller-Goodnight, Lynch-Ritland and Wang. The maximum likelihood estimators were Thompson, Anderson-Weir and Jacquard. The pedigree-based coancestry coefficients were taken as the point of reference in calculating correlations and root mean square error (RMSE). The marker data comprised 86 multi-allelic SSR markers on 17 linkage groups, covering 11 Morgans. Additionally, we simulated 10 datasets conditional on the real pedigree to support the results on the real dataset. None of the estimators outperformed the others. Knowledge of allele frequencies appeared to be the most influential factor, i.e., the highest correlations and lowest RMSE were found when frequencies from the founder population were available. When equal allele frequencies were used, all estimators resulted in very similar, but on average lower, correlations. The use of allele frequencies estimated from the set of 33 individuals gave, on average, the poorest results. The maximum likelihood estimators and the Lynch-Ritland estimator were the most sensitive to allele frequencies. The results from the simulation study fully supported the trends in the results of the real dataset. This study indicated that high correlations (up to 0.90) and small RMSE (below 0.03) may be obtained when population allele frequencies are available. In this scenario, the performances of the various estimators were similar, but seemed to favor the maximum likelihood estimators. In the absence of reliable allele frequencies, the method of moments estimators were shown to be more robust. The number of marker loci influenced the average performance of the estimators; however, the ranking was not affected. Correlations up to 0.80 were obtained when two markers per chromosome and appropriate allele frequencies were available. Adding more markers to the current dataset may lead to marginal improvements.
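The two evaluation criteria named in the abstract, correlation and RMSE against pedigree-based coancestry, can be illustrated with a minimal Python sketch. The function names and the five pairwise values below are hypothetical and chosen only for illustration; they are not data or code from the study, and the study's own computations may differ in detail.

```python
import math

def pearson(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def rmse(estimates, reference):
    """Root mean square error of estimates relative to reference values."""
    n = len(estimates)
    return math.sqrt(sum((e - r) ** 2 for e, r in zip(estimates, reference)) / n)

# Hypothetical values for five pairs of individuals (not from the paper):
# pedigree-based coancestry is the reference, marker-based values are the estimates.
pedigree_coancestry = [0.25, 0.125, 0.0, 0.0625, 0.25]
marker_estimates = [0.22, 0.15, 0.02, 0.05, 0.27]

r = pearson(marker_estimates, pedigree_coancestry)
e = rmse(marker_estimates, pedigree_coancestry)
print(f"correlation = {r:.3f}, RMSE = {e:.4f}")
```

A higher correlation and a lower RMSE both indicate closer agreement between a marker-based estimator and the pedigree reference, which is how the abstract compares estimators across allele-frequency scenarios.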
Pages: 843-855
Number of pages: 13