Accuracy, efficiency and robustness of four algorithms allowing full sibship reconstruction from DNA marker data

Cited by: 75
Authors
Butler, K [1]
Field, C [1]
Herbinger, CM [1]
Smith, BR [1]
Affiliations
[1] Dalhousie Univ, Dept Math & Stat, Halifax, NS B3H 3J5, Canada
Keywords
DNA marker; full sib; likelihood; MCMC; pedigree reconstruction
DOI
10.1111/j.1365-294X.2004.02152.x
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification codes
071010; 081704
Abstract
In the problem of reconstructing full sib pedigrees from DNA marker data, three existing algorithms and one new algorithm are compared in terms of accuracy, efficiency and robustness using real and simulated data sets. An algorithm based on the exclusion principle and another based on a maximization of the Simpson index were very accurate at reconstructing data sets comprising a few large families but had problems with data sets with limited family structure, while a Markov Chain Monte Carlo (MCMC) algorithm based on the maximization of a partition score had the opposite behaviour. An MCMC algorithm based on maximizing the full joint likelihood performed best in small data sets comprising several medium-sized families but did not work well under most other conditions. It appears that the likelihood surface may be rough and presents challenges for the MCMC algorithm to find the global maximum. This likelihood algorithm also exhibited problems in reconstructing large family groups, due possibly to limits in computational precision. The accuracy of each algorithm improved with an increasing amount of information in the data set, and was very high with eight loci with eight alleles each. All four algorithms were quite robust to deviation from an idealized uniform allelic distribution, to departures from idealized Mendelian inheritance in simulated data sets and to the presence of null alleles. In contrast, none of the algorithms were very robust to the probable presence of error/mutation in the data. Depending upon the type of mutation or errors and the algorithm used, between 70 and 98% of the affected individuals were classified improperly on average.
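As a rough illustration of why a Simpson-index criterion favours a few large families, the sketch below scores a candidate partition of individuals into sibships. It assumes the common form of the index (the sum of squared group-size proportions); the exact variant used in the paper is not given in this record, and the function name `simpson_index` and the example partitions are hypothetical.

```python
from collections import Counter


def simpson_index(labels):
    """Score a partition given as one family label per individual.

    Assumption: the index is the sum of squared group-size proportions.
    Larger values mean the individuals are concentrated in fewer, larger groups.
    """
    n = len(labels)
    if n == 0:
        raise ValueError("partition must contain at least one individual")
    counts = Counter(labels)  # family label -> number of individuals
    return sum((c / n) ** 2 for c in counts.values())


# Hypothetical partitions of ten individuals.
few_large = ["A"] * 5 + ["B"] * 5   # two families of five
all_singletons = list(range(10))    # every individual unrelated

# A partition with a few large families scores higher than one with no structure.
print(simpson_index(few_large) > simpson_index(all_singletons))
```

Under this assumed definition, a search that maximizes the index is pulled toward large groups, which is consistent with the abstract's remark that the approach struggles on data sets with limited family structure.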
Pages: 1589-1600
Number of pages: 12