Fast "coalescent" simulation

Cited by: 143
Authors
Marjoram, P [1]
Wall, JD [1]
Affiliations
[1] University of Southern California, Department of Preventive Medicine, Los Angeles, CA 90089, USA
DOI
10.1186/1471-2156-7-16
Chinese Library Classification (CLC) number
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
Background: The amount of genome-wide molecular data is increasing rapidly, as is interest in developing methods appropriate for such data. There is a consequent increasing need for methods that are able to efficiently simulate such data. In this paper we implement the sequentially Markovian coalescent algorithm described by McVean and Cardin and present a further modification to that algorithm which slightly improves the closeness of the approximation to the full coalescent model. The algorithm ignores a class of recombination events known to affect the behavior of the genealogy of the sample, but which do not appear to affect the behavior of generated samples to any substantial degree.

Results: We show that our software is able to simulate large chromosomal regions, such as those appropriate in a consideration of genome-wide data, in a way that is several orders of magnitude faster than existing coalescent algorithms.

Conclusion: This algorithm provides a useful resource for those needing to simulate large quantities of data for chromosomal-length regions using an approach that is much more efficient than traditional coalescent models.
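
Illustrative note (not from the paper): the sequentially Markovian idea named in the abstract can be sketched for the simplest case of two sampled sequences. This is not the authors' software and does not include their modification. It assumes time is measured in coalescent units (pairwise coalescence rate 1) and that `rho` is the scaled recombination rate per unit of sequence, so breakpoints arrive at rate `rho * height` along the sequence. The function name `smc_pair` and its parameters are hypothetical.

```python
import random

def smc_pair(rho, length=1.0, seed=None):
    """Sketch of a sequentially Markovian coalescent walk for two samples.

    Returns a list of (start, end, height) segments, where height is the
    time to the most recent common ancestor on that stretch of sequence.
    """
    rng = random.Random(seed)
    pos = 0.0
    height = rng.expovariate(1.0)  # tree height at the left end of the region
    segments = []
    while True:
        # Total branch length is 2 * height and each lineage recombines at
        # rate rho / 2, so the next breakpoint is at rate rho * height.
        rate = rho * height
        step = rng.expovariate(rate) if rate > 0 else float("inf")
        end = min(pos + step, length)
        segments.append((pos, end, height))
        if end >= length:
            return segments
        pos = end
        # Pick the recombination time uniformly on the current tree, discard
        # the old branch above it, and let the detached lineage re-coalesce
        # with the other lineage at rate 1.
        u = rng.uniform(0.0, height)
        height = u + rng.expovariate(1.0)
```

Each new tree depends only on the tree immediately to its left, which is what lets the simulation move along the sequence without storing the full ancestral history. Calling `smc_pair(50.0, 1.0, seed=1)` yields a list of contiguous segments covering the interval from 0 to 1.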
Pages: 9