Fast and flexible simulation of DNA sequence data

Cited by: 273
Authors
Chen, Gary K. [1, 2, 3]
Marjoram, Paul [2]
Wall, Jeffrey D. [1, 3]
Affiliations
[1] Univ Calif San Francisco, Inst Human Genet, San Francisco, CA 94143 USA
[2] Univ So Calif, Dept Prevent Med, Los Angeles, CA 90033 USA
[3] Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco, CA 94143 USA
Funding
National Institutes of Health (USA);
Keywords
LINKAGE DISEQUILIBRIUM; GENE CONVERSION; COALESCENT SIMULATION; HUMAN GENOME; RECOMBINATION; POPULATION; POLYMORPHISM; MODELS; DIVERSITY; SELECTION;
DOI
10.1101/gr.083634.108
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Simulation of genomic sequences under the coalescent with recombination has conventionally been impractical for regions beyond tens of megabases. This work presents an algorithm, implemented as the program MaCS (Markovian Coalescent Simulator), that can efficiently simulate haplotypes under any arbitrary model of population history. We present several metrics comparing the performance of MaCS with other available simulation programs. Practical usage of MaCS is demonstrated through a comparison of measures of linkage disequilibrium between generated program output and real genotype data from populations considered to be structured.
Pages: 136-142
Page count: 7