Inference about recombination from haplotype data: Lower bounds and recombination hotspots

被引:12
作者
Bafna, V [1 ]
Bansal, V [1 ]
机构
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
关键词
recombination; population genetics; ancestral recombination graph; lower bounds; recombination hotspots; computational complexity; minimum number of recombination;
D O I
10.1089/cmb.2006.13.501
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Recombination is an important evolutionary mechanism responsible for creating the patterns of haplotype variation observable in human populations. Recently, there has been extensive research on understanding the fine-scale variation in recombination across the human genome using DNA polymorphism data. Historical recombination events leave signature patterns in haplotype data. A nonparametric approach for estimating the number of historical recombination events is to compute the minimum number of recombination events in the history of a set of haplotypes. In this paper, we provide new and improved methods for computing lower bounds on the minimum number of recombination events. These methods are shown to detect a higher number of recombination events for a haplotype dataset from a region in the lipoprotein lipase gene than previous lower bounds. We apply our methods to two datasets for which recombination hotspots have been experimentally determined and demonstrate a high density of detectable recombination events in the regions annotated as recombination hotspots. The programs implementing the methods in this paper are available at www.cs.ucsd.edu/users/vibansal/RecBounds/.
引用
收藏
页码:501 / 521
页数:21
相关论文
共 43 条
[1]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theoryof NP-Completeness
[2]   The number of recombination events in a sample history: Conflict graph and lower bounds [J].
Bafna, V ;
Bansal, V .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2004, 1 (02) :78-90
[3]   Haplotype structure and population genetic inferences from nucleotide-sequence variation in human lipoprotein lipase [J].
Clark, AG ;
Weiss, KM ;
Nickerson, DA ;
Taylor, SL ;
Buchanan, A ;
Stengård, J ;
Salomaa, V ;
Vartiainen, E ;
Perola, M ;
Boerwinkle, E ;
Sing, CF .
AMERICAN JOURNAL OF HUMAN GENETICS, 1998, 63 (02) :595-612
[4]   High-resolution haplotype structure in the human genome [J].
Daly, MJ ;
Rioux, JD ;
Schaffner, SE ;
Hudson, TJ ;
Lander, ES .
NATURE GENETICS, 2001, 29 (02) :229-232
[5]  
Fearnhead P, 2001, GENETICS, V159, P1299
[6]   Application of coalescent mediods to reveal fine-scale rate vairiation and recombination hotspots [J].
Fearnhead, P ;
Harding, RM ;
Schneider, JA ;
Myers, S ;
Donnelly, P .
GENETICS, 2004, 167 (04) :2067-2081
[7]   A second generation human haplotype map of over 3.1 million SNPs [J].
Frazer, Kelly A. ;
Ballinger, Dennis G. ;
Cox, David R. ;
Hinds, David A. ;
Stuve, Laura L. ;
Gibbs, Richard A. ;
Belmont, John W. ;
Boudreau, Andrew ;
Hardenbol, Paul ;
Leal, Suzanne M. ;
Pasternak, Shiran ;
Wheeler, David A. ;
Willis, Thomas D. ;
Yu, Fuli ;
Yang, Huanming ;
Zeng, Changqing ;
Gao, Yang ;
Hu, Haoran ;
Hu, Weitao ;
Li, Chaohua ;
Lin, Wei ;
Liu, Siqi ;
Pan, Hao ;
Tang, Xiaoli ;
Wang, Jian ;
Wang, Wei ;
Yu, Jun ;
Zhang, Bo ;
Zhang, Qingrun ;
Zhao, Hongbin ;
Zhao, Hui ;
Zhou, Jun ;
Gabriel, Stacey B. ;
Barry, Rachel ;
Blumenstiel, Brendan ;
Camargo, Amy ;
Defelice, Matthew ;
Faggart, Maura ;
Goyette, Mary ;
Gupta, Supriya ;
Moore, Jamie ;
Nguyen, Huy ;
Onofrio, Robert C. ;
Parkin, Melissa ;
Roy, Jessica ;
Stahl, Erich ;
Winchester, Ellen ;
Ziaugra, Liuda ;
Altshuler, David ;
Shen, Yan .
NATURE, 2007, 449 (7164) :851-U3
[8]   The structure of haplotype blocks in the human genome [J].
Gabriel, SB ;
Schaffner, SF ;
Nguyen, H ;
Moore, JM ;
Roy, J ;
Blumenstiel, B ;
Higgins, J ;
DeFelice, M ;
Lochner, A ;
Faggart, M ;
Liu-Cordero, SN ;
Rotimi, C ;
Adeyemo, A ;
Cooper, R ;
Ward, R ;
Lander, ES ;
Daly, MJ ;
Altshuler, D .
SCIENCE, 2002, 296 (5576) :2225-2229
[9]   Efficient reconstruction of phylogenetic networks with constrained recombination [J].
Gusfield, D ;
Eddhu, S ;
Langley, C .
PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003, :363-374
[10]   RECONSTRUCTING EVOLUTION OF SEQUENCES SUBJECT TO RECOMBINATION USING PARSIMONY [J].
HEIN, J .
MATHEMATICAL BIOSCIENCES, 1990, 98 (02) :185-200