Analysis of copy number variants and segmental duplications in the human genome: Evidence for a change in the process of formation in recent evolutionary history

被引:102
作者
Kim, Philip M. [1 ]
Lam, Hugo Y. K. [2 ]
Urban, Alexander E. [3 ]
Korbel, Jan O. [1 ,6 ]
Affourtit, Jason
Grubert, Fabian [4 ]
Chen, Xueying [1 ]
Weissman, Sherman [4 ]
Snyder, Michael [3 ]
Gerstein, Mark B. [1 ,2 ,5 ]
机构
[1] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
[2] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT 06520 USA
[3] Yale Univ, Dept Mol Cellular & Dev Biol, New Haven, CT 06520 USA
[4] Yale Univ, Dept Genet, New Haven, CT 06520 USA
[5] Yale Univ, Dept Comp Sci, New Haven, CT 06520 USA
[6] European Mol Biol Lab, D-69177 Heidelberg, Germany
关键词
D O I
10.1101/gr.081422.108
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Segmental duplications (SDs) are operationally defined as >1 kb stretches of duplicated DNA with high sequence identity. They arise from copy number variants (CNVs) fixed in the population. To investigate the formation of SDs and CNVs, we examine their large-scale patterns of co-occurrence with different repeats. Alu elements, a major class of genomic repeats, had previously been identified as prime drivers of SD formation. We also observe this association; however, we find that it sharply decreases for younger SDs. Continuing this trend, we find only weak associations of CNVs with Alus. Similarly, we find an association of SDs with processed pseudogenes, which is decreasing for younger SDs and absent entirely for CNVs. Next, we find that SDs are significantly co-localized with each other, resulting in a highly skewed "power-law" distribution and chromosomal hotspots. We also observe a significant association of CNVs with SDs, but find that an SD-mediated mechanism only accounts for some CNVs (<28%). Overall, our results imply that a shift in predominant formation mechanism occurred in recent history: similar to 40 million years ago, during the "Alu burst" in retrotransposition activity, non-allelic homologous recombination, first mediated by Alus and then the by newly formed CNVs themselves, was the main driver of genome rearrangements; however, its relative importance has decreased markedly since then, with proportionally more events now stemming from other repeats and from non-homologous end-joining. In addition to a coarse-grained analysis, we performed targeted sequencing of 67 CNVs and then analyzed a combined set of 270 CNVs (540 breakpoints) to verify our conclusions.
引用
收藏
页码:1865 / 1874
页数:10
相关论文
共 39 条
[1]   Statistical mechanics of complex networks [J].
Albert, R ;
Barabási, AL .
REVIEWS OF MODERN PHYSICS, 2002, 74 (01) :47-97
[2]   A haplotype map of the human genome [J].
Altshuler, D ;
Brooks, LD ;
Chakravarti, A ;
Collins, FS ;
Daly, MJ ;
Donnelly, P ;
Gibbs, RA ;
Belmont, JW ;
Boudreau, A ;
Leal, SM ;
Hardenbol, P ;
Pasternak, S ;
Wheeler, DA ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Zeng, CQ ;
Gao, Y ;
Hu, HR ;
Hu, WT ;
Li, CH ;
Lin, W ;
Liu, SQ ;
Pan, H ;
Tang, XL ;
Wang, J ;
Wang, W ;
Yu, J ;
Zhang, B ;
Zhang, QR ;
Zhao, HB ;
Zhao, H ;
Zhou, J ;
Gabriel, SB ;
Barry, R ;
Blumenstiel, B ;
Camargo, A ;
Defelice, M ;
Faggart, M ;
Goyette, M ;
Gupta, S ;
Moore, J ;
Nguyen, H ;
Onofrio, RC ;
Parkin, M ;
Roy, J ;
Stahl, E ;
Winchester, E ;
Ziaugra, L ;
Shen, Y .
NATURE, 2005, 437 (7063) :1299-1320
[3]   An Alu transposition model for the origin and expansion of human segmental duplications [J].
Bailey, JA ;
Liu, G ;
Eichler, EE .
AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (04) :823-834
[4]   Recent segmental duplications in the human genome [J].
Bailey, JA ;
Gu, ZP ;
Clark, RA ;
Reinert, K ;
Samonte, RV ;
Schwartz, S ;
Adams, MD ;
Myers, EW ;
Li, PW ;
Eichler, EE .
SCIENCE, 2002, 297 (5583) :1003-1007
[5]   Primate segmental duplications: crucibles of evolution, diversity and disease [J].
Bailey, Jeffrey A. ;
Eichler, Evan E. .
NATURE REVIEWS GENETICS, 2006, 7 (07) :552-564
[6]   Emergence of scaling in random networks [J].
Barabási, AL ;
Albert, R .
SCIENCE, 1999, 286 (5439) :509-512
[7]   Nonrecurrent MECP2 duplications mediated by genomic architecture-driven DNA breaks and break-induced replication repair [J].
Bauters, Marijke ;
Van Esch, Hilde ;
Friez, Michael J. ;
Boespflug-Tanguy, Odile ;
Zenker, Martin ;
Vianna-Morgante, Angela M. ;
Rosenberg, Carla ;
Ignatius, Jaakko ;
Raynaud, Martine ;
Hollanders, Karen ;
Govaerts, Karen ;
Vandenreijt, Kris ;
Niel, Florence ;
Blanc, Pierre ;
Stevenson, Roger E. ;
Fryns, Jean-Pierre ;
Marynen, Peter ;
Schwartz, Charles E. ;
Froyen, Guy .
GENOME RESEARCH, 2008, 18 (06) :847-858
[8]   A genome-wide comparison of recent chimpanzee and human segmental duplications [J].
Cheng, Z ;
Ventura, M ;
She, XW ;
Khaitovich, P ;
Graves, T ;
Osoegawa, K ;
Church, D ;
DeJong, P ;
Wilson, RK ;
Pääbo, S ;
Rocchi, M ;
Eichler, EE .
NATURE, 2005, 437 (7055) :88-93
[9]   Resolving the resolution of array CGH [J].
Coe, Bradley P. ;
Ylstra, Bauke ;
Carvalho, Beatriz ;
Meijer, Gerrit A. ;
MacAulay, Calum ;
Lam, Wan L. .
GENOMICS, 2007, 89 (05) :647-653
[10]   The population genetics of structural variation [J].
Conrad, Donald F. ;
Hurles, Matthew E. .
NATURE GENETICS, 2007, 39 (Suppl 7) :S30-S36