The fine-scale architecture of structural variants in 17 mouse genomes

被引:40
作者
Yalcin, Binnaz [1 ,2 ]
Wong, Kim [3 ]
Bhomra, Amarjit [1 ]
Goodson, Martin [1 ]
Keane, Thomas M. [3 ]
Adams, David J. [3 ]
Flint, Jonathan [1 ]
机构
[1] Wellcome Trust Ctr Human Genet, Oxford OX3 7BN, England
[2] Univ Lausanne, Ctr Integrat Genom, Dept Med Genet, Lausanne, Switzerland
[3] Wellcome Trust Sanger Inst, Cambridge CB10 1HH, England
来源
GENOME BIOLOGY | 2012年 / 13卷 / 03期
基金
英国医学研究理事会; 英国惠康基金;
关键词
COPY-NUMBER VARIATION; ASSOCIATION; REARRANGEMENTS; DISORDERS; DELETIONS;
D O I
10.1186/gb-2012-13-3-r18
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Accurate catalogs of structural variants (SVs) in mammalian genomes are necessary to elucidate the potential mechanisms that drive SV formation and to assess their functional impact. Next generation sequencing methods for SV detection are an advance on array-based methods, but are almost exclusively limited to four basic types: deletions, insertions, inversions and copy number gains. Results: By visual inspection of 100 Mbp of genome to which next generation sequence data from 17 inbred mouse strains had been aligned, we identify and interpret 21 paired-end mapping patterns, which we validate by PCR. These paired-end mapping patterns reveal a greater diversity and complexity in SVs than previously recognized. In addition, Sanger-based sequence analysis of 4,176 breakpoints at 261 SV sites reveal additional complexity at approximately a quarter of structural variants analyzed. We find micro-deletions and micro-insertions at SV breakpoints, ranging from 1 to 107 bp, and SNPs that extend breakpoint micro-homology and may catalyze SV formation. Conclusions: An integrative approach using experimental analyses to train computational SV calling is essential for the accurate resolution of the architecture of SVs. We find considerable complexity in SV formation; about a quarter of SVs in the mouse are composed of a complex mixture of deletion, insertion, inversion and copy number gain. Computational methods can be adapted to identify most paired-end mapping patterns.
引用
收藏
页数:11
相关论文
共 43 条
[1]   Dindel: Accurate indel calls from short-read data [J].
Albers, Cornelis A. ;
Lunter, Gerton ;
MacArthur, Daniel G. ;
McVean, Gilean ;
Ouwehand, Willem H. ;
Durbin, Richard .
GENOME RESEARCH, 2011, 21 (06) :961-973
[2]   APPLICATIONS OF NEXT-GENERATION SEQUENCING Genome structural variation discovery and genotyping [J].
Alkan, Can ;
Coe, Bradley P. ;
Eichler, Evan E. .
NATURE REVIEWS GENETICS, 2011, 12 (05) :363-375
[3]   The genomic complexity of primary human prostate cancer [J].
Berger, Michael F. ;
Lawrence, Michael S. ;
Demichelis, Francesca ;
Drier, Yotam ;
Cibulskis, Kristian ;
Sivachenko, Andrey Y. ;
Sboner, Andrea ;
Esgueva, Raquel ;
Pflueger, Dorothee ;
Sougnez, Carrie ;
Onofrio, Robert ;
Carter, Scott L. ;
Park, Kyung ;
Habegger, Lukas ;
Ambrogio, Lauren ;
Fennell, Timothy ;
Parkin, Melissa ;
Saksena, Gordon ;
Voet, Douglas ;
Ramos, Alex H. ;
Pugh, Trevor J. ;
Wilkinson, Jane ;
Fisher, Sheila ;
Winckler, Wendy ;
Mahan, Scott ;
Ardlie, Kristin ;
Baldwin, Jennifer ;
Simons, Jonathan W. ;
Kitabayashi, Naoki ;
MacDonald, Theresa Y. ;
Kantoff, Philip W. ;
Chin, Lynda ;
Gabriel, Stacey B. ;
Gerstein, Mark B. ;
Golub, Todd R. ;
Meyerson, Matthew ;
Tewari, Ashutosh ;
Lander, Eric S. ;
Getz, Gad ;
Rubin, Mark A. ;
Garraway, Levi A. .
NATURE, 2011, 470 (7333) :214-220
[4]   Large, rare chromosomal deletions associated with severe early-onset obesity [J].
Bochukova, Elena G. ;
Huang, Ni ;
Keogh, Julia ;
Henning, Elana ;
Purmann, Carolin ;
Blaszczyk, Kasia ;
Saeed, Sadia ;
Hamilton-Shield, Julian ;
Clayton-Smith, Jill ;
O'Rahilly, Stephen ;
Hurles, Matthew E. ;
Farooqi, I. Sadaf .
NATURE, 2010, 463 (7281) :666-670
[5]  
Chen K, 2009, NAT METHODS, V6, P677, DOI [10.1038/NMETH.1363, 10.1038/nmeth.1363]
[6]   Mutation spectrum revealed by breakpoint sequencing of human germline CNVs [J].
Conrad, Donald F. ;
Bird, Christine ;
Blackburne, Ben ;
Lindsay, Sarah ;
Mamanova, Lira ;
Lee, Charles ;
Turner, Daniel J. ;
Hurles, Matthew E. .
NATURE GENETICS, 2010, 42 (05) :385-U43
[7]   Origins and functional impact of copy number variation in the human genome [J].
Conrad, Donald F. ;
Pinto, Dalila ;
Redon, Richard ;
Feuk, Lars ;
Gokcumen, Omer ;
Zhang, Yujun ;
Aerts, Jan ;
Andrews, T. Daniel ;
Barnes, Chris ;
Campbell, Peter ;
Fitzgerald, Tomas ;
Hu, Min ;
Ihm, Chun Hwa ;
Kristiansson, Kati ;
MacArthur, Daniel G. ;
MacDonald, Jeffrey R. ;
Onyiah, Ifejinelo ;
Pang, Andy Wing Chun ;
Robson, Sam ;
Stirrups, Kathy ;
Valsesia, Armand ;
Walter, Klaudia ;
Wei, John ;
Tyler-Smith, Chris ;
Carter, Nigel P. ;
Lee, Charles ;
Scherer, Stephen W. ;
Hurles, Matthew E. .
NATURE, 2010, 464 (7289) :704-712
[8]   Break-Induced Replication Is Highly Inaccurate [J].
Deem, Angela ;
Keszthelyi, Andrea ;
Blackgrove, Tiffany ;
Vayl, Alexandra ;
Coffey, Barbara ;
Mathur, Ruchi ;
Chabes, Andrei ;
Malkova, Anna .
PLOS BIOLOGY, 2011, 9 (02)
[9]   Copy number variation at 1q21.1 associated with neuroblastoma [J].
Diskin, Sharon J. ;
Hou, Cuiping ;
Glessner, Joseph T. ;
Attiyeh, Edward F. ;
Laudenslager, Marci ;
Bosse, Kristopher ;
Cole, Kristina ;
Mosse, Yael P. ;
Wood, Andrew ;
Lynch, Jill E. ;
Pecor, Katlyn ;
Diamond, Maura ;
Winter, Cynthia ;
Wang, Kai ;
Kim, Cecilia ;
Geiger, Elizabeth A. ;
McGrady, Patrick W. ;
Blakemore, Alexandra I. F. ;
London, Wendy B. ;
Shaikh, Tamim H. ;
Bradfield, Jonathan ;
Grant, Struan F. A. ;
Li, Hongzhe ;
Devoto, Marcella ;
Rappaport, Eric R. ;
Hakonarson, Hakon ;
Maris, John M. .
NATURE, 2009, 459 (7249) :987-U112
[10]   Base-calling of automated sequencer traces using phred.: I.: Accuracy assessment [J].
Ewing, B ;
Hillier, L ;
Wendl, MC ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :175-185