The fine-scale architecture of structural variants in 17 mouse genomes

被引:40
作者
Yalcin, Binnaz [1 ,2 ]
Wong, Kim [3 ]
Bhomra, Amarjit [1 ]
Goodson, Martin [1 ]
Keane, Thomas M. [3 ]
Adams, David J. [3 ]
Flint, Jonathan [1 ]
机构
[1] Wellcome Trust Ctr Human Genet, Oxford OX3 7BN, England
[2] Univ Lausanne, Ctr Integrat Genom, Dept Med Genet, Lausanne, Switzerland
[3] Wellcome Trust Sanger Inst, Cambridge CB10 1HH, England
来源
GENOME BIOLOGY | 2012年 / 13卷 / 03期
基金
英国医学研究理事会; 英国惠康基金;
关键词
COPY-NUMBER VARIATION; ASSOCIATION; REARRANGEMENTS; DISORDERS; DELETIONS;
D O I
10.1186/gb-2012-13-3-r18
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Accurate catalogs of structural variants (SVs) in mammalian genomes are necessary to elucidate the potential mechanisms that drive SV formation and to assess their functional impact. Next generation sequencing methods for SV detection are an advance on array-based methods, but are almost exclusively limited to four basic types: deletions, insertions, inversions and copy number gains. Results: By visual inspection of 100 Mbp of genome to which next generation sequence data from 17 inbred mouse strains had been aligned, we identify and interpret 21 paired-end mapping patterns, which we validate by PCR. These paired-end mapping patterns reveal a greater diversity and complexity in SVs than previously recognized. In addition, Sanger-based sequence analysis of 4,176 breakpoints at 261 SV sites reveal additional complexity at approximately a quarter of structural variants analyzed. We find micro-deletions and micro-insertions at SV breakpoints, ranging from 1 to 107 bp, and SNPs that extend breakpoint micro-homology and may catalyze SV formation. Conclusions: An integrative approach using experimental analyses to train computational SV calling is essential for the accurate resolution of the architecture of SVs. We find considerable complexity in SV formation; about a quarter of SVs in the mouse are composed of a complex mixture of deletion, insertion, inversion and copy number gain. Computational methods can be adapted to identify most paired-end mapping patterns.
引用
收藏
页数:11
相关论文
共 43 条
[41]   Sequence-based characterization of structural variation in the mouse genome [J].
Yalcin, Binnaz ;
Wong, Kim ;
Agam, Avigail ;
Goodson, Martin ;
Keane, Thomas M. ;
Gan, Xiangchao ;
Nellaker, Christoffer ;
Goodstadt, Leo ;
Nicod, Jerome ;
Bhomra, Amarjit ;
Hernandez-Pliego, Polinka ;
Whitley, Helen ;
Cleak, James ;
Dutton, Rebekah ;
Janowitz, Deborah ;
Mott, Richard ;
Adams, David J. ;
Flint, Jonathan .
NATURE, 2011, 477 (7364) :326-329
[42]   Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads [J].
Ye, Kai ;
Schulz, Marcel H. ;
Long, Quan ;
Apweiler, Rolf ;
Ning, Zemin .
BIOINFORMATICS, 2009, 25 (21) :2865-2871
[43]   The DNA replication FoSTeS/MMBIR mechanism can generate genomic, genic and exonic complex rearrangements in humans [J].
Zhang, Feng ;
Khajavi, Mehrdad ;
Connolly, Anne M. ;
Towne, Charles F. ;
Batish, Sat Dev ;
Lupski, James R. .
NATURE GENETICS, 2009, 41 (07) :849-U115