Analysis of segmental duplications and genome assembly in the mouse

被引:87
作者
Bailey, JA
Church, DM
Ventura, M
Rocchi, M
Eichler, EE [1 ]
机构
[1] Case Western Reserve Univ, Sch Med, Ctr Computat Genom, Dept Genet, Cleveland, OH 44106 USA
[2] Univ Hosp Cleveland, Cleveland, OH 44106 USA
[3] Natl Lib Med, Natl Ctr Biotechnol Informat, NIH, Bethesda, MD 20894 USA
[4] Univ Bari, Dipartimento Anat Patol & Genet, Sez Genet, I-70126 Bari, Italy
基金
英国惠康基金;
关键词
D O I
10.1101/gr.2238404
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 [生物化学与分子生物学]; 081704 [应用化学];
摘要
Limited comparative studies suggest that the human genome is particularly enriched for recent segmental duplications. The extent of segmental duplications in other mammalian genomes is unknown and confounded by methodological differences in genome assembly. Here, we present a detailed analysis of recent duplication content within the mouse genome using a whole-genome assembly comparison method and a novel assembly independent method, designed to take advantage of the reduced allelic variation of the C57BL/6J strain. We conservatively estimate that similar to57% of all highly identical segmental duplications ( greater than or equal to90%) were misassembled or collapsed within the working draft WGS assembly. The WGS approach often leaves duplications fragmented and unassigned to a chromosome when compared with the clone-ordered-based approach. Our preliminary analysis suggests that 1.7%-2.0% of the mouse genome is part of recent large segmental duplications (about half of what is observed for the human genome). We have constructed a mouse segmental duplication database to aid in the characterization of these regions and their integration into the final mouse genome assembly. This work suggests significant biological differences in the architecture of recent segmental duplications between human and mouse. In addition, our unique method provides the means for improving whole-genome shotgun sequence assembly of mouse and future mammalian genomes.
引用
收藏
页码:789 / 801
页数:13
相关论文
共 51 条
[1]
BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]
Enrichment of segmental duplications in regions of breaks of synteny between the human and mouse genomes suggest their involvement in evolutionary rearrangements [J].
Armengol, L ;
Pujana, MA ;
Cheung, J ;
Scherer, SW ;
Estivill, X .
HUMAN MOLECULAR GENETICS, 2003, 12 (17) :2201-2208
[3]
An Alu transposition model for the origin and expansion of human segmental duplications [J].
Bailey, JA ;
Liu, G ;
Eichler, EE .
AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (04) :823-834
[4]
Recent segmental duplications in the human genome [J].
Bailey, JA ;
Gu, ZP ;
Clark, RA ;
Reinert, K ;
Samonte, RV ;
Schwartz, S ;
Adams, MD ;
Myers, EW ;
Li, PW ;
Eichler, EE .
SCIENCE, 2002, 297 (5583) :1003-1007
[5]
Human-specific duplication and mosaic transcripts: The recent paralogous structure of chromosome 22 [J].
Bailey, JA ;
Yavor, AM ;
Viggiano, L ;
Misceo, D ;
Horvath, JE ;
Archidiacono, N ;
Schwartz, S ;
Rocchi, M ;
Eichler, EE .
AMERICAN JOURNAL OF HUMAN GENETICS, 2002, 70 (01) :83-100
[6]
Segmental duplications: Organization and impact within the current Human Genome Project assembly [J].
Bailey, JA ;
Yavor, AM ;
Massa, HF ;
Trask, BJ ;
Eichler, EE .
GENOME RESEARCH, 2001, 11 (06) :1005-1017
[7]
Batzoglou S, 2002, GENOME RES, V12, P177, DOI 10.1101/gr.208902
[8]
Recent segmental and gene duplications in the mouse genome [J].
Cheung, J ;
Wilson, MD ;
Zhang, JJ ;
Khaja, R ;
MacDonald, JR ;
Heng, HHQ ;
Koop, BF ;
Scherer, SW .
GENOME BIOLOGY, 2003, 4 (08)
[9]
Genome-wide detection of segmental duplications and potential assembly errors in the human genome sequence [J].
Cheung, J ;
Estivill, X ;
Khaja, R ;
MacDonald, JR ;
Lau, K ;
Tsui, LC ;
Scherer, SW .
GENOME BIOLOGY, 2003, 4 (04)
[10]
Integration of cytogenetic landmarks into the draft sequence of the human genome [J].
Cheung, VG ;
Nowak, N ;
Jang, W ;
Kirsch, IR ;
Zhao, S ;
Chen, XN ;
Furey, TS ;
Kim, UJ ;
Kuo, WL ;
Olivier, M ;
Conroy, J ;
Kasprzyk, A ;
Massa, H ;
Yonescu, R ;
Sait, S ;
Thoreen, C ;
Snijders, A ;
Lemyre, E ;
Bailey, JA ;
Bruzel, A ;
Burrill, WD ;
Clegg, SM ;
Collins, S ;
Dhami, P ;
Friedman, C ;
Han, CS ;
Herrick, S ;
Lee, J ;
Ligon, AH ;
Lowry, S ;
Morley, M ;
Narasimhan, S ;
Osoegawa, K ;
Peng, Z ;
Plajzer-Frick, I ;
Quade, BJ ;
Scott, D ;
Sirotkin, K ;
Thorpe, AA ;
Gray, JW ;
Hudson, J ;
Pinkel, D ;
Ried, T ;
Rowen, L ;
Shen-Ong, GL ;
Strausberg, RL ;
Birney, E ;
Callen, DF ;
Cheng, JF ;
Cox, DR .
NATURE, 2001, 409 (6822) :953-958