Shotgun sequence assembly and recent segmental duplications within the human genome

被引:184
作者
She, XW
Jiang, ZX
Clark, RL
Liu, G
Cheng, Z
Tuzun, E
Church, DM
Sutton, G
Halpern, AL
Eichler, EE
机构
[1] Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
[2] Case Western Reserve Univ, Dept Genet, Cleveland, OH 44106 USA
[3] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20894 USA
[4] Appl Biosyst Inc, Rockville, MD 20850 USA
[5] Ctr Advancement Genom, Rockville, MD 20850 USA
关键词
D O I
10.1038/nature03062
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Complex eukaryotic genomes are now being sequenced at an accelerated pace primarily using whole-genome shotgun (WGS) sequence assembly approaches. WGS assembly was initially criticized because of its perceived inability to resolve repeat structures within genomes. Here, we quantify the effect of WGS sequence assembly on large, highly similar repeats by comparison Of the segmental duplication content of two different human genome assemblies. Our analysis shows that large (>15 kilobases) and highly identical (>97%) duplications are not adequately resolved by WGS assembly. This leads to significant reduction in genome length and the loss of genes embedded within duplications. Comparable analyses of mouse genome assemblies confirm that strict WGS sequence assembly will oversimplify our understanding of mammalian genome structure and evolution; a hybrid strategy using a targeted clone-by-clone approach to resolve duplications is proposed.
引用
收藏
页码:927 / 930
页数:4
相关论文
共 22 条
  • [1] The genome sequence of Drosophila melanogaster
    Adams, MD
    Celniker, SE
    Holt, RA
    Evans, CA
    Gocayne, JD
    Amanatides, PG
    Scherer, SE
    Li, PW
    Hoskins, RA
    Galle, RF
    George, RA
    Lewis, SE
    Richards, S
    Ashburner, M
    Henderson, SN
    Sutton, GG
    Wortman, JR
    Yandell, MD
    Zhang, Q
    Chen, LX
    Brandon, RC
    Rogers, YHC
    Blazej, RG
    Champe, M
    Pfeiffer, BD
    Wan, KH
    Doyle, C
    Baxter, EG
    Helt, G
    Nelson, CR
    Miklos, GLG
    Abril, JF
    Agbayani, A
    An, HJ
    Andrews-Pfannkoch, C
    Baldwin, D
    Ballew, RM
    Basu, A
    Baxendale, J
    Bayraktaroglu, L
    Beasley, EM
    Beeson, KY
    Benos, PV
    Berman, BP
    Bhandari, D
    Bolshakov, S
    Borkova, D
    Botchan, MR
    Bouck, J
    Brokstein, P
    [J]. SCIENCE, 2000, 287 (5461) : 2185 - 2195
  • [2] Enrichment of segmental duplications in regions of breaks of synteny between the human and mouse genomes suggest their involvement in evolutionary rearrangements
    Armengol, L
    Pujana, MA
    Cheung, J
    Scherer, SW
    Estivill, X
    [J]. HUMAN MOLECULAR GENETICS, 2003, 12 (17) : 2201 - 2208
  • [3] Analysis of segmental duplications and genome assembly in the mouse
    Bailey, JA
    Church, DM
    Ventura, M
    Rocchi, M
    Eichler, EE
    [J]. GENOME RESEARCH, 2004, 14 (05) : 789 - 801
  • [4] Hotspots of mammalian chromosomal evolution
    Bailey, JA
    Baertsch, R
    Kent, WJ
    Haussler, D
    Eichler, EE
    [J]. GENOME BIOLOGY, 2004, 5 (04)
  • [5] Recent segmental duplications in the human genome
    Bailey, JA
    Gu, ZP
    Clark, RA
    Reinert, K
    Samonte, RV
    Schwartz, S
    Adams, MD
    Myers, EW
    Li, PW
    Eichler, EE
    [J]. SCIENCE, 2002, 297 (5583) : 1003 - 1007
  • [6] Segmental duplications: Organization and impact within the current Human Genome Project assembly
    Bailey, JA
    Yavor, AM
    Massa, HF
    Trask, BJ
    Eichler, EE
    [J]. GENOME RESEARCH, 2001, 11 (06) : 1005 - 1017
  • [7] Batzoglou S, 2002, GENOME RES, V12, P177, DOI 10.1101/gr.208902
  • [8] Genome-wide detection of segmental duplications and potential assembly errors in the human genome sequence
    Cheung, J
    Estivill, X
    Khaja, R
    MacDonald, JR
    Lau, K
    Tsui, LC
    Scherer, SW
    [J]. GENOME BIOLOGY, 2003, 4 (04)
  • [9] Integration of cytogenetic landmarks into the draft sequence of the human genome
    Cheung, VG
    Nowak, N
    Jang, W
    Kirsch, IR
    Zhao, S
    Chen, XN
    Furey, TS
    Kim, UJ
    Kuo, WL
    Olivier, M
    Conroy, J
    Kasprzyk, A
    Massa, H
    Yonescu, R
    Sait, S
    Thoreen, C
    Snijders, A
    Lemyre, E
    Bailey, JA
    Bruzel, A
    Burrill, WD
    Clegg, SM
    Collins, S
    Dhami, P
    Friedman, C
    Han, CS
    Herrick, S
    Lee, J
    Ligon, AH
    Lowry, S
    Morley, M
    Narasimhan, S
    Osoegawa, K
    Peng, Z
    Plajzer-Frick, I
    Quade, BJ
    Scott, D
    Sirotkin, K
    Thorpe, AA
    Gray, JW
    Hudson, J
    Pinkel, D
    Ried, T
    Rowen, L
    Shen-Ong, GL
    Strausberg, RL
    Birney, E
    Callen, DF
    Cheng, JF
    Cox, DR
    [J]. NATURE, 2001, 409 (6822) : 953 - 958
  • [10] Genome sequence of the Brown Norway rat yields insights into mammalian evolution
    Gibbs, RA
    Weinstock, GM
    Metzker, ML
    Muzny, DM
    Sodergren, EJ
    Scherer, S
    Scott, G
    Steffen, D
    Worley, KC
    Burch, PE
    Okwuonu, G
    Hines, S
    Lewis, L
    DeRamo, C
    Delgado, O
    Dugan-Rocha, S
    Miner, G
    Morgan, M
    Hawes, A
    Gill, R
    Holt, RA
    Adams, MD
    Amanatides, PG
    Baden-Tillson, H
    Barnstead, M
    Chin, S
    Evans, CA
    Ferriera, S
    Fosler, C
    Glodek, A
    Gu, ZP
    Jennings, D
    Kraft, CL
    Nguyen, T
    Pfannkoch, CM
    Sitter, C
    Sutton, GG
    Venter, JC
    Woodage, T
    Smith, D
    Lee, HM
    Gustafson, E
    Cahill, P
    Kana, A
    Doucette-Stamm, L
    Weinstock, K
    Fechtel, K
    Weiss, RB
    Dunn, DM
    Green, ED
    [J]. NATURE, 2004, 428 (6982) : 493 - 521