Genome assembly reborn: recent computational challenges

被引:201
作者
Pop, Mihai [1 ]
机构
[1] Univ Maryland, Dept Comp Sci, Ctr Bioinformat & Computat Biol, College Pk, MD 20742 USA
基金
美国国家科学基金会;
关键词
genome assembly; genome sequencing; next generation sequencing technologies; SEQUENCING TECHNOLOGY; MICROBIAL GENOMES; REPEAT REGIONS; DNA-SEQUENCES; SHOTGUN; TOOL; VALIDATION; ALGORITHMS; GENERATION; ACCURACY;
D O I
10.1093/bib/bbp026
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Research into genome assembly algorithms has experienced a resurgence due to new challenges created by the development of next generation sequencing technologies. Several genome assemblers have been published in recent years specifically targeted at the new sequence data; however, the ever-changing technological landscape leads to the need for continued research. In addition, the low cost of next generation sequencing data has led to an increased use of sequencing in new settings. For example, the new field of metagenomics relies on large-scale sequencing of entire microbial communities instead of isolate genomes, leading to new computational challenges. In this article, we outline the major algorithmic approaches for genome assembly and describe recent developments in this domain.
引用
收藏
页码:354 / 366
页数:13
相关论文
共 73 条
  • [61] STRATEGY OF DNA SEQUENCING EMPLOYING COMPUTER-PROGRAMS
    STADEN, R
    [J]. NUCLEIC ACIDS RESEARCH, 1979, 6 (07) : 2601 - 2610
  • [62] Whole-Genome Sequencing and Assembly with High-Throughput, Short-Read Technologies
    Sundquist, Andreas
    Ronaghi, Mostafa
    Tang, Haixu
    Pevzner, Pavel
    Batzoglou, Serafim
    [J]. PLOS ONE, 2007, 2 (05):
  • [63] Separation of nearly identical repeats in shotgun assemblies using defined nucleotide positions, DNPs
    Tammi, MT
    Arner, E
    Britton, T
    Andersson, B
    [J]. BIOINFORMATICS, 2002, 18 (03) : 379 - 388
  • [64] Community structure and metabolism through reconstruction of microbial genomes from the environment
    Tyson, GW
    Chapman, J
    Hugenholtz, P
    Allen, EE
    Ram, RJ
    Richardson, PM
    Solovyev, VV
    Rubin, EM
    Rokhsar, DS
    Banfield, JF
    [J]. NATURE, 2004, 428 (6978) : 37 - 43
  • [65] Environmental genome shotgun sequencing of the Sargasso Sea
    Venter, JC
    Remington, K
    Heidelberg, JF
    Halpern, AL
    Rusch, D
    Eisen, JA
    Wu, DY
    Paulsen, I
    Nelson, KE
    Nelson, W
    Fouts, DE
    Levy, S
    Knap, AH
    Lomas, MW
    Nealson, K
    White, O
    Peterson, J
    Hoffman, J
    Parsons, R
    Baden-Tillson, H
    Pfannkoch, C
    Rogers, YH
    Smith, HO
    [J]. SCIENCE, 2004, 304 (5667) : 66 - 74
  • [66] The sequence of the human genome
    Venter, JC
    Adams, MD
    Myers, EW
    Li, PW
    Mural, RJ
    Sutton, GG
    Smith, HO
    Yandell, M
    Evans, CA
    Holt, RA
    Gocayne, JD
    Amanatides, P
    Ballew, RM
    Huson, DH
    Wortman, JR
    Zhang, Q
    Kodira, CD
    Zheng, XQH
    Chen, L
    Skupski, M
    Subramanian, G
    Thomas, PD
    Zhang, JH
    Miklos, GLG
    Nelson, C
    Broder, S
    Clark, AG
    Nadeau, C
    McKusick, VA
    Zinder, N
    Levine, AJ
    Roberts, RJ
    Simon, M
    Slayman, C
    Hunkapiller, M
    Bolanos, R
    Delcher, A
    Dew, I
    Fasulo, D
    Flanigan, M
    Florea, L
    Halpern, A
    Hannenhalli, S
    Kravitz, S
    Levy, S
    Mobarry, C
    Reinert, K
    Remington, K
    Abu-Threideh, J
    Beasley, E
    [J]. SCIENCE, 2001, 291 (5507) : 1304 - +
  • [67] Assembly of polymorphic genomes:: Algorithms and application to Ciona savignyi
    Vinson, JP
    Jaffe, DB
    O'Neill, K
    Karlsson, EK
    Stange-Thomann, N
    Anderson, S
    Mesirov, JP
    Satoh, N
    Satou, Y
    Nusbaum, C
    Birren, B
    Galagan, JE
    Lander, ES
    [J]. GENOME RESEARCH, 2005, 15 (08) : 1127 - 1135
  • [68] The diploid genome sequence of an Asian individual
    Wang, Jun
    Wang, Wei
    Li, Ruiqiang
    Li, Yingrui
    Tian, Geng
    Goodman, Laurie
    Fan, Wei
    Zhang, Junqing
    Li, Jun
    Zhang, Juanbin
    Guo, Yiran
    Feng, Binxiao
    Li, Heng
    Lu, Yao
    Fang, Xiaodong
    Liang, Huiqing
    Du, Zhenglin
    Li, Dong
    Zhao, Yiqing
    Hu, Yujie
    Yang, Zhenzhen
    Zheng, Hancheng
    Hellmann, Ines
    Inouye, Michael
    Pool, John
    Yi, Xin
    Zhao, Jing
    Duan, Jinjie
    Zhou, Yan
    Qin, Junjie
    Ma, Lijia
    Li, Guoqing
    Yang, Zhentao
    Zhang, Guojie
    Yang, Bin
    Yu, Chang
    Liang, Fang
    Li, Wenjie
    Li, Shaochuan
    Li, Dawei
    Ni, Peixiang
    Ruan, Jue
    Li, Qibin
    Zhu, Hongmei
    Liu, Dongyuan
    Lu, Zhike
    Li, Ning
    Guo, Guangwu
    Zhang, Jianguo
    Ye, Jia
    [J]. NATURE, 2008, 456 (7218) : 60 - U1
  • [69] The complete genome of an individual by massively parallel DNA sequencing
    Wheeler, David A.
    Srinivasan, Maithreyan
    Egholm, Michael
    Shen, Yufeng
    Chen, Lei
    McGuire, Amy
    He, Wen
    Chen, Yi-Ju
    Makhijani, Vinod
    Roth, G. Thomas
    Gomes, Xavier
    Tartaro, Karrie
    Niazi, Faheem
    Turcotte, Cynthia L.
    Irzyk, Gerard P.
    Lupski, James R.
    Chinault, Craig
    Song, Xing-zhi
    Liu, Yue
    Yuan, Ye
    Nazareth, Lynne
    Qin, Xiang
    Muzny, Donna M.
    Margulies, Marcel
    Weinstock, George M.
    Gibbs, Richard A.
    Rothberg, Jonathan M.
    [J]. NATURE, 2008, 452 (7189) : 872 - U5
  • [70] Globally Distributed Uncultivated Oceanic N2-Fixing Cyanobacteria Lack Oxygenic Photosystem II
    Zehr, Jonathan P.
    Bench, Shellie R.
    Carter, Brandon J.
    Hewson, Ian
    Niazi, Faheem
    Shi, Tuo
    Tripp, H. James
    Affourtit, Jason P.
    [J]. SCIENCE, 2008, 322 (5904) : 1110 - 1112