A survey of sequence alignment algorithms for next-generation sequencing

被引:546
作者
Li, Heng [1 ]
Homer, Nils [2 ,3 ]
机构
[1] Broad Inst, Cambridge, MA 02142 USA
[2] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90024 USA
[3] Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90024 USA
关键词
new sequencing technologies; alignment algorithm; sequence analysis; LOCAL ALIGNMENT; READ ALIGNMENT; DNA-SEQUENCES; OLIGONUCLEOTIDES; ULTRAFAST; BLAST; TOOL;
D O I
10.1093/bib/bbq015
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Rapidly evolving sequencing technologies produce data on an unparalleled scale. A central challenge to the analysis of this data is sequence alignment, whereby sequence reads must be compared to a reference. A wide variety of alignment algorithms and software have been subsequently developed over the past two years. In this article, we will systematically review the current development of these algorithms and introduce their practical applications on different types of experimental data. We come to the conclusion that short-read alignment is no longer the bottleneck of data analyses. We also consider future development of alignment algorithms with respect to emerging long sequence reads and the prospect of cloud computing.
引用
收藏
页码:473 / 483
页数:11
相关论文
共 70 条
  • [1] Abouelhoda M. I., 2004, Journal of Discrete Algorithms, V2, P53, DOI 10.1016/S1570-8667(03)00065-0
  • [2] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [3] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [4] ReAligner: A program for refining DNA sequence multi-alignments
    Anson, EL
    Myers, EW
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 1997, 4 (03) : 369 - 383
  • [5] BAEZAYATES RA, 1992, LECT NOTES COMPUT SC, V644, P185
  • [6] THE SMALLEST AUTOMATION RECOGNIZING THE SUBWORDS OF A TEXT
    BLUMER, A
    BLUMER, J
    HAUSSLER, D
    EHRENFEUCHT, A
    CHEN, MT
    SEIFERAS, J
    [J]. THEORETICAL COMPUTER SCIENCE, 1985, 40 (01) : 31 - 55
  • [7] Burkhardt S., 2001, Combinatorial Pattern Matching. 12th Annual Symposium, CPM 2001. Proceedings (Lecture Notes in Computer Science Vol. 2089), P73
  • [8] Burrows M, 1994, BLOCK SORTING LOSSLE
  • [9] Cao X, 2005, LECT NOTES COMPUT SC, V3453, P4
  • [10] BamView: viewing mapped read alignment data in the context of the reference sequence
    Carver, Tim
    Bohme, Ulrike
    Otto, Thomas D.
    Parkhill, Julian
    Berriman, Matthew
    [J]. BIOINFORMATICS, 2010, 26 (05) : 676 - 677