SOAP2: an improved ultrafast tool for short read alignment

被引:2852
作者
Li, Ruiqiang [1 ,2 ]
Yu, Chang [1 ]
Li, Yingrui [1 ]
Lam, Tak-Wah [3 ]
Yiu, Siu-Ming [3 ]
Kristiansen, Karsten [2 ]
Wang, Jun [1 ,2 ]
机构
[1] Beijing Genom Inst Shenzhen, Shenzhen 518083, Peoples R China
[2] Univ So Denmark, Dept Biochem & Mol Biol, DK-5230 Odense M, Denmark
[3] Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
GENOME; DNA;
D O I
10.1093/bioinformatics/btp336
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
SOAP2 is a significantly improved version of the short oligonucleotide alignment program that both reduces computer memory usage and increases alignment speed at an unprecedented rate. We used a Burrows Wheeler Transformation (BWT) compression index to substitute the seed strategy for indexing the reference sequence in the main memory. We tested it on the whole human genome and found that this new algorithm reduced memory usage from 14.7 to 5.4GB and improved alignment speed by 20-30 times. SOAP2 is compatible with both single-and paired-end reads. Additionally, this tool now supports multiple text and compressed. le formats. A consensus builder has also been developed for consensus assembly and SNP detection from alignment of short reads on a reference genome.
引用
收藏
页码:1966 / 1967
页数:2
相关论文
共 6 条
[1]  
Burrow M., 1994, 124 DIG EQ CORP
[2]   Compressed indexing and local alignment of DNA [J].
Lam, T. W. ;
Sung, W. K. ;
Tam, S. L. ;
Wong, C. K. ;
Yiu, S. M. .
BIOINFORMATICS, 2008, 24 (06) :791-797
[3]   Ultrafast and memory-efficient alignment of short DNA sequences to the human genome [J].
Langmead, Ben ;
Trapnell, Cole ;
Pop, Mihai ;
Salzberg, Steven L. .
GENOME BIOLOGY, 2009, 10 (03)
[4]   Mapping short DNA sequencing reads and calling variants using mapping quality scores [J].
Li, Heng ;
Ruan, Jue ;
Durbin, Richard .
GENOME RESEARCH, 2008, 18 (11) :1851-1858
[5]   SOAP: short oligonucleotide alignment program [J].
Li, Ruiqiang ;
Li, Yingrui ;
Kristiansen, Karsten ;
Wang, Jun .
BIOINFORMATICS, 2008, 24 (05) :713-714
[6]   The diploid genome sequence of an Asian individual [J].
Wang, Jun ;
Wang, Wei ;
Li, Ruiqiang ;
Li, Yingrui ;
Tian, Geng ;
Goodman, Laurie ;
Fan, Wei ;
Zhang, Junqing ;
Li, Jun ;
Zhang, Juanbin ;
Guo, Yiran ;
Feng, Binxiao ;
Li, Heng ;
Lu, Yao ;
Fang, Xiaodong ;
Liang, Huiqing ;
Du, Zhenglin ;
Li, Dong ;
Zhao, Yiqing ;
Hu, Yujie ;
Yang, Zhenzhen ;
Zheng, Hancheng ;
Hellmann, Ines ;
Inouye, Michael ;
Pool, John ;
Yi, Xin ;
Zhao, Jing ;
Duan, Jinjie ;
Zhou, Yan ;
Qin, Junjie ;
Ma, Lijia ;
Li, Guoqing ;
Yang, Zhentao ;
Zhang, Guojie ;
Yang, Bin ;
Yu, Chang ;
Liang, Fang ;
Li, Wenjie ;
Li, Shaochuan ;
Li, Dawei ;
Ni, Peixiang ;
Ruan, Jue ;
Li, Qibin ;
Zhu, Hongmei ;
Liu, Dongyuan ;
Lu, Zhike ;
Li, Ning ;
Guo, Guangwu ;
Zhang, Jianguo ;
Ye, Jia .
NATURE, 2008, 456 (7218) :60-U1