High quality SNP calling using Illumina data at shallow coverage

被引:33
作者
Malhis, Nawar [1 ]
Jones, Steven J. M. [1 ]
机构
[1] BC Canc Agcy, Genome Sci Ctr, Vancouver, BC, Canada
关键词
SHORT READ ALIGNMENT; ULTRAFAST;
D O I
10.1093/bioinformatics/btq092
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Detection of single nucleotide polymorphisms (SNPs) has been a major application in processing second generation sequencing (SGS) data. In principle, SNPs are called on single base differences between a reference genome and a sequence generated from SGS short reads of a sample genome. However, this exercise is far from trivial; several parameters related to sequencing quality, and/or reference genome properties, play essential effect on the accuracy of called SNPs especially at shallow coverage data. In this work, we present Slider II, an alignment and SNP calling approach that demonstrates improved algorithmic approaches enabling larger number of called SNPs with lower false positive rate. In addition to the regular alignment and SNP calling, as an optional feature, Slider II is capable of utilizing information about known SNPs of a target genome, as priors, in the alignment and SNPs calling to enhance it's capability of detecting these known SNPs and novel SNPs and mutations in their vicinity.
引用
收藏
页码:1029 / 1035
页数:7
相关论文
共 10 条
[1]   Ultrafast and memory-efficient alignment of short DNA sequences to the human genome [J].
Langmead, Ben ;
Trapnell, Cole ;
Pop, Mihai ;
Salzberg, Steven L. .
GENOME BIOLOGY, 2009, 10 (03)
[2]   Mapping short DNA sequencing reads and calling variants using mapping quality scores [J].
Li, Heng ;
Ruan, Jue ;
Durbin, Richard .
GENOME RESEARCH, 2008, 18 (11) :1851-1858
[3]   Fast and accurate short read alignment with Burrows-Wheeler transform [J].
Li, Heng ;
Durbin, Richard .
BIOINFORMATICS, 2009, 25 (14) :1754-1760
[4]   SOAP2: an improved ultrafast tool for short read alignment [J].
Li, Ruiqiang ;
Yu, Chang ;
Li, Yingrui ;
Lam, Tak-Wah ;
Yiu, Siu-Ming ;
Kristiansen, Karsten ;
Wang, Jun .
BIOINFORMATICS, 2009, 25 (15) :1966-1967
[5]   ZOOM! Zillions of oligos mapped [J].
Lin, Hao ;
Zhang, Zefeng ;
Zhang, Michael Q. ;
Ma, Bin ;
Li, Ming .
BIOINFORMATICS, 2008, 24 (21) :2431-2437
[6]   Slider-maximum use of probability information for alignment of short sequence reads and SNP detection [J].
Malhis, Nawar ;
Butterfield, Yaron S. N. ;
Ester, Martin ;
Jones, Steven J. M. .
BIOINFORMATICS, 2009, 25 (01) :6-13
[7]   A general approach to single-nucleotide polymorphism discovery [J].
Marth, GT ;
Korf, I ;
Yandell, MD ;
Yeh, RT ;
Gu, ZJ ;
Zakeri, H ;
Stitziel, NO ;
Hillier, L ;
Kwok, PY ;
Gish, WR .
NATURE GENETICS, 1999, 23 (04) :452-456
[8]   High-throughput sequence alignment using Graphics Processing Units [J].
Schatz, Michael C. ;
Trapnell, Cole ;
Delcher, Arthur L. ;
Varshney, Amitabh .
BMC BIOINFORMATICS, 2007, 8
[9]   Mutational evolution in a lobular breast tumour profiled at single nucleotide resolution [J].
Shah, Sohrab P. ;
Morin, Ryan D. ;
Khattra, Jaswinder ;
Prentice, Leah ;
Pugh, Trevor ;
Burleigh, Angela ;
Delaney, Allen ;
Gelmon, Karen ;
Guliany, Ryan ;
Senz, Janine ;
Steidl, Christian ;
Holt, Robert A. ;
Jones, Steven ;
Sun, Mark ;
Leung, Gillian ;
Moore, Richard ;
Severson, Tesa ;
Taylor, Greg A. ;
Teschendorff, Andrew E. ;
Tse, Kane ;
Turashvili, Gulisa ;
Varhol, Richard ;
Warren, Rene L. ;
Watson, Peter ;
Zhao, Yongjun ;
Caldas, Carlos ;
Huntsman, David ;
Hirst, Martin ;
Marra, Marco A. ;
Aparicio, Samuel .
NATURE, 2009, 461 (7265) :809-U67
[10]   Using quality scores and longer reads improves accuracy of Solexa read mapping [J].
Smith, Andrew D. ;
Xuan, Zhenyu ;
Zhang, Michael Q. .
BMC BIOINFORMATICS, 2008, 9 (1)