BS Seeker: precise mapping for bisulfite sequencing

被引:161
作者
Chen, Pao-Yang [1 ]
Cokus, Shawn J. [1 ]
Pellegrini, Matteo [1 ]
机构
[1] Univ Calif Los Angeles, Dept Mol Cell & Dev Biol, Los Angeles, CA 90024 USA
来源
BMC BIOINFORMATICS | 2010年 / 11卷
关键词
DNA METHYLATION; ALIGNMENT; MAPS;
D O I
10.1186/1471-2105-11-203
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Bisulfite sequencing using next generation sequencers yields genome-wide measurements of DNA methylation at single nucleotide resolution. Traditional aligners are not designed for mapping bisulfite-treated reads, where the unmethylated Cs are converted to Ts. We have developed BS Seeker, an approach that converts the genome to a three-letter alphabet and uses Bowtie to align bisulfite-treated reads to a reference genome. It uses sequence tags to reduce mapping ambiguity. Post-processing of the alignments removes non-unique and low-quality mappings. Results: We tested our aligner on synthetic data, a bisulfite-converted Arabidopsis library, and human libraries generated from two different experimental protocols. We evaluated the performance of our approach and compared it to other bisulfite aligners. The results demonstrate that among the aligners tested, BS Seeker is more versatile and faster. When mapping to the human genome, BS Seeker generates alignments significantly faster than RMAP and BSMAP. Furthermore, BS Seeker is the only alignment tool that can explicitly account for tags which are generated by certain library construction protocols. Conclusions: BS Seeker provides fast and accurate mapping of bisulfite-converted reads. It can work with BS reads generated from the two different experimental protocols, and is able to efficiently map reads to large mammalian genomes. The Python program is freely available at http://pellegrini.mcdb.ucla.edu/BS_Seeker/BS_Seeker.html.
引用
收藏
页数:6
相关论文
共 14 条
[1]   Shotgun bisulphite sequencing of the Arabidopsis genome reveals DNA methylation patterning [J].
Cokus, Shawn J. ;
Feng, Suhua ;
Zhang, Xiaoyu ;
Chen, Zugen ;
Merriman, Barry ;
Haudenschild, Christian D. ;
Pradhan, Sriharsa ;
Nelson, Stanley F. ;
Pellegrini, Matteo ;
Jacobsen, Steven E. .
NATURE, 2008, 452 (7184) :215-219
[2]   Substantial biases in ultra-short read data sets from high-throughput DNA sequencing [J].
Dohm, Juliane C. ;
Lottaz, Claudio ;
Borodina, Tatiana ;
Himmelbauer, Heinz .
NUCLEIC ACIDS RESEARCH, 2008, 36 (16)
[3]   A GENOMIC SEQUENCING PROTOCOL THAT YIELDS A POSITIVE DISPLAY OF 5-METHYLCYTOSINE RESIDUES IN INDIVIDUAL DNA STRANDS [J].
FROMMER, M ;
MCDONALD, LE ;
MILLAR, DS ;
COLLIS, CM ;
WATT, F ;
GRIGG, GW ;
MOLLOY, PL ;
PAUL, CL .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (05) :1827-1831
[4]   High definition profiling of mammalian DNA methylation by array capture and single molecule bisulfite sequencing [J].
Hodges, Emily ;
Smith, Andrew D. ;
Kendall, Jude ;
Xuan, Zhenyu ;
Ravi, Kandasamy ;
Rooks, Michelle ;
Zhang, Michael Q. ;
Ye, Kenny ;
Bhattacharjee, Arindam ;
Brizuela, Leonardo ;
McCombie, W. Richard ;
Wigler, Michael ;
Hannon, Gregory J. ;
Hicks, James B. .
GENOME RESEARCH, 2009, 19 (09) :1593-1605
[5]  
Kent WJ, 2002, GENOME RES, V12, P656, DOI [10.1101/gr.229202, 10.1101/gr.229202. Article published online before March 2002]
[6]   Ultrafast and memory-efficient alignment of short DNA sequences to the human genome [J].
Langmead, Ben ;
Trapnell, Cole ;
Pop, Mihai ;
Salzberg, Steven L. .
GENOME BIOLOGY, 2009, 10 (03)
[7]   Mapping short DNA sequencing reads and calling variants using mapping quality scores [J].
Li, Heng ;
Ruan, Jue ;
Durbin, Richard .
GENOME RESEARCH, 2008, 18 (11) :1851-1858
[8]   SOAP: short oligonucleotide alignment program [J].
Li, Ruiqiang ;
Li, Yingrui ;
Kristiansen, Karsten ;
Wang, Jun .
BIOINFORMATICS, 2008, 24 (05) :713-714
[9]   Highly integrated single-base resolution maps of the epigenome in Arabidopsis [J].
Lister, Ryan ;
O'Malley, Ronan C. ;
Tonti-Filippini, Julian ;
Gregory, Brian D. ;
Berry, Charles C. ;
Millar, A. Harvey ;
Ecker, Joseph R. .
CELL, 2008, 133 (03) :523-536
[10]   Human DNA methylomes at base resolution show widespread epigenomic differences [J].
Lister, Ryan ;
Pelizzola, Mattia ;
Dowen, Robert H. ;
Hawkins, R. David ;
Hon, Gary ;
Tonti-Filippini, Julian ;
Nery, Joseph R. ;
Lee, Leonard ;
Ye, Zhen ;
Ngo, Que-Minh ;
Edsall, Lee ;
Antosiewicz-Bourget, Jessica ;
Stewart, Ron ;
Ruotti, Victor ;
Millar, A. Harvey ;
Thomson, James A. ;
Ren, Bing ;
Ecker, Joseph R. .
NATURE, 2009, 462 (7271) :315-322