SAMMate: a GUI tool for processing short read alignments in SAM/BAM format

Cited by: 45
Authors
Xu, Guorong [1]
Deng, Nan [1]
Zhao, Zhiyu [1]
Judeh, Thair [1]
Flemington, Erik [2, 3]
Zhu, Dongxiao [1, 2, 3, 4]
Affiliations
[1] Univ New Orleans, Dept Comp Sci, 2000 Lakeshore Dr, New Orleans, LA 70148 USA
[2] Tulane Canc Ctr, New Orleans, LA 70112 USA
[3] Tulane Hlth Sci Ctr, New Orleans, LA 70112 USA
[4] Childrens Hosp, Res Inst Children, New Orleans, LA 70118 USA
Source
SOURCE CODE FOR BIOLOGY AND MEDICINE | 2011 / Vol. 6 / Issue 01
DOI
10.1186/1751-0473-6-2
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Background: Next Generation Sequencing (NGS) technology generates tens of millions of short reads for each DNA/RNA sample. A key step in NGS data analysis is the short read alignment of the generated sequences to a reference genome. Although storing alignment information in the Sequence Alignment/Map (SAM) or Binary SAM (BAM) format is now standard, biomedical researchers still have difficulty accessing this information.
Results: We have developed a Graphical User Interface (GUI) software tool named SAMMate. SAMMate allows biomedical researchers to quickly process SAM/BAM files and is compatible with both single-end and paired-end sequencing technologies. SAMMate also automates some standard procedures in DNA-seq and RNA-seq data analysis. Using either standard or customized annotation files, SAMMate allows users to accurately calculate the short read coverage of genomic intervals. In particular, for RNA-seq data SAMMate can accurately calculate the gene expression abundance scores for customized genomic intervals using short reads originating from both exons and exon-exon junctions. Furthermore, SAMMate can quickly calculate a whole-genome signal map at basewise resolution allowing researchers to solve an array of bioinformatics problems. Finally, SAMMate can export both a wiggle file for alignment visualization in the UCSC genome browser and an alignment statistics report. The biological impact of these features is demonstrated via several case studies that predict miRNA targets using short read alignment information files.
Conclusions: With just a few mouse clicks, SAMMate will provide biomedical researchers easy access to important alignment information stored in SAM/BAM files. Our software is constantly updated and will greatly facilitate the downstream analysis of NGS data. Both the source code and the GUI executable are freely available under the GNU General Public License at http://sammate.sourceforge.net.
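The abstract mentions three computations: basewise coverage, coverage of genomic intervals, and wiggle export. A minimal Python sketch of what these might look like on plain-text SAM records follows. It is an illustration only, not SAMMate's implementation. The function names (`aligned_blocks`, `basewise_coverage`, `interval_coverage`, `to_wiggle`) are hypothetical, intervals are assumed to be 0-based and half-open, and the choice to leave deleted or spliced-out (`D`/`N`) reference bases uncounted is an assumption made here, not something the abstract specifies.

```python
import re
from collections import defaultdict

# One CIGAR operation: a length followed by an operation code.
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")

def aligned_blocks(pos, cigar):
    """Yield 0-based half-open (start, end) reference blocks matched by a read."""
    ref = pos - 1  # SAM POS is 1-based
    for length, op in CIGAR_OP.findall(cigar):
        length = int(length)
        if op in "M=X":        # aligned bases: covered, consume reference
            yield ref, ref + length
            ref += length
        elif op in "DN":       # deletion / skipped region (e.g. intron): consume only
            ref += length
        # I, S, H, P do not consume reference bases

def basewise_coverage(sam_lines):
    """Return {chrom: {0-based position: depth}} from SAM text lines."""
    cov = defaultdict(lambda: defaultdict(int))
    for line in sam_lines:
        if line.startswith("@"):           # header line
            continue
        fields = line.rstrip("\n").split("\t")
        flag, rname, pos, cigar = int(fields[1]), fields[2], int(fields[3]), fields[5]
        if flag & 0x4 or cigar == "*":     # unmapped read or no alignment
            continue
        for start, end in aligned_blocks(pos, cigar):
            for i in range(start, end):
                cov[rname][i] += 1
    return cov

def interval_coverage(cov, chrom, start, end):
    """Mean depth over the 0-based half-open interval [start, end)."""
    depth = cov.get(chrom, {})
    return sum(depth.get(i, 0) for i in range(start, end)) / (end - start)

def to_wiggle(cov, name="coverage"):
    """Render coverage as variableStep wiggle text (1-based positions)."""
    lines = [f'track type=wiggle_0 name="{name}"']
    for chrom in sorted(cov):
        lines.append(f"variableStep chrom={chrom}")
        for i in sorted(cov[chrom]):
            lines.append(f"{i + 1} {cov[chrom][i]}")
    return "\n".join(lines)

if __name__ == "__main__":
    demo = [
        "r1\t0\tchr1\t1\t60\t4M\t*\t0\t0\tACGT\tIIII",
        "r2\t0\tchr1\t3\t60\t2M3N2M\t*\t0\t0\tACGT\tIIII",  # junction-spanning read
    ]
    coverage = basewise_coverage(demo)
    print(interval_coverage(coverage, "chr1", 0, 4))  # 1.5
    print(to_wiggle(coverage))
```

The second demo read uses an `N` operation, so its two aligned blocks contribute to the exons on either side of the junction while the skipped bases receive no coverage. A per-position dictionary keeps the sketch short; a whole-genome signal map would more plausibly use a compact array per chromosome.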
Pages: 11