Manipulation of FASTQ data with Galaxy

被引:491
作者
Blankenberg, Daniel [3 ]
Gordon, Assaf [4 ]
Von Kuster, Gregory [3 ]
Coraor, Nathan [3 ]
Taylor, James [1 ,2 ]
Nekrutenko, Anton [3 ]
机构
[1] Emory Univ, Dept Biol, Atlanta, GA 30322 USA
[2] Emory Univ, Dept Math & Comp Sci, Atlanta, GA 30322 USA
[3] Penn State Univ, Huck Inst Life Sci, University Pk, PA 16803 USA
[4] Cold Spring Harbor Lab, Howard Hughes Med Inst, Watson Sch Biol Sci, Cold Spring Harbor, NY 11724 USA
基金
美国国家科学基金会;
关键词
D O I
10.1093/bioinformatics/btq281
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Here, we describe a tool suite that functions on all of the commonly known FASTQ format variants and provides a pipeline for manipulating next generation sequencing data taken from a sequencing machine all the way through the quality filtering steps.
引用
收藏
页码:1783 / 1785
页数:3
相关论文
共 4 条
[1]   A framework for collaborative analysis of ENCODE data: Making large-scale analyses biologist-friendly [J].
Blankenberg, Daniel ;
Taylor, James ;
Schenck, Ian ;
He, Jianbin ;
Zhang, Yi ;
Ghent, Matthew ;
Veeraraghavan, Narayanan ;
Albert, Istvan ;
Miller, Webb ;
Makova, Kateryna D. ;
Hardison, Ross C. ;
Nekrutenko, Anton .
GENOME RESEARCH, 2007, 17 (06) :960-964
[2]  
Blankenberg Daniel, 2010, Curr Protoc Mol Biol, VChapter 19, DOI 10.1002/0471142727.mb1910s89
[3]   The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants [J].
Cock, Peter J. A. ;
Fields, Christopher J. ;
Goto, Naohisa ;
Heuer, Michael L. ;
Rice, Peter M. .
NUCLEIC ACIDS RESEARCH, 2010, 38 (06) :1767-1771
[4]  
Taylor James, 2007, Curr Protoc Bioinformatics, VChapter 10, DOI 10.1002/0471250953.bi1005s19