Motivation: A large choice of tools exists for many standard tasks in the analysis of high-throughput sequencing (HTS) data. However, once a project deviates from standard workflows, custom scripts are needed. Results: We present HTSeq, a Python library to facilitate the rapid development of such scripts. HTSeq offers parsers for many common data formats in HTS projects, as well as classes to represent data, such as genomic coordinates, sequences, sequencing reads, alignments, gene model information and variant calls, and provides data structures that allow for querying via genomic coordinates. We also present htseq-count, a tool developed with HTSeq that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes.
引用
收藏
页码:166 / 169
页数:4
相关论文
共 14 条
[1]
Beazley DM, 1996, PROCEEDINGS OF THE FOURTH ANNUAL TCL/TK WORKSHOP, P129
机构:
Walter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia
Univ Melbourne, Dept Comp & Informat Syst, Melbourne, Vic 3010, AustraliaWalter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia
Liao, Yang
;
Smyth, Gordon K.
论文数: 0引用数: 0
h-index: 0
机构:
Walter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia
Univ Melbourne, Dept Math & Stat, Parkville, Vic 3010, AustraliaWalter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia
Smyth, Gordon K.
;
Shi, Wei
论文数: 0引用数: 0
h-index: 0
机构:
Walter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia
Univ Melbourne, Dept Comp & Informat Syst, Melbourne, Vic 3010, AustraliaWalter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia
机构:
Walter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia
Univ Melbourne, Dept Comp & Informat Syst, Melbourne, Vic 3010, AustraliaWalter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia
Liao, Yang
;
Smyth, Gordon K.
论文数: 0引用数: 0
h-index: 0
机构:
Walter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia
Univ Melbourne, Dept Math & Stat, Parkville, Vic 3010, AustraliaWalter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia
Smyth, Gordon K.
;
Shi, Wei
论文数: 0引用数: 0
h-index: 0
机构:
Walter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia
Univ Melbourne, Dept Comp & Informat Syst, Melbourne, Vic 3010, AustraliaWalter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia