featureCounts: an efficient general purpose program for assigning sequence reads to genomic features

被引:17747
作者
Liao, Yang [1 ,2 ]
Smyth, Gordon K. [1 ,3 ]
Shi, Wei [1 ,2 ]
机构
[1] Walter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia
[2] Univ Melbourne, Dept Comp & Informat Syst, Melbourne, Vic 3010, Australia
[3] Univ Melbourne, Dept Math & Stat, Parkville, Vic 3010, Australia
基金
英国医学研究理事会;
关键词
DIFFERENTIAL EXPRESSION ANALYSIS; ACCURATE; ALIGNMENT; QUANTIFICATION; BIOCONDUCTOR; BINDING;
D O I
10.1093/bioinformatics/btt656
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Motivation: Next-generation sequencing technologies generate millions of short sequence reads, which are usually aligned to a reference genome. In many applications, the key information required for downstream analysis is the number of reads mapping to each genomic feature, for example to each exon or each gene. The process of counting reads is called read summarization. Read summarization is required for a great variety of genomic analyses but has so far received relatively little attention in the literature. Results: We present featureCounts, a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments. featureCounts implements highly efficient chromosome hashing and feature blocking techniques. It is considerably faster than existing methods (by an order of magnitude for gene-level summarization) and requires far less computer memory. It works with either single or paired-end reads and provides a wide range of options appropriate for different sequencing applications.
引用
收藏
页码:923 / 930
页数:8
相关论文
共 48 条
[1]
Aboyoun P. H., 2013, GENOMICRANGES REPRES
[2]
Anders S, 2013, HTSEQ ANAL HIGH THRO
[3]
Differential expression analysis for sequence count data [J].
Anders, Simon ;
Huber, Wolfgang .
GENOME BIOLOGY, 2010, 11 (10)
[4]
Count-based differential expression analysis of RNA sequencing data using R and Bioconductor [J].
Anders, Simon ;
McCarthy, Davis J. ;
Chen, Yunshun ;
Okoniewski, Michal ;
Smyth, Gordon K. ;
Huber, Wolfgang ;
Robinson, Mark D. .
NATURE PROTOCOLS, 2013, 8 (09) :1765-1786
[5]
Detecting differential usage of exons from RNA-seq data [J].
Anders, Simon ;
Reyes, Alejandro ;
Huber, Wolfgang .
GENOME RESEARCH, 2012, 22 (10) :2008-2017
[6]
[Anonymous], 2013, Voom! Precision weights unlock linear model analysis tools for RNAseq read counts
[7]
A Two-Stage Poisson Model for Testing RNA-Seq Data [J].
Auer, Paul L. ;
Doerge, Rebecca W. .
STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2011, 10 (01)
[8]
Genome-wide hydroxymethylation tested using the HELP-GT assay shows redistribution in cancer [J].
Bhattacharyya, Sanchari ;
Yu, Yiting ;
Suzuki, Masako ;
Campbell, Nathaniel ;
Mazdo, Jozef ;
Vasanthakumar, Aparna ;
Bhagat, Tushar D. ;
Nischal, Sangeeta ;
Christopeit, Maximilian ;
Parekh, Samir ;
Steidl, Ulrich ;
Godley, Lucy ;
Maitra, Anirban ;
Greally, John M. ;
Verma, Amit .
NUCLEIC ACIDS RESEARCH, 2013, 41 (16) :e157
[9]
Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species [J].
Bradnam, Keith R. ;
Fass, Joseph N. ;
Alexandrov, Anton ;
Baranay, Paul ;
Bechner, Michael ;
Birol, Inanc ;
Boisvert, Sebastien ;
Chapman, Jarrod A. ;
Chapuis, Guillaume ;
Chikhi, Rayan ;
Chitsaz, Hamidreza ;
Chou, Wen-Chi ;
Corbeil, Jacques ;
Del Fabbro, Cristian ;
Docking, T. Roderick ;
Durbin, Richard ;
Earl, Dent ;
Emrich, Scott ;
Fedotov, Pavel ;
Fonseca, Nuno A. ;
Ganapathy, Ganeshkumar ;
Gibbs, Richard A. ;
Gnerre, Sante ;
Godzaridis, Elenie ;
Goldstein, Steve ;
Haimel, Matthias ;
Hall, Giles ;
Haussler, David ;
Hiatt, Joseph B. ;
Ho, Isaac Y. ;
Howard, Jason ;
Hunt, Martin ;
Jackman, Shaun D. ;
Jaffe, David B. ;
Jarvis, Erich D. ;
Jiang, Huaiyang ;
Kazakov, Sergey ;
Kersey, Paul J. ;
Kitzman, Jacob O. ;
Knight, James R. ;
Koren, Sergey ;
Lam, Tak-Wah ;
Lavenier, Dominique ;
Laviolette, Francois ;
Li, Yingrui ;
Li, Zhenyu ;
Liu, Binghang ;
Liu, Yue ;
Luo, Ruibang ;
MacCallum, Iain .
GIGASCIENCE, 2013, 2
[10]
Ensembl 2012 [J].
Flicek, Paul ;
Amode, M. Ridwan ;
Barrell, Daniel ;
Beal, Kathryn ;
Brent, Simon ;
Carvalho-Silva, Denise ;
Clapham, Peter ;
Coates, Guy ;
Fairley, Susan ;
Fitzgerald, Stephen ;
Gil, Laurent ;
Gordon, Leo ;
Hendrix, Maurice ;
Hourlier, Thibaut ;
Johnson, Nathan ;
Kaehaeri, Andreas K. ;
Keefe, Damian ;
Keenan, Stephen ;
Kinsella, Rhoda ;
Komorowska, Monika ;
Koscielny, Gautier ;
Kulesha, Eugene ;
Larsson, Pontus ;
Longden, Ian ;
McLaren, William ;
Muffato, Matthieu ;
Overduin, Bert ;
Pignatelli, Miguel ;
Pritchard, Bethan ;
Riat, Harpreet Singh ;
Ritchie, Graham R. S. ;
Ruffier, Magali ;
Schuster, Michael ;
Sobral, Daniel ;
Tang, Y. Amy ;
Taylor, Kieron ;
Trevanion, Stephen ;
Vandrovcova, Jana ;
White, Simon ;
Wilson, Mark ;
Wilder, Steven P. ;
Aken, Bronwen L. ;
Birney, Ewan ;
Cunningham, Fiona ;
Dunham, Ian ;
Durbin, Richard ;
Fernandez-Suarez, Xose M. ;
Harrow, Jennifer ;
Herrero, Javier ;
Hubbard, Tim J. P. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D84-D90