Flexible expressed region analysis for RNA-seq with derfinder

被引:36
作者
Collado-Torres, Leonardo [1 ,2 ,3 ]
Nellore, Abhinav [1 ,2 ,4 ]
Frazee, Alyssa C. [1 ,2 ]
Wilks, Christopher [2 ,4 ]
Love, Michael I. [5 ,6 ]
Langmead, Ben [1 ,2 ,4 ]
Irizarry, Rafael A. [5 ,6 ]
Leek, Jeffrey T. [1 ,2 ]
Jaffe, Andrew E. [1 ,2 ,3 ,7 ]
机构
[1] Johns Hopkins Bloomberg Sch Publ Hlth, Dept Biostat, Baltimore, MD 21205 USA
[2] Johns Hopkins Univ, Ctr Computat Biol, Baltimore, MD 21205 USA
[3] Lieber Inst Brain Dev, Johns Hopkins Med Campus, Baltimore, MD 21205 USA
[4] Johns Hopkins Univ, Dept Comp Sci, Baltimore, MD 21218 USA
[5] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[6] Harvard Univ, Dana Farber Canc Inst, Boston, MA 02215 USA
[7] Johns Hopkins Univ, Dept Mental Hlth, Baltimore, MD 21205 USA
关键词
DIFFERENTIAL EXPRESSION; GENE; TRANSCRIPTOME; RECONSTRUCTION; CONTRIBUTES;
D O I
10.1093/nar/gkw852
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
070307 [化学生物学]; 071010 [生物化学与分子生物学];
摘要
Differential expression analysis of RNA sequencing (RNA-seq) data typically relies on reconstructing transcripts or counting reads that overlap known gene structures. We previously introduced an intermediate statistical approach called differentially expressed region (DER) finder that seeks to identify contiguous regions of the genome showing differential expression signal at single base resolution without relying on existing annotation or potentially inaccurate transcript assembly. We present the derfinder software that improves our annotation-agnostic approach to RNA-seq analysis by: (i) implementing a computationally efficient bump-hunting approach to identify DERs that permits genome-scale analyses in a large number of samples, (ii) introducing a flexible statistical modeling framework, including multi-group and timecourse analyses and (iii) introducing a new set of data visualizations for expressed region analysis. We apply this approach to public RNA-seq data from the Genotype-Tissue Expression (GTEx) project and BrainSpan project to show that derfinder permits the analysis of hundreds of samples at base resolution in R, identifies expression outside of known gene boundaries and can be used to visualize expressed regions at base-resolution. In simulations, our base resolution approaches enable discovery in the presence of incomplete annotation and is nearly as powerful as feature-level methods when the annotation is complete. derfinder analysis using expressed region-level and single base-level approaches provides a compromise between full transcript reconstruction and feature-level analysis.
引用
收藏
页数:13
相关论文
共 47 条
[1]
Differential expression analysis for sequence count data [J].
Anders, Simon ;
Huber, Wolfgang .
GENOME BIOLOGY, 2010, 11 (10)
[2]
HTSeq-a Python']Python framework to work with high-throughput sequencing data [J].
Anders, Simon ;
Pyl, Paul Theodor ;
Huber, Wolfgang .
BIOINFORMATICS, 2015, 31 (02) :166-169
[3]
The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans [J].
Ardlie, Kristin G. ;
DeLuca, David S. ;
Segre, Ayellet V. ;
Sullivan, Timothy J. ;
Young, Taylor R. ;
Gelfand, Ellen T. ;
Trowbridge, Casandra A. ;
Maller, Julian B. ;
Tukiainen, Taru ;
Lek, Monkol ;
Ward, Lucas D. ;
Kheradpour, Pouya ;
Iriarte, Benjamin ;
Meng, Yan ;
Palmer, Cameron D. ;
Esko, Tonu ;
Winckler, Wendy ;
Hirschhorn, Joel N. ;
Kellis, Manolis ;
MacArthur, Daniel G. ;
Getz, Gad ;
Shabalin, Andrey A. ;
Li, Gen ;
Zhou, Yi-Hui ;
Nobel, Andrew B. ;
Rusyn, Ivan ;
Wright, Fred A. ;
Lappalainen, Tuuli ;
Ferreira, Pedro G. ;
Ongen, Halit ;
Rivas, Manuel A. ;
Battle, Alexis ;
Mostafavi, Sara ;
Monlong, Jean ;
Sammeth, Michael ;
Mele, Marta ;
Reverter, Ferran ;
Goldmann, Jakob M. ;
Koller, Daphne ;
Guigo, Roderic ;
McCarthy, Mark I. ;
Dermitzakis, Emmanouil T. ;
Gamazon, Eric R. ;
Im, Hae Kyung ;
Konkashbaev, Anuar ;
Nicolae, Dan L. ;
Cox, Nancy J. ;
Flutre, Timothee ;
Wen, Xiaoquan ;
Stephens, Matthew .
SCIENCE, 2015, 348 (6235) :648-660
[4]
BrainSpan Atlas of the Developing Human Brain, 2011, 1RC2MH08992101 ARRA
[5]
Local control of striatal dopamine release [J].
Cachope, Roger ;
Cheer, Joseph F. .
FRONTIERS IN BEHAVIORAL NEUROSCIENCE, 2014, 8
[6]
Collado-Torres L., 2016, F1000Research, V4, P1, DOI [10.12688/f1000research.6379.2, DOI 10.12688/F1000RESEARCH.6379.2]
[7]
Ensembl 2015 [J].
Cunningham, Fiona ;
Amode, M. Ridwan ;
Barrell, Daniel ;
Beal, Kathryn ;
Billis, Konstantinos ;
Brent, Simon ;
Carvalho-Silva, Denise ;
Clapham, Peter ;
Coates, Guy ;
Fitzgerald, Stephen ;
Gil, Laurent ;
Giron, Carlos Garcia ;
Gordon, Leo ;
Hourlier, Thibaut ;
Hunt, Sarah E. ;
Janacek, Sophie H. ;
Johnson, Nathan ;
Juettemann, Thomas ;
Kaehaeri, Andreas K. ;
Keenan, Stephen ;
Martin, Fergal J. ;
Maurel, Thomas ;
McLaren, William ;
Murphy, Daniel N. ;
Nag, Rishi ;
Overduin, Bert ;
Parker, Anne ;
Patricio, Mateus ;
Perry, Emily ;
Pignatelli, Miguel ;
Riat, Harpreet Singh ;
Sheppard, Daniel ;
Taylor, Kieron ;
Thormann, Anja ;
Vullo, Alessandro ;
Wilder, Steven P. ;
Zadissa, Amonida ;
Aken, Bronwen L. ;
Birney, Ewan ;
Harrow, Jennifer ;
Kinsella, Rhoda ;
Muffato, Matthieu ;
Ruffier, Magali ;
Searle, Stephen M. J. ;
Spudich, Giulietta ;
Trevanion, Stephen J. ;
Yates, Andy ;
Zerbino, Daniel R. ;
Flicek, Paul .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D662-D669
[8]
The Drosophila melanogaster transcriptome by paired-end RNA sequencing [J].
Daines, Bryce ;
Wang, Hui ;
Wang, Liguo ;
Li, Yumei ;
Han, Yi ;
Emmert, David ;
Gelbart, William ;
Wang, Xia ;
Li, Wei ;
Gibbs, Richard ;
Chen, Rui .
GENOME RESEARCH, 2011, 21 (02) :315-324
[9]
mRNA expression, splicing and editing in the embryonic and adult mouse cerebral cortex [J].
Dillman, Allissa A. ;
Hauser, David N. ;
Gibbs, J. Raphael ;
Nalls, Michael A. ;
McCoy, Melissa K. ;
Rudenko, Iakov N. ;
Galter, Dagmar ;
Cookson, Mark R. .
NATURE NEUROSCIENCE, 2013, 16 (04) :499-U178
[10]
Aneuploidy as a mechanism for stress-induced liver adaptation [J].
Duncan, Andrew W. ;
Newell, Amy E. Hanlon ;
Bi, Weimin ;
Finegold, Milton J. ;
Olson, Susan B. ;
Beaudet, Arthur L. ;
Grompe, Markus .
JOURNAL OF CLINICAL INVESTIGATION, 2012, 122 (09) :3307-3315