Identifying differentially expressed transcripts from RNA-seq data with biological variation

被引:144
作者
Glaus, Peter [1 ]
Honkela, Antti [2 ]
Rattray, Magnus [3 ,4 ]
机构
[1] Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, England
[2] Univ Helsinki, Dept Comp Sci, Helsinki Inst Informat Technol HIIT, FI-00014 Helsinki, Finland
[3] Univ Sheffield, Dept Comp Sci, Sheffield S10 2HQ, S Yorkshire, England
[4] Univ Sheffield, Sheffield Inst Translat Neurosci, Sheffield S10 2HQ, S Yorkshire, England
基金
英国生物技术与生命科学研究理事会; 芬兰科学院;
关键词
REPRODUCIBILITY; INFERENCE;
D O I
10.1093/bioinformatics/bts260
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Motivation: High-throughput sequencing enables expression analysis at the level of individual transcripts. The analysis of transcriptome expression levels and differential expression (DE) estimation requires a probabilistic approach to properly account for ambiguity caused by shared exons and finite read sampling as well as the intrinsic biological variance of transcript expression. Results: We present Bayesian inference of transcripts from sequencing data (BitSeq), a Bayesian approach for estimation of transcript expression level from RNA-seq experiments. Inferred relative expression is represented by Markov chain Monte Carlo samples from the posterior probability distribution of a generative model of the read data. We propose a novel method for DE analysis across replicates which propagates uncertainty from the sample-level model while modelling biological variance using an expression-level-dependent prior. We demonstrate the advantages of our method using simulated data as well as an RNA-seq dataset with technical and biological replication for both studied conditions.
引用
收藏
页码:1721 / 1728
页数:8
相关论文
共 31 条
[1]
Differential expression analysis for sequence count data [J].
Anders, Simon ;
Huber, Wolfgang .
GENOME BIOLOGY, 2010, 11 (10)
[2]
[Anonymous], 2003, Bayesian Data Analysis
[3]
LOWESS - A PROGRAM FOR SMOOTHING SCATTERPLOTS BY ROBUST LOCALLY WEIGHTED REGRESSION [J].
CLEVELAND, WS .
AMERICAN STATISTICIAN, 1981, 35 (01) :54-54
[4]
Stem cell transcriptome profiling via massive-scale mRNA sequencing [J].
Cloonan, Nicole ;
Forrest, Alistair R. R. ;
Kolle, Gabriel ;
Gardiner, Brooke B. A. ;
Faulkner, Geoffrey J. ;
Brown, Mellissa K. ;
Taylor, Darrin F. ;
Steptoe, Anita L. ;
Wani, Shivangi ;
Bethel, Graeme ;
Robertson, Alan J. ;
Perkins, Andrew C. ;
Bruce, Stephen J. ;
Lee, Clarence C. ;
Ranade, Swati S. ;
Peckham, Heather E. ;
Manning, Jonathan M. ;
McKernan, Kevin J. ;
Grimmond, Sean M. .
NATURE METHODS, 2008, 5 (07) :613-619
[5]
Substantial biases in ultra-short read data sets from high-throughput DNA sequencing [J].
Dohm, Juliane C. ;
Lottaz, Claudio ;
Borodina, Tatiana ;
Himmelbauer, Heinz .
NUCLEIC ACIDS RESEARCH, 2008, 36 (16)
[6]
Geman D., 1993, Journal of Applied Statistics, V20, P25, DOI [10.1080/02664769300000058, DOI 10.1080/02664769300000058]
[7]
The developmental transcriptome of Drosophila melanogaster [J].
Graveley, Brenton R. ;
Brooks, Angela N. ;
Carlson, JosephW. ;
Duff, Michael O. ;
Landolin, Jane M. ;
Yang, Li ;
Artieri, Carlo G. ;
van Baren, Marijke J. ;
Boley, Nathan ;
Booth, Benjamin W. ;
Brown, James B. ;
Cherbas, Lucy ;
Davis, Carrie A. ;
Dobin, Alex ;
Li, Renhua ;
Lin, Wei ;
Malone, John H. ;
Mattiuzzo, Nicolas R. ;
Miller, David ;
Sturgill, David ;
Tuch, Brian B. ;
Zaleski, Chris ;
Zhang, Dayu ;
Blanchette, Marco ;
Dudoit, Sandrine ;
Eads, Brian ;
Green, Richard E. ;
Hammonds, Ann ;
Jiang, Lichun ;
Kapranov, Phil ;
Langton, Laura ;
Perrimon, Norbert ;
Sandler, Jeremy E. ;
Wan, Kenneth H. ;
Willingham, Aarron ;
Zhang, Yu ;
Zou, Yi ;
Andrews, Justen ;
Bickel, Peter J. ;
Brenner, Steven E. ;
Brent, Michael R. ;
Cherbas, Peter ;
Gingeras, Thomas R. ;
Hoskins, Roger A. ;
Kaufman, Thomas C. ;
Oliver, Brian ;
Celniker, Susan E. .
NATURE, 2011, 471 (7339) :473-479
[8]
Finding scientific topics [J].
Griffiths, TL ;
Steyvers, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 :5228-5235
[9]
baySeq: Empirical Bayesian methods for identifying differential expression in sequence count data [J].
Hardcastle, Thomas J. ;
Kelly, Krystyna A. .
BMC BIOINFORMATICS, 2010, 11
[10]
Katz Y, 2010, NAT METHODS, V7, P1009, DOI [10.1038/nmeth.1528, 10.1038/NMETH.1528]