Statistical inferences for isoform expression in RNA-Seq

被引:319
作者
Jiang, Hui [2 ]
Wong, Wing Hung [1 ]
机构
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[2] Stanford Univ, Inst Computat & Math Engn, Stanford, CA 94305 USA
关键词
HUMAN TRANSCRIPTOME; GENOME; RESOLUTION; ARRAYS;
D O I
10.1093/bioinformatics/btp113
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The development of RNA sequencing (RNA-Seq) makes it possible for us to measure transcription at an unprecedented precision and throughput. However, challenges remain in understanding the source and distribution of the reads, modeling the transcript abundance and developing efficient computational methods. In this article, we develop a method to deal with the isoform expression estimation problem. The count of reads falling into a locus on the genome annotated with multiple isoforms is modeled as a Poisson variable. The expression of each individual isoform is estimated by solving a convex optimization problem and statistical inferences about the parameters are obtained from the posterior distribution by importance sampling. Our results show that isoform expression inference in RNA-Seq is possible by employing appropriate statistical methods.
引用
收藏
页码:1026 / 1032
页数:7
相关论文
共 16 条
[1]  
[Anonymous], 2002, Monte Carlo strategies in scientific computing
[2]   Stem cell transcriptome profiling via massive-scale mRNA sequencing [J].
Cloonan, Nicole ;
Forrest, Alistair R. R. ;
Kolle, Gabriel ;
Gardiner, Brooke B. A. ;
Faulkner, Geoffrey J. ;
Brown, Mellissa K. ;
Taylor, Darrin F. ;
Steptoe, Anita L. ;
Wani, Shivangi ;
Bethel, Graeme ;
Robertson, Alan J. ;
Perkins, Andrew C. ;
Bruce, Stephen J. ;
Lee, Clarence C. ;
Ranade, Swati S. ;
Peckham, Heather E. ;
Manning, Jonathan M. ;
McKernan, Kevin J. ;
Grimmond, Sean M. .
NATURE METHODS, 2008, 5 (07) :613-619
[3]   An integrated software system for analyzing ChIP-chip and ChIP-seq data [J].
Ji, Hongkai ;
Jiang, Hui ;
Ma, Wenxiu ;
Johnson, David S. ;
Myers, Richard M. ;
Wong, Wing H. .
NATURE BIOTECHNOLOGY, 2008, 26 (11) :1293-1300
[4]   SeqMap: mapping massive amount of oligonucleotides to the genome [J].
Jiang, Hui ;
Wong, Wing Hung .
BIOINFORMATICS, 2008, 24 (20) :2395-2396
[5]   Cross-hybridization modeling on Affymetrix exon arrays [J].
Kapur, Karen ;
Jiang, Hui ;
Xing, Yi ;
Wong, Wing Hung .
BIOINFORMATICS, 2008, 24 (24) :2887-2893
[6]   The UCSC Genome Browser Database: 2008 update [J].
Karolchik, D. ;
Kuhn, R. M. ;
Baertsch, R. ;
Barber, G. P. ;
Clawson, H. ;
Diekhans, M. ;
Giardine, B. ;
Harte, R. A. ;
Hinrichs, A. S. ;
Hsu, F. ;
Kober, K. M. ;
Miller, W. ;
Pedersen, J. S. ;
Pohl, A. ;
Raney, B. J. ;
Rhead, B. ;
Rosenbloom, K. R. ;
Smith, K. E. ;
Stanke, M. ;
Thakkapallayil, A. ;
Trumbower, H. ;
Wang, T. ;
Zweig, A. S. ;
Haussler, D. ;
Kent, W. J. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D773-D779
[7]   Highly integrated single-base resolution maps of the epigenome in Arabidopsis [J].
Lister, Ryan ;
O'Malley, Ronan C. ;
Tonti-Filippini, Julian ;
Gregory, Brian D. ;
Berry, Charles C. ;
Millar, A. Harvey ;
Ecker, Joseph R. .
CELL, 2008, 133 (03) :523-536
[8]   RNA-seq: An assessment of technical reproducibility and comparison with gene expression arrays [J].
Marioni, John C. ;
Mason, Christopher E. ;
Mane, Shrikant M. ;
Stephens, Matthew ;
Gilad, Yoav .
GENOME RESEARCH, 2008, 18 (09) :1509-1517
[9]   Mapping and quantifying mammalian transcriptomes by RNA-Seq [J].
Mortazavi, Ali ;
Williams, Brian A. ;
McCue, Kenneth ;
Schaeffer, Lorian ;
Wold, Barbara .
NATURE METHODS, 2008, 5 (07) :621-628
[10]   The transcriptional landscape of the yeast genome defined by RNA sequencing [J].
Nagalakshmi, Ugrappa ;
Wang, Zhong ;
Waern, Karl ;
Shou, Chong ;
Raha, Debasish ;
Gerstein, Mark ;
Snyder, Michael .
SCIENCE, 2008, 320 (5881) :1344-1349