Computational and analytical framework for small RNA profiling by high-throughput sequencing

被引:95
作者
Fahlgren, Noah [1 ,2 ]
Sullivan, Christopher M. [1 ,2 ]
Kasschau, Kristin D. [1 ,2 ]
Chapman, Elisabeth J. [1 ,2 ]
Cumbie, Jason S. [1 ,2 ]
Montgomery, Taiowa A. [1 ,2 ]
Gilbert, Sunny D. [1 ,2 ]
Dasenko, Mark [1 ]
Backman, Tyler W. H. [1 ,2 ]
Givan, Scott A. [1 ,2 ]
Carrington, James C. [1 ,2 ]
机构
[1] Oregon State Univ, Ctr Genome Res & Biocomp, Corvallis, OR 97331 USA
[2] Oregon State Univ, Dept Bot & Plant Pathol, Corvallis, OR 97331 USA
关键词
small RNA; sequencing-by-synthesis; CASHX; SAM-seq; oligoribonucleotide standards; statistical methods; ENDOGENOUS SIRNAS; MICRORNAS; PATHWAY; 21U-RNAS; IDENTIFICATION; BIOGENESIS; METABOLISM; EXPRESSION; EVOLUTION; PROTEINS;
D O I
10.1261/rna.1473809
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The advent of high-throughput sequencing (HTS) methods has enabled direct approaches to quantitatively profile small RNA populations. However, these methods have been limited by several factors, including representational artifacts and lack of established statistical methods of analysis. Furthermore, massive HTS data sets present new problems related to data processing and mapping to a reference genome. Here, we show that cluster-based sequencing-by-synthesis technology is highly reproducible as a quantitative profiling tool for several classes of small RNA from Arabidopsis thaliana. We introduce the use of synthetic RNA oligoribonucleotide standards to facilitate objective normalization between HTS data sets, and adapt microarray-type methods for statistical analysis of multiple samples. These methods were tested successfully using mutants with small RNA biogenesis (miRNA-defective dcl1 mutant and siRNA-defective dcl2 dcl3 dcl4 triple mutant) or effector protein (ago1 mutant) deficiencies. Computational methods were also developed to rapidly and accurately parse, quantify, and map small RNA data.
引用
收藏
页码:992 / 1002
页数:11
相关论文
共 61 条
[1]   Evolution of microRNA genes by inverted duplication of target gene sequences in Arabidopsis thaliana [J].
Allen, E ;
Xie, ZX ;
Gustafson, AM ;
Sung, GH ;
Spatafora, JW ;
Carrington, JC .
NATURE GENETICS, 2004, 36 (12) :1282-1290
[2]  
Allison D.B., 2006, DNA MICROARRAYS RELA
[3]   Microarray data analysis: from disarray to consolidation and consensus [J].
Allison, DB ;
Cui, XQ ;
Page, GP ;
Sabripour, M .
NATURE REVIEWS GENETICS, 2006, 7 (01) :55-65
[4]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[5]   A uniform system for microRNA annotation [J].
Ambros, V ;
Bartel, B ;
Bartel, DP ;
Burge, CB ;
Carrington, JC ;
Chen, XM ;
Dreyfuss, G ;
Eddy, SR ;
Griffiths-Jones, S ;
Marshall, M ;
Matzke, M ;
Ruvkun, G ;
Tuschl, T .
RNA, 2003, 9 (03) :277-279
[6]  
[Anonymous], 2006, Introduction to Linear Regression Analysis
[7]   A two-hit trigger for siRNA biogenesis in plants [J].
Axtell, Michael J. ;
Jan, Calvin ;
Rajagopalan, Ramya ;
Bartel, David P. .
CELL, 2006, 127 (03) :565-577
[8]   PRG-1 and 21U-RNAs interact to form the piRNA complex required for fertility in C-elegans [J].
Batista, Pedro J. ;
Ruby, J. Graham ;
Claycomb, Julie M. ;
Chiang, Rosaria ;
Fahlgren, Noah ;
Kasschau, Kristin D. ;
Chaves, Daniel A. ;
Gu, Weifeng ;
Vasale, Jessica J. ;
Duan, Shenghua ;
Conte, Darryl, Jr. ;
Luo, Shujun ;
Schroth, Gary P. ;
Carrington, James C. ;
Bartel, David P. ;
Mello, Craig C. .
MOLECULAR CELL, 2008, 31 (01) :67-78
[9]   Whole-genome re-sequencing [J].
Bentley, David R. .
CURRENT OPINION IN GENETICS & DEVELOPMENT, 2006, 16 (06) :545-552
[10]  
Brennecke J, 2007, CELL, V128, P1089, DOI 10.1016/j.cell.2007.01.043