CAP-miRSeq: a comprehensive analysis pipeline for microRNA sequencing data

被引:129
作者
Sun, Zhifu [1 ]
Evans, Jared [1 ]
Bhagwate, Aditya [1 ]
Middha, Sumit [1 ]
Bockol, Matthew [1 ]
Yan, Huihuang [1 ]
Kocher, Jean-Pierre [1 ]
机构
[1] Mayo Clin, Div Biomed Stat & Informat, Dept Hlth Sci Res, Rochester, MN 55905 USA
来源
BMC GENOMICS | 2014年 / 15卷
关键词
miRNA sequencing; Analysis pipeline; Differential expression; Variant detection; DIFFERENTIAL EXPRESSION ANALYSIS; IDENTIFICATION; BIOGENESIS; SOFTWARE; GENES;
D O I
10.1186/1471-2164-15-423
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: miRNAs play a key role in normal physiology and various diseases miRNA profiling through next generation sequencing (miRNA-seq) has become the main platform for biological research and biomarker discovery. However, analyzing miRNA sequencing data is challenging as it needs significant amount of computational resources and bioinformatics expertise. Several web based analytical tools have been developed but they are limited to processing one or a pair of samples at time and are not suitable for a large scale study. Lack of flexibility and reliability of these web applications are also common issues. Results: We des/eloped a Comprehensive. Analysis Pipeline for microRNA Sequencing data (CAP-miRSeq) that integrates read pre-processing, alignment, mature/precursor/novel miRNA detection and quantification, data visualization, variant detection in miRNA coding region, and more flexible differential expression analysis between experimental conditions. According to computational infrastructure, users can install the package locally or deploy it in Amazon Cloud to run samples sequentially or in parallel for a large number of samples for speedy analyses. In either case, summary and expression reports for all samples are generated for easier quality assessment and downstream analyses. Using well characterized data, we demonstrated the pipeline's superior performances, flexibility, and practical use in research and biomarker discovery. Conclusions: CAP-miRSeq is a powerful and flexible tool for users to process and analyze miRNA-seq data scalable from a few to hundreds of samples. The results are presented in the convenient way for investigators or analysts to conduct further investigation and discovery.
引用
收藏
页数:10
相关论文
共 27 条
[1]   miRDeep*: an integrated application tool for miRNA identification from RNA sequencing data [J].
An, Jiyuan ;
Lai, John ;
Lehman, Melanie L. ;
Nelson, Colleen C. .
NUCLEIC ACIDS RESEARCH, 2013, 41 (02) :727-737
[2]   Differential expression analysis for sequence count data [J].
Anders, Simon ;
Huber, Wolfgang .
GENOME BIOLOGY, 2010, 11 (10)
[3]   SomamiR: a database for somatic mutations impacting microRNA function in cancer [J].
Bhattacharya, Anindya ;
Ziebarth, Jesse D. ;
Cui, Yan .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D977-D982
[4]   RNA editing of human microRNAs [J].
Blow, Matthew J. ;
Grocock, Russell J. ;
van Dongen, Stijn ;
Enright, Anton J. ;
Dicks, Ed ;
Futreal, P. Andrew ;
Wooster, Richard ;
Stratton, Michael R. .
GENOME BIOLOGY, 2006, 7 (04)
[5]   MicroRNAs: Potential biomarkers for cancer diagnosis, prognosis and targets for therapy [J].
Cho, William C. S. .
INTERNATIONAL JOURNAL OF BIOCHEMISTRY & CELL BIOLOGY, 2010, 42 (08) :1273-1281
[6]   A Novel miRNA Processing Pathway Independent of Dicer Requires Argonaute2 Catalytic Activity [J].
Cifuentes, Daniel ;
Xue, Huiling ;
Taylor, David W. ;
Patnode, Heather ;
Mishima, Yuichiro ;
Cheloufi, Sihem ;
Ma, Enbo ;
Mane, Shrikant ;
Hannon, Gregory J. ;
Lawson, Nathan D. ;
Wolfe, Scot A. ;
Giraldez, Antonio J. .
SCIENCE, 2010, 328 (5986) :1694-1698
[7]   Meta-analysis of small RNA-sequencing errors reveals ubiquitous post-transcriptional RNA modifications [J].
Ebhardt, H. Alexander ;
Tsang, Herbert H. ;
Dai, Denny C. ;
Liu, Yifeng ;
Bostan, Babak ;
Fahlman, Richard P. .
NUCLEIC ACIDS RESEARCH, 2009, 37 (08) :2461-2470
[8]   miRDeep2 accurately identifies known and hundreds of novel microRNA genes in seven animal clades [J].
Friedlaender, Marc R. ;
Mackowiak, Sebastian D. ;
Li, Na ;
Chen, Wei ;
Rajewsky, Nikolaus .
NUCLEIC ACIDS RESEARCH, 2012, 40 (01) :37-52
[9]   Systematic comparison of microarray profiling, real-time PCR, and next-generation sequencing technologies for measuring differential microRNA expression [J].
Git, Anna ;
Dvinge, Heidi ;
Salmon-Divon, Mali ;
Osborne, Michelle ;
Kutter, Claudia ;
Hadfield, James ;
Bertone, Paul ;
Caldas, Carlos .
RNA, 2010, 16 (05) :991-1006
[10]   Genome-Wide Identification of SNPs in MicroRNA Genes and the SNP Effects on MicroRNA Target Binding and Biogenesis [J].
Gong, Jing ;
Tong, Yin ;
Zhang, Hong-Mei ;
Wang, Kai ;
Hu, Tao ;
Shan, Ge ;
Sun, Jun ;
Guo, An-Yuan .
HUMAN MUTATION, 2012, 33 (01) :254-263