MIReNA: finding microRNAs with high accuracy and no learning at genome scale and from deep sequencing data

Cited by: 108
Authors
Mathelier, Anthony [1, 2]
Carbone, Alessandra [1, 2]
Affiliations
[1] Univ Paris 06, FRE3214, F-75006 Paris, France
[2] CNRS, FRE3214, Lab Genom Microorganismes, F-75006 Paris, France
Keywords
SECONDARY STRUCTURE; IDENTIFICATION; PRECURSORS; CLASSIFICATION; PREDICTION; CONSERVATION; COMPLEXITY; ALGORITHM; EVOLUTION; MIRNAS;
DOI
10.1093/bioinformatics/btq329
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification codes
071010; 081704;
Abstract
Motivation: MicroRNAs (miRNAs) are a class of endogenous small RNAs that are derived from a precursor (pre-miRNA) and are involved in post-transcriptional regulation. Experimental identification of novel miRNAs is difficult because they are often transcribed only under specific conditions and in specific cell types. Several computational methods have been developed to detect new miRNAs, starting from known ones or from deep sequencing data, and to validate their pre-miRNAs.
Results: We present a genome-wide search algorithm, called MIReNA, that looks for miRNA sequences by exploring a multidimensional space defined by only five (physical and combinatorial) parameters characterizing acceptable pre-miRNAs. MIReNA validates pre-miRNAs with high sensitivity and specificity, and detects new miRNAs by homology from known miRNAs or from deep sequencing data. We compared the performance of MIReNA with that of four available predictive systems. The MIReNA approach is strikingly simple, yet it proves at least as powerful as more sophisticated algorithmic methods. MIReNA obtains better results than three known algorithms that validate pre-miRNAs, which demonstrates that machine learning is not a necessary algorithmic approach for the computational validation of pre-miRNAs. In particular, machine-learning algorithms can only confirm pre-miRNAs that resemble known ones, which is a limitation when exploring species with no known pre-miRNAs. The possibility of adapting the search to specific species, possibly characterized by specific properties of their miRNAs and pre-miRNAs, is a major feature of MIReNA. Adjusting its parameters calibrates specificity and sensitivity, a key feature for predictive systems that is not present in machine-learning approaches. A comparison of MIReNA with miRDeep on deep sequencing data highlights the highly specific predictive power of MIReNA.
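The abstract's point that a parameter adjustment calibrates sensitivity and specificity can be illustrated with a minimal sketch. It is not MIReNA's implementation: the two features (`mfe_index`, `unpaired_fraction`), the threshold values, and the toy candidates are all hypothetical stand-ins, since the abstract does not name the five parameters. The sketch only shows how a learning-free threshold filter trades one measure against the other.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """Hypothetical hairpin candidate; the features are illustrative only."""
    mfe_index: float          # folding-energy index (more negative = more stable)
    unpaired_fraction: float  # fraction of unpaired nucleotides in the duplex
    is_true_pre_mirna: bool   # ground-truth label, used only for evaluation


def accept(c, max_mfe_index, max_unpaired):
    """Keep a candidate only if every feature satisfies its threshold."""
    return c.mfe_index <= max_mfe_index and c.unpaired_fraction <= max_unpaired


def sensitivity_specificity(candidates, max_mfe_index, max_unpaired):
    """Return (sensitivity, specificity) of the threshold filter."""
    tp = fn = tn = fp = 0
    for c in candidates:
        predicted = accept(c, max_mfe_index, max_unpaired)
        if c.is_true_pre_mirna:
            tp += predicted
            fn += not predicted
        else:
            fp += predicted
            tn += not predicted
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return sensitivity, specificity


# Made-up data: three true precursors and three decoys.
TOY_CANDIDATES = [
    Candidate(-1.0, 0.10, True),
    Candidate(-0.9, 0.20, True),
    Candidate(-0.7, 0.30, True),
    Candidate(-0.8, 0.25, False),
    Candidate(-0.5, 0.40, False),
    Candidate(-0.3, 0.50, False),
]

if __name__ == "__main__":
    # Arbitrary threshold pairs, chosen only to show the trade-off.
    for thresholds in [(-0.85, 0.26), (-0.6, 0.35)]:
        print(thresholds, sensitivity_specificity(TOY_CANDIDATES, *thresholds))
```

On this toy data, the stricter threshold pair rejects every decoy but misses one true precursor, while the looser pair recovers all true precursors at the cost of accepting one decoy.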
Pages: 2226-2234
Page count: 9