LeARN: a platform for detecting, clustering and annotating non-coding RNAs

Cited by: 9
Authors
Noirot, Celine [1, 2]
Gaspin, Christine [2]
Schiex, Thomas [2]
Gouzy, Jerome [1]
Affiliations
[1] INRA, CNR, Lab Interact Plantes Micro Organismes, UMR 441, F-31320 Castanet Tolosan, France
[2] INRA, Unite Biometrie Intelligence Artificielle, UR 875, F-31320 Castanet Tolosan, France
DOI
10.1186/1471-2105-9-21
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: In the last decade, sequencing projects have led to the development of a number of annotation systems dedicated to the structural and functional annotation of protein-coding genes. These systems handle the annotation of non-protein-coding genes (ncRNAs) only crudely: they allow neither the editing of secondary structures nor the clustering of ncRNA genes into families, both of which are crucial for appropriate annotation of these molecules. Results: LeARN is a flexible software package that handles the complete process of ncRNA annotation by integrating the layers of automatic detection and human curation. Conclusion: This software provides the infrastructure to deal properly with ncRNAs in the framework of any annotation project. It fills the gap between existing prediction software, which detects independent ncRNA occurrences, and public ncRNA repositories, which do not offer the flexibility and interactivity required for annotation projects. The software is freely available from the download section of the website http://bioinfo.genopole-toulouse.prd.fr/LeARN.
Pages: 11