Mining Mammalian Transcript Data for Functional Long Non-Coding RNAs

被引:126
作者
Khachane, Amit N. [1 ]
Harrison, Paul M. [1 ]
机构
[1] McGill Univ, Dept Biol, Montreal, PQ H3A 1B1, Canada
来源
PLOS ONE | 2010年 / 5卷 / 04期
基金
加拿大自然科学与工程研究理事会;
关键词
FATE DETERMINATION FACTOR; ACTIVATED-RECEPTOR-DELTA; E-2 SIGNALING PATHWAYS; BREAST-CANCER; TUMOR-SUPPRESSOR; MESSENGER-RNA; IN-VIVO; GENE; GROWTH; EXPRESSION;
D O I
10.1371/journal.pone.0010316
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The role of long non-coding RNAs (lncRNAs) in controlling gene expression has garnered increased interest in recent years. Sequencing projects, such as Fantom3 for mouse and H-InvDB for human, have generated abundant data on transcribed components of mammalian cells, the majority of which appear not to be protein-coding. However, much of the non-protein-coding transcriptome could merely be a consequence of 'transcription noise'. It is therefore essential to use bioinformatic approaches to identify the likely functional candidates in a high throughput manner. Principal Findings: We derived a scheme for classifying and annotating likely functional lncRNAs in mammals. Using the available experimental full-length cDNA data sets for human and mouse, we identified 78 lncRNAs that are either syntenically conserved between human and mouse, or that originate from the same protein-coding genes. Of these, 11 have significant sequence homology. We found that these lncRNAs exhibit: (i) patterns of codon substitution typical of non-coding transcripts; (ii) preservation of sequences in distant mammals such as dog and cow, (iii) significant sequence conservation relative to their corresponding flanking regions (in 50% cases, flanking regions do not have homology at all; and in the remaining, the degree of conservation is significantly less); (iv) existence mostly as single-exon forms (8/11); and, (v) presence of conserved and stable secondary structure motifs within them. We further identified orthologous protein-coding genes that are contributing to the pool of lncRNAs; of which, genes implicated in carcinogenesis are significantly over-represented. Conclusion: Our comparative mammalian genomics approach coupled with evolutionary analysis identified a small population of conserved long non-protein-coding RNAs (lncRNAs) that are potentially functional across Mammalia. Additionally, our analysis indicates that amongst the orthologous protein-coding genes that produce lncRNAs, those implicated in cancer pathogenesis are significantly over-represented, suggesting that these lncRNAs could play an important role in cancer pathomechanisms.
引用
收藏
页数:8
相关论文
共 54 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]  
[Anonymous], 2009, NATURE, V457, P1028
[3]   CGMIM: Automated text-mining of Online Mendelian Inheritance in Man (OMIM) to identify genetically-associated cancers and candidate genes [J].
Bajdik, CD ;
Kuo, B ;
Rusaw, S ;
Jones, S ;
Brooks-Wilson, A .
BMC BIOINFORMATICS, 2005, 6 (1)
[4]   Structures of regulatory elements in mRNAs [J].
Batey, Robert T. .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2006, 16 (03) :299-306
[5]   Role for a bidentate ribonuclease in the initiation step of RNA interference [J].
Bernstein, E ;
Caudy, AA ;
Hammond, SM ;
Hannon, GJ .
NATURE, 2001, 409 (6818) :363-366
[6]   Mechanisms of alternative pre-messenger RNA splicing [J].
Black, DL .
ANNUAL REVIEW OF BIOCHEMISTRY, 2003, 72 :291-336
[7]   Intragenic breakpoint within RAD51L1 in a t(6;14)(p21.3;q24) of a pulmonary chondroid hamartoma [J].
Blank, C ;
Schoenmakers, EFPM ;
Rogalla, P ;
Huys, EHLPG ;
van Rijk, AAF ;
Drieschner, N ;
Bullerdiek, J .
CYTOGENETICS AND CELL GENETICS, 2001, 95 (1-2) :17-19
[8]   Waste not, want not - transcript excess in multicellular eukaryotes [J].
Brosius, J .
TRENDS IN GENETICS, 2005, 21 (05) :287-288
[9]   The transcripts of SFRP1, CEP63 and EIF4G2 genes are frequently downregulated in transitional cell carcinomas of the bladder [J].
Buim, ME ;
Soares, FA ;
Sarkis, AS ;
Nagai, MA .
ONCOLOGY, 2005, 69 (06) :445-454
[10]   Up-regulation of dicer, a component of the microRNA machinery, in prostate adenocarcinoma [J].
Chiosea, Simion ;
Jelezcova, Elena ;
Chandran, Uma ;
Acquafondata, Marie ;
McHale, Teresa ;
Sobol, Robert W. ;
Dhir, Rajiv .
AMERICAN JOURNAL OF PATHOLOGY, 2006, 169 (05) :1812-1820