Prediction of new associations between ncRNAs and diseases exploiting multi-type hierarchical clustering

被引:21
作者
Barracchia, Emanuele Pio [1 ]
Pio, Gianvito [1 ]
D'Elia, Domenica [3 ]
Ceci, Michelangelo [1 ,2 ,4 ]
机构
[1] Univ Bari Aldo Moro, Dept Comp Sci, Via Orabona 4, I-70125 Bari, Italy
[2] Natl Interuniv Consortium Informat CINI, Big Data Lab, I-00185 Rome, Italy
[3] CNR, Inst Biomed Technol, I-70126 Bari, Italy
[4] Jozef Stefan Inst, Dept Knowledge Technol, Jamova 39, Ljubljana 1000, Slovenia
关键词
Non-coding RNA (ncRNAs); Diseases; Cancer; Heterogeneous network; Clustering; Link prediction; DNA ELEMENTS; MICRORNAS; ENCYCLOPEDIA;
D O I
10.1186/s12859-020-3392-2
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
BackgroundThe study of functional associations between ncRNAs and human diseases is a pivotal task of modern research to develop new and more effective therapeutic approaches. Nevertheless, it is not a trivial task since it involves entities of different types, such as microRNAs, lncRNAs or target genes whose expression also depends on endogenous or exogenous factors. Such a complexity can be faced by representing the involved biological entities and their relationships as a network and by exploiting network-based computational approaches able to identify new associations. However, existing methods are limited to homogeneous networks (i.e., consisting of only one type of objects and relationships) or can exploit only a small subset of the features of biological entities, such as the presence of a particular binding domain, enzymatic properties or their involvement in specific diseases.ResultsTo overcome the limitations of existing approaches, we propose the system LP-HCLUS, which exploits a multi-type hierarchical clustering method to predict possibly unknown ncRNA-disease relationships. In particular, LP-HCLUS analyzes heterogeneous networks consisting of several types of objects and relationships, each possibly described by a set of features, and extracts multi-type clusters that are subsequently exploited to predict new ncRNA-disease associations. The extracted clusters are overlapping, hierarchically organized, involve entities of different types, and allow LP-HCLUS to catch multiple roles of ncRNAs in diseases at different levels of granularity. Our experimental evaluation, performed on heterogeneous attributed networks consisting of microRNAs, lncRNAs, diseases, genes and their known relationships, shows that LP-HCLUS is able to obtain better results with respect to existing approaches. The biological relevance of the obtained results was evaluated according to both quantitative (i.e., TPR@k, Areas Under the TPR@k, ROC and Precision-Recall curves) and qualitative (i.e., according to the consultation of the existing literature) criteria.ConclusionsThe obtained results prove the utility of LP-HCLUS to conduct robust predictive studies on the biological role of ncRNAs in human diseases. The produced predictions can therefore be reliably considered as new, previously unknown, relationships among ncRNAs and diseases.
引用
收藏
页数:24
相关论文
共 43 条
[1]
Long Noncoding RNA: Genome Organization and Mechanism of Action [J].
Akhade, Vijay Suresh ;
Pal, Debosree ;
Kanduri, Chandrasekhar .
LONG NON CODING RNA BIOLOGY, 2017, 1008 :47-74
[2]
ncPred: ncRNA-disease association prediction through tripartite network-based inference [J].
Alaimo, Salvatore ;
Giugno, Rosalba ;
Pulvirenti, Alfredo .
Frontiers in Bioengineering and Biotechnology, 2014, 2 (DEC)
[3]
[Anonymous], BIOINFORMATICS
[4]
[Anonymous], 2017, ACM COMPUT SURV, DOI DOI 10.1145/3012704
[5]
miRNA sponges: soaking up miRNAs for regulation of gene expression [J].
Bak, Rasmus O. ;
Mikkelsen, Jacob Giehm .
WILEY INTERDISCIPLINARY REVIEWS-RNA, 2014, 5 (03) :317-333
[6]
DisGeNET: a Cytoscape plugin to visualize, integrate, search and analyze gene-disease networks [J].
Bauer-Mehren, Anna ;
Rautschka, Michael ;
Sanz, Ferran ;
Furlong, Laura I. .
BIOINFORMATICS, 2010, 26 (22) :2924-2926
[7]
Benson DA, 2017, RES, V45, P42, DOI [10.1093/nar/gks1195, DOI 10.1093/NAR/GKS1195]
[8]
Blockeel H., 1998, Machine Learning. Proceedings of the Fifteenth International Conference (ICML'98), P55
[9]
Arena-Idb: a platform to build human non-coding RNA interaction networks [J].
Bonnici, Vincenzo ;
De Caro, Giorgio ;
Constantino, Giorgio ;
Liuni, Sabino ;
D'Elia, Domenica ;
Bombieri, Nicola ;
Licciulli, Flavio ;
Giugno, Rosalba .
BMC BIOINFORMATICS, 2018, 19 :231-244
[10]
The Noncoding RNA Revolution-Trashing Old Rules to Forge New Ones [J].
Cech, Thomas R. ;
Steitz, Joan A. .
CELL, 2014, 157 (01) :77-94