Semantic similarity analysis of protein data: assessment with biological features and issues

被引:141
作者
Guzzi, Pietro H. [1 ]
Mina, Marco [2 ]
Guerra, Concettina [2 ,3 ,4 ,5 ]
Cannataro, Mario [1 ]
机构
[1] Magna Graecia Univ Catanzaro, Dept Med & Surg Sci, I-88100 Catanzaro, Italy
[2] Univ Padua, Dept Informat Engn, I-35100 Padua, Italy
[3] Georgia Inst Technol, Coll Comp, Atlanta, GA 30332 USA
[4] Univ Rome, Rome, Italy
[5] Purdue Univ, W Lafayette, IN 47907 USA
关键词
Semantic similarity measures; protein data; biological features; GENE ONTOLOGY; FUNCTIONAL SIMILARITY; EXPRESSION; TECHNOLOGIES; PREDICTION; ALGORITHM; NETWORKS; MODULES; TERMS; TOOL;
D O I
10.1093/bib/bbr066
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The integration of proteomics data with biological knowledge is a recent trend in bioinformatics. A lot of biological information is available and is spread on different sources and encoded in different ontologies (e.g. Gene Ontology). Annotating existing protein data with biological information may enable the use (and the development) of algorithms that use biological ontologies as framework to mine annotated data. Recently many methodologies and algorithms that use ontologies to extract knowledge from data, as well as to analyse ontologies themselves have been proposed and applied to other fields. Conversely, the use of such annotations for the analysis of protein data is a relatively novel research area that is currently becoming more and more central in research. Existing approaches span from the definition of the similarity among genes and proteins on the basis of the annotating terms, to the definition of novel algorithms that use such similarities for mining protein data on a proteome-wide scale. This work, after the definition of main concept of such analysis, presents a systematic discussion and comparison of main approaches. Finally, remaining challenges, as well as possible future directions of research are presented.
引用
收藏
页码:569 / 585
页数:17
相关论文
共 64 条
[1]   Functionally guided alignment of protein interaction networks for module detection [J].
Ali, Waqar ;
Deane, Charlotte M. .
BIOINFORMATICS, 2009, 25 (23) :3166-3173
[2]  
[Anonymous], 2008 IEEE S COMP COM
[3]  
[Anonymous], VISUALIZING GO DISTA
[4]  
[Anonymous], INFORM THEORETIC DEF
[5]  
[Anonymous], 2005, P ISMB 2005 SIG M BI
[6]  
[Anonymous], 1995, arXiv
[7]  
[Anonymous], THESIS LISBONA U
[8]  
[Anonymous], INT JOINT C BIOINF S
[9]  
[Anonymous], BIOINFOMATICS
[10]  
[Anonymous], INT C RES COMP LING