Disjunctive shared information between ontology concepts: application to Gene Ontology

被引:41
作者
Couto, Francisco M. [1 ]
Silva, Mario J. [1 ]
机构
[1] Univ Lisbon, Fac Ciencias, Dept Informat, P-1749016 Lisbon, Portugal
来源
JOURNAL OF BIOMEDICAL SEMANTICS | 2011年 / 2卷
关键词
Gene Ontology; Direct Acyclic Graph; Semantic Similarity; Protein Pair; Ontology Concept;
D O I
10.1186/2041-1480-2-5
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: ontologies available motivates the application of similarity measures to compare ontology concepts or, by extension, the entities described therein. A common approach, known as semantic similarity, compares ontology concepts through the information content they share in the ontology. However, different disjunctive ancestors in the ontology are frequently neglected, or not properly explored, by semantic similarity measures. Results: This paper proposes a novel method, dubbed DiShIn, that effectively exploits the multiple inheritance relationships present in many biomedical ontologies. DiShIn calculates the shared information content of two ontology concepts, based on the information content of the disjunctive common ancestors of the concepts being compared. DiShIn identifies these disjunctive ancestors through the number of distinct paths from the concepts to their common ancestors. Conclusions: DiShIn was applied to Gene Ontology and its performance was evaluated against state-of-the-art measures using CESSM, a publicly available evaluation platform of protein similarity measures. By modifying the way traditional semantic similarity measures calculate the shared information content, DiShIn was able to obtain a statistically significant higher correlation between semantic and sequence similarity. Moreover, the incorporation of DiShIn in existing applications that exploit multiple inheritance would reduce their execution time.
引用
收藏
页数:16
相关论文
共 33 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   A shortest-path graph kernel for estimating gene product semantic similarity [J].
Alvarez, Marco A. ;
Qi, Xiaojun ;
Yan, Changhui .
JOURNAL OF BIOMEDICAL SEMANTICS, 2011, 2
[3]  
[Anonymous], 1997, P 10 RES COMP LING I
[4]  
[Anonymous], 1998, ICML
[5]   Understanding and using the meaning of statements in a bio-ontology: recasting the Gene Ontology in OWL [J].
Aranguren, Mikel Egana ;
Bechhofer, Sean ;
Lord, Phillip ;
Sattler, Ulrike ;
Stevens, Robert .
BMC BIOINFORMATICS, 2007, 8 (1)
[6]  
Azuaje F., 2005, P ISMB 2005 SIG M BI
[7]   The Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene Ontology [J].
Camon, E ;
Magrane, M ;
Barrell, D ;
Lee, V ;
Dimmer, E ;
Maslen, J ;
Binns, D ;
Harte, N ;
Lopez, R ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D262-D266
[8]  
COUTO F, 2005, P ACM C INF KNOWL MA
[9]   Measuring semantic similarity between Gene Ontology terms [J].
Couto, Francisco M. ;
Silva, Mario J. ;
Coutinho, Pedro M. .
DATA & KNOWLEDGE ENGINEERING, 2007, 61 (01) :137-152
[10]  
Cross V.V., 2002, Similarity and Compatibility in Fuzzy Set Theory: Assessment and Applications