Identification of OBO nonalignments and its implications for OBO enrichment

被引:14
作者
Bada, Michael [1 ]
Hunter, Lawrence [1 ]
机构
[1] Univ Colorado Denver, Dept Pharmacol, Aurora, CO 80045 USA
关键词
D O I
10.1093/bioinformatics/btn194
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Existing projects that focus on the semiautomatic addition of links between existing terms in the Open Biomedical Ontologies can take advantage of reasoners that can make new inferences between terms that are based on the added formal definitions and that reflect nonalignments between the linked terms. However, these projects require that these definitions be necessary and sufficient, a strong requirement that often does not hold. If such definitions cannot be added, the reasoners cannot point to the nonalignments through the suggestion of new inferences. Results: We describe a methodology by which we have identified over 1900 instances of nonredundant nonalignments between terms from the Gene Ontology (GO) biological process (BP), cellular component (CC) and molecular function (MF) ontologies, Chemical Entities of Biological Interest (ChEBI) and the Cell Type Ontology (CL). Many of the 39.8% of these nonalignments whose object terms are more atomic than the subject terms are not currently examined in other ontology-enrichment projects due to the fact that the necessary and sufficient conditions required for the inferences are not currently examined. Analysis of the ratios of nonalignments to assertions from which the nonalignments were identified suggests that BP-MF, BP-BP, BP-CL and CC-CC terms are relatively well-aligned, while ChEBI-MF, BP-ChEBI and CC-MF terms are relatively not aligned well. We propose four ways to resolve an identified nonalignment and recommend an analogous implementation of our methodology in ontology-enrichment tools to identify types of nonalignments that are currently not detected.
引用
收藏
页码:1448 / 1455
页数:8
相关论文
共 12 条
[1]  
ARANGUREN ME, 2004, THESIS U MANCHESTER
[2]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[3]  
BADA M, 2007, J BIOMED IN IN PRESS
[4]   An ontology for cell types [J].
Bard, J ;
Rhee, SY ;
Ashburner, M .
GENOME BIOLOGY, 2005, 6 (02)
[5]  
BERNAUER J, 1994, P 1994 ANN S COMP AP
[6]   Investigating subsumption in SNOMED CT: An exploration into large description logic-based biomedical terminologies [J].
Bodenreider, Olivier ;
Smith, Barry ;
Kumar, Anand ;
Burgun, Anita .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2007, 39 (03) :183-195
[7]  
BURGUN A, 2005, P 2005 ANN S AM MED
[8]   Auditing the unified medical language system with semantic methods [J].
Cimino, JJ .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1998, 5 (01) :41-51
[9]  
DEGTYARENKO K, 2003, P 2003 INT CHEM INF
[10]   Obol: integrating language and meaning in bio-ontologies [J].
Mungall, CJ .
COMPARATIVE AND FUNCTIONAL GENOMICS, 2004, 5 (6-7) :509-520