Obol: integrating language and meaning in bio-ontologies

被引:58
作者
Mungall, CJ [1 ]
机构
[1] Univ Calif Berkeley, Dept Mol & Cell Biol, HHMI, Berkeley, CA 94720 USA
来源
COMPARATIVE AND FUNCTIONAL GENOMICS | 2004年 / 5卷 / 6-7期
关键词
D O I
10.1002/cfg.435
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Ontologies are intended to capture and formalize a domain of knowledge. The ontologies comprising the Open Biological Ontologies (OBO) project, which includes the Gene Ontology (GO), are formalizations of various domains of biological knowledge. Ontologies within OBO typically lack computable definitions that serve to differentiate a term from other similar terms. The computer is unable to determine the meaning of a term, which presents problems for tools such as automated reasoners. Reasoners can be of enormous benefit in managing a complex ontology. OBO term names frequently implicitly encode the kind of definitions that can be used by computational tools, such as automated reasoners. The definitions encoded in the names are not easily amenable to computation, because the names are ostensibly natural language phrases designed for human users. These names are highly regular in their grammar, and can thus be treated as valid sentences in some formal or computable language. With a description of the rules underlying this formal language, term names can be parsed to derive computable definitions, which can then be reasoned over. This paper describes the effort to elucidate that language, called Obol, and the attempts to reason over the resulting definitions. The current implementation finds unique non-trivial definitions for around half of the terms in the GO, and has been used to find 223 missing relationships, which have since been added to the ontology. Obol has utility as an ontology maintenance tool, and as a means of generating computable definitions for a whole ontology. The software is available under an open-source license from: http://www.fruitfly. org/-cjm/obol. Supplementary material for this article can be found at: http://www. interscience.wiley.com/jpages/1531-6912/suppmat. Copyright (C) 2004 John Wiley Sons, Ltd.
引用
收藏
页码:509 / 520
页数:12
相关论文
共 19 条
[1]  
[Anonymous], STANFORD ENCY PHILOS
[2]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[3]  
BARD J, 2004, UNPUB GENOME BIOL
[4]  
Chomsky Noam, 1959, Infromation and Control, V2, P137, DOI 10.1016/S0019-9958(59)90362-6
[5]  
Clocksin W. F., 1981, Programming in Prolog
[6]  
Drysdale R, 2001, Brief Bioinform, V2, P68, DOI 10.1093/bib/2.1.68
[7]  
Gkoutos GV, 2003, PACIFIC SYMPOSIUM ON BIOCOMPUTING 2004, P178
[8]   The Gene Ontology (GO) database and informatics resource [J].
Harris, MA ;
Clark, J ;
Ireland, A ;
Lomax, J ;
Ashburner, M ;
Foulger, R ;
Eilbeck, K ;
Lewis, S ;
Marshall, B ;
Mungall, C ;
Richter, J ;
Rubin, GM ;
Blake, JA ;
Bult, C ;
Dolan, M ;
Drabkin, H ;
Eppig, JT ;
Hill, DP ;
Ni, L ;
Ringwald, M ;
Balakrishnan, R ;
Cherry, JM ;
Christie, KR ;
Costanzo, MC ;
Dwight, SS ;
Engel, S ;
Fisk, DG ;
Hirschman, JE ;
Hong, EL ;
Nash, RS ;
Sethuraman, A ;
Theesfeld, CL ;
Botstein, D ;
Dolinski, K ;
Feierbach, B ;
Berardini, T ;
Mundodi, S ;
Rhee, SY ;
Apweiler, R ;
Barrell, D ;
Camon, E ;
Dimmer, E ;
Lee, V ;
Chisholm, R ;
Gaudet, P ;
Kibbe, W ;
Kishore, R ;
Schwarz, EM ;
Sternberg, P ;
Gwinn, M .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D258-D261
[9]  
HARRSLEV VRM, 2003, P 2 INT WORKSH EV ON
[10]   Extension and integration of the gene ontology (GO): Combining GO vocabularies with external vocabularies [J].
Hill, DP ;
Blake, JA ;
Richardson, JE ;
Ringwald, M .
GENOME RESEARCH, 2002, 12 (12) :1982-1991