Formalization of taxon-based constraints to detect inconsistencies in annotation and ontology development

被引:34
作者
Deegan , Jennifer I. [1 ]
Dimmer, Emily C. [1 ]
Mungall, Christopher J. [2 ]
机构
[1] European Bioinformat Inst, Cambridge CB10 1SD, England
[2] Univ Calif Berkeley, Lawrence Berkeley Lab, Berkeley, CA 94720 USA
来源
BMC BIOINFORMATICS | 2010年 / 11卷
关键词
GO;
D O I
10.1186/1471-2105-11-530
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The Gene Ontology project supports categorization of gene products according to their location of action, the molecular functions that they carry out, and the processes that they are involved in. Although the ontologies are intentionally developed to be taxon neutral, and to cover all species, there are inherent taxon specificities in some branches. For example, the process 'lactation' is specific to mammals and the location 'mitochondrion' is specific to eukaryotes. The lack of an explicit formalization of these constraints can lead to errors and inconsistencies in automated and manual annotation. Results: We have formalized the taxonomic constraints implicit in some GO classes, and specified these at various levels in the ontology. We have also developed an inference system that can be used to check for violations of these constraints in annotations. Using the constraints in conjunction with the inference system, we have detected and removed errors in annotations and improved the structure of the ontology. Conclusions: Detection of inconsistencies in taxon-specificity enables gradual improvement of the ontologies, the annotations, and the formalized constraints. This is progressively improving the quality of our data. The full system is available for download, and new constraints or proposed changes to constraints can be submitted online at https://sourceforge.net/tracker/?atid=605890&group_id=36855.
引用
收藏
页数:10
相关论文
共 21 条
[1]  
[Anonymous], 2003, OVERVIEW SWI PROLOG
[2]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[3]   The Gene Ontology Annotation (GOA) project - application of GO in SWISS-PROT, TrEMBL and InterPro [J].
Camon, E ;
Barrell, D ;
Brooksbank, C ;
Magrane, M ;
Apweiler, R .
COMPARATIVE AND FUNCTIONAL GENOMICS, 2003, 4 (01) :71-74
[4]  
Carbon Seth., 2008, Bioinformatics
[5]   It's all GO for plant scientists [J].
Clark, JI ;
Brooksbank, C ;
Lomax, J .
PLANT PHYSIOLOGY, 2005, 138 (03) :1268-1279
[6]  
COURTOT M, 2009, ICBO
[7]   OBO-Edit - an ontology editor for biologists [J].
Day-Richter, John ;
Harris, Midori A. ;
Haendel, Melissa .
BIOINFORMATICS, 2007, 23 (16) :2198-2200
[8]   Antigenic subclasses of polytropic murine leukemia virus (MLV) isolates reflect three distinct groups of endogenous polytropic MLV-related sequences in NFS/N mice [J].
Evans, LH ;
Lavignon, M ;
Taylor, M ;
Alamgir, ASM .
JOURNAL OF VIROLOGY, 2003, 77 (19) :10327-10338
[9]   The Gene Ontology's Reference Genome Project: A Unified Framework for Functional Annotation across Species [J].
Gaudet, Pascale ;
Chisholm, Rex ;
Berardini, Tanya ;
Dimmer, Emily ;
Engel, Stacia R. ;
Fey, Petra ;
Hill, David P. ;
Howe, Doug ;
Hu, James C. ;
Huntley, Rachael ;
Khodiyar, Varsha K. ;
Kishore, Ranjana ;
Li, Donghui ;
Lovering, Ruth C. ;
McCarthy, Fiona ;
Ni, Li ;
Petri, Victoria ;
Siegele, Deborah A. ;
Tweedie, Susan ;
Van Auken, Kimberly ;
Wood, Valerie ;
Basu, Siddhartha ;
Carbon, Seth ;
Dolan, Mary ;
Mungall, Christopher J. ;
Dolinski, Kara ;
Thomas, Paul ;
Ashburner, Michael ;
Blake, Judith A. ;
Cherry, J. Michael ;
Lewis, Suzanna E. .
PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (07)
[10]  
*GOBO, GOBO PERL TOOK