An open annotation ontology for science on web 3.0

被引:46
作者
Ciccarese P. [1 ]
Ocana M. [2 ]
Garcia Castro L.J. [3 ]
Das S. [1 ]
Clark T. [1 ,4 ]
机构
[1] Harvard Medical School and Massachusetts General Hospital, Boston, MA
[2] Balboa Systems, Newton, MA
[3] Universität der Bundeswehr München, Neubiberg
[4] School of Computer Science, University of Manchester, Manchester
基金
美国国家卫生研究院;
关键词
Domain Ontology; Biomedical Ontology; Annotation Type; Simple Knowledge Organization System; Neuroscience Information Framework;
D O I
10.1186/2041-1480-2-S2-S4
中图分类号
学科分类号
摘要
Background: There is currently a gap between the rich and expressive collection of published biomedical ontologies, and the natural language expression of biomedical papers consumed on a daily basis by scientific researchers. The purpose of this paper is to provide an open, shareable structure for dynamic integration of biomedical domain ontologies with the scientific document, in the form of an Annotation Ontology (AO), thus closing this gap and enabling application of formal biomedical ontologies directly to the literature as it emerges. Methods: Initial requirements for AO were elicited by analysis of integration needs between biomedical web communities, and of needs for representing and integrating results of biomedical text mining. Analysis of strengths and weaknesses of previous efforts in this area was also performed. A series of increasingly refined annotation tools were then developed along with a metadata model in OWL, and deployed for feedback and additional requirements the ontology to users at a major pharmaceutical company and a major academic center. Further requirements and critiques of the model were also elicited through discussions with many colleagues and incorporated into the work. Results: This paper presents Annotation Ontology (AO), an open ontology in OWL-DL for annotating scientific documents on the web. AO supports both human and algorithmic content annotation. It enables "stand-off" or independent metadata anchored to specific positions in a web document by any one of several methods. In AO, the document may be annotated but is not required to be under update control of the annotator. AO contains a provenance model to support versioning, and a set model for specifying groups and containers of annotation. AO is freely available under open source license at http://purl.org/ao/ , and extensive documentation including screencasts is available on AO's Google Code page: http://code.google.com/p/annotation-ontology/. Conclusions: The Annotation Ontology meets critical requirements for an open, freely shareable model in OWL, of annotation metadata created against scientific documents on the Web. We believe AO can become a very useful common model for annotation metadata on Web documents, and will enable biomedical domain ontologies to be used quite widely to annotate the scientific literature. Potential collaborators and those with new relevant use cases are invited to contact the authors. © 2011 Ciccarese et al; licensee BioMed Central Ltd.
引用
收藏
相关论文
共 62 条
[1]
Smith B., The Logic of Biological Classification and the Foundations of Biomedical Ontology., Invited Papers from the 10th International Conference in Logic Methodology and Philosophy of Science, Oviedo, Spain, 2003, (2004)
[2]
Natale D., Arighi C., Barker W., Blake J., Chang T.-C., Hu Z., Liu H., Smith B., Wu C., Framework for a Protein Ontology, BMC Bioinformatics, 8, (2007)
[3]
Alexander C.Y., Methods in biomedical ontology, J of Biomedical Informatics, 39, 3, pp. 252-266, (2006)
[4]
Smith B., Ashburner M., Rosse C., Bard J., Bug W., Ceusters W., Goldberg L.J., Eilbeck K., Ireland A., Mungall C.J., Et al., The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration, Nat Biotechnol, 25, 11, pp. 1251-1255, (2007)
[5]
Natale D.A., Arighi C.N., Barker W.C., Blake J., Chang T.C., Hu Z., Liu H., Smith B., Wu C.H., Framework for a protein ontology, BMC Bioinformatics, 8, (2007)
[6]
Natale D.A., Arighi C.N., Barker W.C., Blake J.A., Bult C.J., Caudy M., Drabkin H.J., D'Eustachio P., Evsikov A.V., Huang H., Et al., The Protein Ontology: a structured representation of protein forms and complexes, Nucleic Acids Res, 39, DATABASE ISSUE, pp. D539-545, (2011)
[7]
Smith B., Ceusters W., Klagges B., Kohler J., Kumar A., Lomax J., Mungall C., Neuhaus F., Rector A.L., Rosse C., Relations in biomedical ontologies, Genome Biol, 6, 5, (2005)
[8]
Aranguren M.E., Bechhofer S., Lord P., Sattler U., Stevens R., Understanding and using the meaning of statements in a bio-ontology: recasting the Gene Ontology in OWL, BMC Bioinformatics, 8, (2007)
[9]
Eilbeck K., Lewis S.E., Mungall C.J., Yandell M., Stein L., Durbin R., Ashburner M., The Sequence Ontology: a tool for the unification of genome annotations, Genome Biol, 6, 5, (2005)
[10]
Ashburner M., Ball C., Blake J., Botstein D., Butler H., Cherry J., Davis A., Dolinski K., Dwight S., Eppig J., Et al., Gene ontology: tool for the unification of biology. The Gene Ontology Consortium, Nature Genetics, 25, 1, pp. 25-29, (2000)