Building a biomedical ontology recommender web service

被引:33
作者
Jonquet C. [1 ]
Musen M.A. [1 ]
Shah N.H. [1 ]
机构
[1] Center for Biomedical Informatics Research, Stanford University, 94305, CA
基金
美国国家卫生研究院;
关键词
Unify Medical Language System; Ontology Concept; Recommender Service; Biomedical Ontology; Concept Recognition;
D O I
10.1186/2041-1480-1-S1-S1
中图分类号
学科分类号
摘要
Background: Researchers in biomedical informatics use ontologies and terminologies to annotate their data in order to facilitate data integration and translational discoveries. As the use of ontologies for annotation of biomedical datasets has risen, a common challenge is to identify ontologies that are best suited to annotating specific datasets. The number and variety of biomedical ontologies is large, and it is cumbersome for a researcher to figure out which ontology to use. Methods: We present the Biomedical Ontology Recommender web service. The system uses textual metadata or a set of keywords describing a domain of interest and suggests appropriate ontologies for annotating or representing the data. The service makes a decision based on three criteria. The first one is coverage, or the ontologies that provide most terms covering the input text. The second is connectivity, or the ontologies that are most often mapped to by other ontologies. The final criterion is size, or the number of concepts in the ontologies. The service scores the ontologies as a function of scores of the annotations created using the National Center for Biomedical Ontology (NCBO) Annotator web service. We used all the ontologies from the UMLS Metathesaurus and the NCBO BioPortal. Results: We compare and contrast our Recommender by an exhaustive functional comparison to previously published efforts. We evaluate and discuss the results of several recommendation heuristics in the context of three real world use cases. The best recommendations heuristics, rated 'very relevant' by expert evaluators, are the ones based on coverage and connectivity criteria. The Recommender service (alpha version) is available to the community and is embedded into BioPortal. © 2010 Jonquet et al; licensee BioMed Central Ltd.
引用
收藏
相关论文
共 33 条
[1]  
Butte A.J., Chen R., Finding disease-related genomic experiments within an international repository: first steps in translational bioinformatics, American Medical Informatics Association Annual Symposium, pp. 106-110, (2006)
[2]  
Bodenreider O., Stevens R., Bio-ontologies: Current Trends and Future Directions, Briefings in Bioinformatics, 7, 3, pp. 256-274, (2006)
[3]  
Sabou M., Lopez V., Motta E., Ontology Selection on the Real Semantic Web: How to Cover the Queens Birthday Dinner?, 15th International Conference on Knowledge Engineering and Knowledge Management Managing Knowledge in a World of Networks, pp. 96-111, (2006)
[4]  
Pafilis E., O'Donoghue S.I., Jensen L.J., Horn H., Kuhn M., Brown N.P., Schneider R., Reflect: augmented browsing for the life scientist, Nature Biotechnology, 27, pp. 508-510, (2009)
[5]  
Jonquet C., Shah N.H., Musen M.A., The Open Biomedical Annotator. AMIA Summit on Translational Bioinformatics, pp. 56-60, (2009)
[6]  
Shah N.H., Jonquet C., Chiang A.P., Butte A.J., Chen R., Musen M.A., Ontology-driven Indexing of Public Datasets for Translational Bioinformatics, BMC Bioinformatics, (2009)
[7]  
Noy N.F., Shah N.H., Whetzel P.L., Dai B., Dorf M., Griffith N.B., Jonquet C., Rubin D.L., Storey M.A., Chute C.G., Musen M.A., BioPortal: ontologies and integrated data resources at the click of a mouse, Nucleic Acids Research, (2009)
[8]  
Bodenreider O., The Unified Medical Language System (UMLS): integrating biomedical terminology, Nucleic Acids Research, 32, pp. 267-270, (2004)
[9]  
Ghazvinian A., Noy N., Jonquet C., Shah N., Musen M., What Four Million Mappings Can Tell You about Two Hundred Ontologies, 8th International Semantic Web Conference, ISWC'09, Volume 5823 of Lecture Notes in Computer Science, pp. 229-242, (2009)
[10]  
Alani H., Brewster C., Ontology Ranking Based on the Analysis of Concept Structures, 3rd International Conference on Knowledge Capture, K-Cap'05., pp. 51-58, (2005)