Competency Evaluation of Plant Character Ontologies Against Domain Literature

Times cited: 12
Author
Cui, Hong [1]
Affiliation
[1] Univ Arizona, Sch Informat Resources & Lib Sci, Tucson, AZ 85721 USA
Source
Journal of the American Society for Information Science and Technology | 2010 / Vol. 61 / Issue 06
Funding
US National Science Foundation;
Keywords
DOI
10.1002/asi.21325
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Specimen identification keys are still the most commonly created tools used by systematic biologists to access biodiversity information. Creating identification keys requires analyzing and synthesizing large amounts of information from specimens and their descriptions and is a very labor-intensive and time-consuming activity. Automating the generation of identification keys from text descriptions becomes a highly attractive text mining application in the biodiversity domain. Fine-grained semantic annotation of morphological descriptions of organisms is a necessary first step in generating keys from text. Machine-readable ontologies are needed in this process because most biological characters are only implied (i.e., not stated) in descriptions. The immediate question to ask is "How well do existing ontologies support semantic annotation and automated key generation?" With the intention to either select an existing ontology or develop a unified ontology based on existing ones, this paper evaluates the coverage, semantic consistency, and inter-ontology agreement of a biodiversity character ontology and three plant glossaries that may be turned into ontologies. The coverage and semantic consistency of the ontology/glossaries are checked against the authoritative domain literature, namely, Flora of North America and Flora of China. The evaluation results suggest that more work is needed to improve the coverage and interoperability of the ontology/glossaries. More concepts need to be added to the ontology/glossaries and careful work is needed to improve the semantic consistency. The method used in this paper to evaluate the ontology/glossaries can be used to propose new candidate concepts from the domain literature and suggest appropriate definitions.
Pages: 1144-1165
Page count: 22