Biological ontology enhancement with fuzzy relations: A text-mining framework

Cited by: 9
Authors
Abulaish, M [1]
Dey, L [1]
Affiliations
[1] Jamia Millia Islamia, Dept Math, New Delhi 25, India
Source
2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, PROCEEDINGS | 2005
Keywords
biological information extraction; text mining; fuzzy ontology; fuzzy relation
DOI
10.1109/WI.2005.43
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Domain ontology can help in information retrieval from documents. But ontology is a pre-defined structure with crisp concept descriptions and inter-concept relations. However, due to the dynamic nature of the document repository, ontology should be upgradeable with information extracted through text mining of documents in the domain. This also necessitates that concepts, their descriptions and inter-concept relations should be associated with a degree of fuzziness that will indicate the support for the extracted knowledge according to the currently available resources. Supports may be revised with more knowledge coming in future. This approach preserves the basic structured knowledge format for storing domain knowledge, but at the same time allows for update of information. In this paper, we have proposed a mechanism which initiates text mining with a set of ontological concepts, and thereafter extracts fuzzy relations through text mining. Membership values of relations are functions of frequency of co-occurrence of concepts and relations. We have worked on the GENIA corpus and shown how fuzzy relations can be further used for guided information extraction from MEDLINE documents.
Pages: 379-385
Page count: 7
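
Illustrative note: the abstract states that membership values of relations are functions of the frequency of co-occurrence of concepts and relations, but this record does not give the actual function. The sketch below is only one plausible reading, assuming the membership of a relation is the share of a concept pair's co-occurrences in which that relation appears. The function name `fuzzy_memberships`, the triple layout, and the toy data are all hypothetical and are not taken from the paper.

```python
from collections import Counter


def fuzzy_memberships(triples):
    """Return a membership value in (0, 1] for each distinct triple.

    ASSUMPTION: membership = count(concept_a, relation, concept_b)
                             / count(concept_a, concept_b).
    This normalisation is illustrative; the paper's own formula may differ.
    """
    # How often each full (concept, relation, concept) triple was observed.
    triple_counts = Counter(triples)
    # How often each concept pair was observed with any relation at all.
    pair_counts = Counter((a, b) for a, _, b in triples)
    return {
        (a, rel, b): count / pair_counts[(a, b)]
        for (a, rel, b), count in triple_counts.items()
    }


# Hypothetical toy observations, standing in for triples mined from text.
observed = [
    ("IL-2", "activates", "NF-kappa B"),
    ("IL-2", "activates", "NF-kappa B"),
    ("IL-2", "activates", "NF-kappa B"),
    ("IL-2", "inhibits", "NF-kappa B"),
]

memberships = fuzzy_memberships(observed)
for triple, value in memberships.items():
    print(triple, value)
```

Re-running the function on a larger set of observations would change the values, which mirrors the abstract's remark that supports may be revised as more knowledge comes in.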