Information retrieval and knowledge discovery utilizing a biomedical patent Semantic Web

被引:58
作者
Mukherjea, S [1 ]
Bamba, B [1 ]
Kankar, P [1 ]
机构
[1] Indian Inst Technol, IBM India Res Lab, New Delhi 110016, India
关键词
biomedical information retrieval; Semantic Web; information extraction;
D O I
10.1109/TKDE.2005.130
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Before undertaking new biomedical research, identifying concepts that have already been patented is essential. A traditional keyword-based search on patent databases may not be sufficient to retrieve all the relevant information, especially for the biomedical domain. This paper presents BioPatentMiner, a system that facilitates information retrieval and knowledge discovery from biomedical patents. The system first identifies biological terms and relations from the patents and then integrates the information from the patents with knowledge from biomedical ontologies to create a Semantic Web. Besides keyword search and queries linking the properties specified by one or more RDF triples, the system can discover semantic associations between the Web resources. The system also determines the importance of the resources to rank the results of a search and prevent information overload while determining the semantic associations.
引用
收藏
页码:1099 / 1110
页数:12
相关论文
共 19 条
[1]  
[Anonymous], P 12 INT WORLD WID W
[2]  
[Anonymous], 1998, P ACM SIAM S DISCR A
[3]   The anatomy of a large-scale hypertextual Web search engine [J].
Brin, S ;
Page, L .
COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7) :107-117
[4]  
CARMEL D, 2001, P 10 TEXT RETR C, P228
[5]  
CHEN J, 1998, P ACM SIGMOD C
[6]  
KANDO N, 2000, ACM SIGIR FORUM, V34, P28
[7]  
KANKAR P, 2002, P 2 SIAM INT C DAT M
[8]  
KARVOUNARAKIS S, 2002, P 11 INT WORLD WID W
[9]  
LARKEY L, 1999, P ACM DIG LIB C
[10]   The stochastic approach for link-structure analysis (SALSA) and the TKC effect [J].
Lempel, R ;
Moran, S .
COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 2000, 33 (1-6) :387-401