Towards a semantic lexicon for biological language processing

Cited by: 5
Authors
Verspoor, K [1]
Affiliations
[1] Los Alamos Natl Lab, Los Alamos, NM 87545 USA
Source
COMPARATIVE AND FUNCTIONAL GENOMICS | 2005 / Volume 6 / Issue 1-2
Keywords
natural language processing; lexicon; unified medical language system
DOI
10.1002/cfg.451
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
This paper explores the use of the resources in the National Library of Medicine's Unified Medical Language System (UMLS) for the construction of a lexicon useful for processing texts in the field of molecular biology. A lexicon is constructed from overlapping terms in the UMLS SPECIALIST lexicon and the UMLS Metathesaurus to obtain both morphosyntactic and semantic information for terms, and the coverage of a domain corpus is assessed. Over 77% of tokens in the domain corpus are found in the constructed lexicon, validating the lexicon's coverage of the most frequent terms in the domain and indicating that the constructed lexicon is potentially an important resource for biological text processing. Copyright (c) 2005 John Wiley & Sons, Ltd.
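The construction and evaluation steps described in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the paper's implementation: the function names `build_lexicon` and `token_coverage`, the dictionary layout, and the toy entries are all assumptions, and no real UMLS data or API is used. It keeps only terms that appear in both a morphosyntactic resource and a semantic resource, then measures what fraction of corpus tokens the resulting lexicon covers.

```python
from collections import Counter


def build_lexicon(specialist, metathesaurus):
    """Keep terms found in both resources and merge their annotations.

    `specialist` maps a term to morphosyntactic info (here, a part of speech);
    `metathesaurus` maps a term to a list of semantic types.
    Both layouts are hypothetical stand-ins for the real UMLS resources.
    """
    shared_terms = specialist.keys() & metathesaurus.keys()
    return {
        term: {"pos": specialist[term], "semantic_types": metathesaurus[term]}
        for term in shared_terms
    }


def token_coverage(tokens, lexicon):
    """Return the fraction of corpus tokens (not distinct types) in the lexicon."""
    counts = Counter(token.lower() for token in tokens)
    total = sum(counts.values())
    covered = sum(n for token, n in counts.items() if token in lexicon)
    return covered / total if total else 0.0


# Toy data, invented for illustration only.
specialist = {"protein": "noun", "bind": "verb", "kinase": "noun", "the": "det"}
metathesaurus = {
    "protein": ["Amino Acid, Peptide, or Protein"],
    "kinase": ["Enzyme"],
    "cell": ["Cell"],
}

lexicon = build_lexicon(specialist, metathesaurus)
tokens = "The kinase binds the protein and the protein binds the kinase".split()
coverage = token_coverage(tokens, lexicon)
print(f"{len(lexicon)} lexicon entries, token coverage {coverage:.1%}")
```

Coverage here is counted over tokens rather than distinct terms, which matches the abstract's phrasing ("tokens in the domain corpus"); the sketch does no lemmatization or multi-word term matching, which a real pipeline would likely need.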
Pages: 61-66
Page count: 6