Auditing the unified medical language system with semantic methods

被引:78
作者
Cimino, JJ [1 ]
机构
[1] Columbia Univ, Coll Phys & Surg, Dept Med Informat, New York, NY 10027 USA
关键词
D O I
10.1136/jamia.1998.0050041
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: The National Library of Medicine's (NLM) Unified Medical Language System (UMLS) includes a Metathesaurus (Meta), which is a compilation of medical terms drawn from over 30 controlled vocabularies, and a Semantic Net, which contains the semantic types used to categorize Meta concepts and the semantic relations to connect them. Meta has been constructed through lexical matching techniques and human review. The purpose of this study was to audit the Meta using semantic techniques to identify possible inconsistencies. Methods: Five different techniques were applied: (1) detection of ambiguity in Meta concepts with two or more semantic types, (2) detection of interchangeable keyword synonyms, (3) detection of redundant pairs of Meta concepts (using lexical matching combined with keyword synonyms), (4) detection of inconsistent parent-child relationships in Meta (based on the semantic type information), and (5) discovery of pairs of semantic types for which relations could be added to the Semantic Net, based on "other" relationships between Meta concepts. Results: Of 57,592 concepts with multiple semantic ty pes, 1817 (3.2%) were judged to be ambiguous. Keyword analysis showed 7121 pairs of interchangeable words. Using the keyword pairs, 5031 pairs of potentially redundant concepts were suggested, of which 3274 (65.1%) were judged to actually be redundant. Review of the 100,586 parent-child relationships revealed 544 (0.54%) that were incorrect. Review of the 219,664 "Other" relationships suggested 1299 places in the Semantic Net where relations between pairs of semantic types could be added. Conclusion: Semantic techniques, alone or in combination, can be used to audit the UMLS to detect inconsistencies that are not detectable through lexical techniques alone. Use of these methods to augment the UMLS maintenance process will lead to improvement in the UMLS.
引用
收藏
页码:41 / 51
页数:11
相关论文
共 22 条
[1]  
Bean C. A., 1996, Knowledge Organization and Change. Proceedings of the Fourth International ISKO Conference, P80
[2]  
CAMBELL KE, 1994, JAMIA, V1, P218
[3]  
CAMBELL KE, 1992, P 16 ANN S COMP APP, P354
[4]  
Chute C G, 1991, Proc Annu Symp Comput Appl Med Care, P185
[5]  
Cimino J J, 1995, Medinfo, V8 Pt 1, P117
[6]  
CIMINO JJ, 1990, M D COMPUT, V7, P104
[7]   KNOWLEDGE-BASED APPROACHES TO THE MAINTENANCE OF A LARGE CONTROLLED MEDICAL TERMINOLOGY [J].
CIMINO, JJ ;
CLAYTON, PD ;
HRIPCSAK, G ;
JOHNSON, SB .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1994, 1 (01) :35-50
[8]  
CIMINO JJ, 1991, METHOD INFORM MED, V30, P179
[9]  
DUJOLS P, 1991, METHOD INFORM MED, V30, P30
[10]  
HOLLANDER D, 1988, IDENTIFICATION SEMAN