A GENERAL NATURAL-LANGUAGE TEXT PROCESSOR FOR CLINICAL RADIOLOGY

被引:453
作者
FRIEDMAN, C
ALDERSON, PO
AUSTIN, JHM
CIMINO, JJ
JOHNSON, SB
机构
[1] COLUMBIA UNIV,NEW YORK,NY 10032
[2] COLUMBIA PRESBYTERIAN MED CTR,NEW YORK,NY 10032
关键词
D O I
10.1136/jamia.1994.95236146
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: Development of a general natural-language processor that identifies clinical information in narrative reports and maps that information into a structured representation containing clinical terms. Design: The natural-language processor provides three phases of processing, all. of which are driven by different knowledge sources. The first phase performs the parsing. It identifies the structure of the text through use of a grammar that defines semantic patterns and a target form. The second phase, regularization, standardizes the terms in the initial target structure via a compositional mapping of multi-word phrases. The third phase, encoding, maps the terms to a controlled vocabulary. Radiology is the test domain for the processor and the target structure is a formal model for representing clinical information in that domain. Measurements: The impression sections of 230 radiology reports were encoded by the processor. Results of an automated query of the resultant database for the occurrences of four diseases were compared with the analysis of a panel of three physicians to determine recall and precision. Results: Without training specific to the four diseases, recall and precision of the system (combined effect of the processor and query generator) were 70% and 87%. Training of the query component increased recall to 85% without changing precision.
引用
收藏
页码:161 / 174
页数:14
相关论文
共 40 条
[1]  
Baud R. H., 1991, 3RD P C ART INT MED, P173
[2]  
BAUD RH, 1992, METHOD INFORM MED, V31, P117
[3]  
BELL DS, 1992, 16TH P ANN S COMP AP, P789
[4]  
BENOIT RG, 1992, 16TH P ANN S COMP AP, P787
[5]  
Campbell K. E., 1992, MEDINFO 92. Proceedings of the Seventh World Congress on Medical Informatics, P1437
[6]  
CAMPBELL KE, 1992, 16TH P ANN S COMP AP, P354
[7]  
Canfield K., 1990, Fourteenth Annual Symposium on Computer Applications in Medical Care. Standards in Medical Informatics. A Conference of the American Medical Informatics Association, P350
[8]  
CANFIELD K, 1990, 13TH P ANN S COMP AP, P559
[9]   KNOWLEDGE-BASED APPROACHES TO THE MAINTENANCE OF A LARGE CONTROLLED MEDICAL TERMINOLOGY [J].
CIMINO, JJ ;
CLAYTON, PD ;
HRIPCSAK, G ;
JOHNSON, SB .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1994, 1 (01) :35-50
[10]  
CRISTEA D, 1988, CLIN COMPUT, V16, P156