Automatic concept extraction from spoken medical reports

被引:23
作者
Happe, A [1 ]
Pouliquen, B
Burgun, A
Cuggia, M
Le Beux, P
机构
[1] Intermede, F-35580 Guignen, France
[2] Commiss European Communities, IPSC, Joint Res Ctr, I-21020 Ispra, Italy
[3] Fac Med, Lab Informat Med, F-35033 Rennes, France
关键词
speech recognition; automatic indexing; terminology; MeSH; medical records;
D O I
10.1016/S1386-5056(03)00055-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: The objective of this project is to investigate methods whereby a combination of speech recognition and automated indexing methods substitute for current transcription and indexing practices. Methods: We based our study on existing speech recognition software programs and on NOMINDEX, a tool that extracts MeSH concepts from medical text in natural language and that is mainly based on a French medical lexicon and on the UMLS. For each document, the process consists of three steps: (1) dictation and digital audio recording, (2) speech recognition, (3) automatic indexing. The evaluation consisted of a comparison between the set of concepts extracted by NOMINDEx after the speech recognition phase and the set of keywords manually extracted from the initial document. The method was evaluated on a set of 28 patient discharge summaries extracted from the MENELAS corpus in French, corresponding to in-patients admitted for coronarography. Results: The overall precision was 73% and the overall recall was 90%. Indexing errors were mainly due to word sense ambiguity and abbreviations. A specific issue was the fact that the standard French translation of MeSH terms tacks diacritics. A preliminary evaluation of speech recognition tools showed that the rate of accurate recognition was higher than 98%. Only 3% of the indexing errors were generated by inadequate speech recognition. Discussion: We discuss several areas to focus on to improve this prototype. However, the very tow rate of indexing errors due to speech recognition errors highlights the potential benefits of combining speech recognition techniques and automatic indexing. (C) 2003 Elsevier Ireland Ltd. All rights reserved.
引用
收藏
页码:255 / 263
页数:9
相关论文
共 23 条
[1]   Computer-based speech recognition as an alternative to medical transcription [J].
Borowitz, SM .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2001, 8 (01) :101-102
[2]   Toward speech as a knowledge resource [J].
Brown, EW ;
Srinivasan, S ;
Coden, A ;
Ponceleon, D ;
Cooper, JW ;
Amir, A .
IBM SYSTEMS JOURNAL, 2001, 40 (04) :985-1001
[3]  
CODEN AR, 2001, ACM SIGIR FORUM APR
[4]   Comparative evaluation of three continuous speech recognition software packages in the generation of medical reports [J].
Devine, EG ;
Gaehde, SA ;
Curtis, AC .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2000, 7 (05) :462-468
[5]  
Franz P, 2000, J AM MED INFORM ASSN, P250
[6]   The utility and cost effectiveness of voice recognition technology in surgical pathology [J].
Henricks, WH ;
Roumina, K ;
Skilton, BE ;
Ozan, DJ ;
Goss, GR .
MODERN PATHOLOGY, 2002, 15 (05) :565-571
[7]   In vitro evaluation of a program for machine-aided indexing [J].
Jacquemin, C ;
Daille, B ;
Royauté, J ;
Polanco, X .
INFORMATION PROCESSING & MANAGEMENT, 2002, 38 (06) :765-792
[8]  
James D.A., 1995, THESIS U CAMBRIDGE, P3
[9]  
Langlotz CP, 2000, J DIGIT IMAGING, V13, P49, DOI 10.1053/jdim.2000.6825
[10]   BY-PRODUCTS OF THE COMPUTER-ASSISTED DIAGNOSTIC SYSTEM [ADM] [J].
LENOIR, P ;
CHALES, G ;
ROGER, MJ .
MEDICAL INFORMATICS, 1981, 6 (01) :33-40