A CONTINUOUS-SPEECH INTERFACE TO A DECISION-SUPPORT SYSTEM .2. AN EVALUATION USING A WIZARD-OF-OZ EXPERIMENTAL PARADIGM

被引:14
作者
DETMER, WM
SHIFFMAN, S
WYATT, JC
FRIEDMAN, CP
LANE, CD
FAGAN, LM
机构
[1] IMPERIAL CANC RES FUND,BIOMED INFORMAT UNIT,LONDON,ENGLAND
[2] UNIV N CAROLINA,SCH MED,COMP & COGNIT LAB,CHAPEL HILL,NC
关键词
D O I
10.1136/jamia.1995.95202548
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: Evaluate the performance of a continuous-speech interface to a decision support system. Design: The authors performed a prospective evaluation of a speech interface that matches unconstrained utterances of physicians with controlled-vocabulary terms from Quick Medical Reference (QMR). The performance of the speech interface was assessed in two stages: in the real-time experiment, physician subjects viewed audiovisual stimuli intended to evoke clinical findings, spoke a description of each finding into the speech interface, and then chose from a list generated by the interface the QMR term that most closely matched the finding. Subjects believed that the speech recognizer decoded their utterances; in reality, a hidden experimenter typed utterances into the interface (Wizard-of-Oz experimental design). Later, the authors replayed the same utterances through the speech recognizer and measured how accurately utterances matched with appropriate QMR terms using the results of the real-time experiment as the ''gold standard.'' Measurements: The authors measured how accurately the speech-recognition system converted input utterances to text strings (recognition accuracy) and how accurately the speech interface matched input utterances to appropriate QMR terms (semantic accuracy). Results: Overall recognition accuracy was less than 50%. However, using language-processing techniques that match keywords in recognized utterances to keywords in QMR terms, the semantic accuracy of the system was 81%. Conclusions: Reasonable semantic accuracy was attained when language-processing techniques were used to accommodate for speech misrecognition. In addition, the Wizard-of-Oz experimental design offered many advantages for this evaluation. The authors believe that this technique may be useful to future evaluators of speech-input systems.
引用
收藏
页码:46 / 57
页数:12
相关论文
共 19 条
[1]  
ANDERSON JG, 1994, EVALUATING HLTH CARE
[2]   TOWARDS AUTOMATIC EVALUATION OF MULTIMODAL USER INTERFACES [J].
COUTAZ, J ;
SALBER, D ;
BALBO, S .
KNOWLEDGE-BASED SYSTEMS, 1993, 6 (04) :267-274
[3]   WIZARD OF OZ STUDIES - WHY AND HOW [J].
DAHLBACK, N ;
JONSSON, A ;
AHRENBERG, L .
KNOWLEDGE-BASED SYSTEMS, 1993, 6 (04) :258-266
[4]  
GOULD FD, 1981, COMMUN ACM, V26, P295
[5]   TALKING TO COMPUTERS - AN EMPIRICAL-INVESTIGATION [J].
HAUPTMANN, AG ;
RUDNICKY, AI .
INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1988, 28 (06) :583-604
[6]  
HOOPER RS, 1965, 9556 IBM CORP TECHN
[7]  
ISAACS E, 1993, METHOD INFORM MED, V32, P18
[8]  
Johnson K, 1992, Proc Annu Symp Comput Appl Med Care, P757
[9]   COMBINING QUALITATIVE AND QUANTITATIVE METHODS IN INFORMATION-SYSTEMS RESEARCH - A CASE-STUDY [J].
KAPLAN, B ;
DUCHON, D .
MIS QUARTERLY, 1988, 12 (04) :571-586
[10]  
KUHN K, 1992, METHOD INFORM MED, V31, P268