EVALUATING THE ARTICULATION INDEX FOR AUDITORY VISUAL INPUT

被引:95
作者
GRANT, KW [1 ]
BRAIDA, LD [1 ]
机构
[1] MIT,ELECTR RES LAB,CAMBRIDGE,MA 02139
关键词
D O I
10.1121/1.400733
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
An investigation of the auditory-visual (AV) articulation index (AI) correction procedure outlined in the ANSI standard [ANSI S3.5-1969 (R1986)] was made by evaluating auditory (A), visual (V), and auditory-visual sentence identification for both wideband speech degraded by additive noise and a variety of bandpass-filtered speech conditions presented in quiet and in noise. When the data for each of the different listening conditions were averaged across talkers and subjects, the procedure outlined in the standard was fairly well supported, although deviations from the predicted AV score were noted for individual subjects as well as individual talkers. For filtered speech signals with AI(A) < 0.25, there was a tendency for the standard to underpredict AV scores. Conversely, for signals with AI(A) > 0.25, the standard consistently overpredicted AV scores. Additionally, synergistic effects, where the AI(A) obtained from the combination of different bandpass-filtered conditions was greater than the sum of the individual AI(A)'s, were observed for all nonadjacent filter-band combinations (e.g., the addition of a low-pass band with a 630-Hz cutoff and a high-pass band with a 3150-Hz cutoff). These latter deviations from the standard violate the basic assumption of additivity stated by Articulation Theory, but are consistent with earlier reports by Pollack [I. Pollack, J. Acoust. Soc. Am. 20, 259-266 (1948)], Licklider [J. C. R. Licklider, Psychology: A study of a Science, Vol. 1, edited by S. Koch (McGraw-Hill, New York, 1959), pp. 41-144], and Kryter [K. D. Kryter, J. Acoust. Soc. Am. 32, 547-556 (1960)].
引用
收藏
页码:2952 / 2960
页数:9
相关论文
共 38 条
[1]  
BRAIDA LD, 1988, J ACOUST SOC AM, V84, pS142
[2]   SPEECH-READING SUPPLEMENTED WITH AUDITORILY PRESENTED SPEECH PARAMETERS [J].
BREEUWER, M ;
PLOMP, R .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1986, 79 (02) :481-499
[3]  
BREEUWER M, 1984, J ACOUST SOC AM, V76, P686, DOI 10.1121/1.391255
[4]   SPEECHREADING SUPPLEMENTED WITH FORMANT-FREQUENCY INFORMATION FROM VOICED SPEECH [J].
BREEUWER, M ;
PLOMP, R .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1985, 77 (01) :314-317
[5]   UNDERLYING STRUCTURE OF AUDITORY-VISUAL CONSONANT PERCEPTION BY HEARING-IMPAIRED CHILDREN AND THE INFLUENCES OF SYLLABIC COMPRESSION [J].
BUSBY, PA ;
TONG, YC ;
CLARK, GM .
JOURNAL OF SPEECH AND HEARING RESEARCH, 1988, 31 (02) :156-165
[6]  
Davis H., 1978, HEARING DEAFNESS
[7]  
ERBER NP, 1972, J SPEECH HEAR RES, V14, P413
[8]   THE PERCEPTION OF SPEECH AND ITS RELATION TO TELEPHONY [J].
FLETCHER, H ;
GALT, RH .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1950, 22 (02) :89-151
[9]   FACTORS GOVERNING THE INTELLIGIBILITY OF SPEECH SOUNDS [J].
FRENCH, NR ;
STEINBERG, JC .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1947, 19 (01) :90-119
[10]  
Grant K. W., 1988, J ACOUST SOC AM, V84, pS45, DOI [10.1121/1.2026321, DOI 10.1121/1.2026321]