IDENTIFICATION OF STEADY-STATE VOWELS SYNTHESIZED FROM THE PETERSON AND BARNEY MEASUREMENTS

被引:30
作者
HILLENBRAND, J [1 ]
GAYVERT, RT [1 ]
机构
[1] RIT RES CORP,ROCHESTER,NY 14623
关键词
D O I
10.1121/1.406884
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The purpose of this study was to determine how well listeners can identify vowels based exclusively on static spectral cues. This was done by asking listeners to identify steady-state synthesized versions of 1520 vowels (76 talkers X 10 vowels X 2 repetitions) using Peterson and Barney's measured values of F0 and F1-F3 [J. Acoust. Soc. Am. 24, 175-184 (1952)]. The values for all control parameters remained constant throughout the 300-ms duration of each stimulus. A second set of 1520 signals was identical to these stimuli except that a falling pitch contour was used. The identification error rate for the flat-formant, flat-pitch signals was 27.3%, several times greater than the 5.6% error rate shown by Peterson and Barney's listeners. The introduction of a falling pitch contour resulted in a small but statistically reliable reduction in the error rate. The implications of these results for interpreting pattern recognition studies using the Peterson and Barney database are discussed. Results are also discussed in relation to the role of dynamic cues in vowel identification.
引用
收藏
页码:668 / 674
页数:7
相关论文
共 43 条
[1]   DURATION AS A CUE IN RECOGNITION OF SYNTHETIC VOWELS [J].
AINSWORTH, WA .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1972, 51 (02) :648-+
[2]  
[Anonymous], 1978, PHONETIC FEATURE SYS
[3]   VOWEL IDENTIFICATION - ORTHOGRAPHIC, PERCEPTUAL, AND ACOUSTIC ASPECTS [J].
ASSMANN, PF ;
NEAREY, TM ;
HOGAN, JT .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1982, 71 (04) :975-989
[4]   SPECTRAL FORM AND DURATION AS CUES IN RECOGNITION OF ENGLISH AND GERMAN VOWELS [J].
BENNETT, DC .
LANGUAGE AND SPEECH, 1968, 11 :65-&
[5]   Natural Frequency, Duration, and Intensity of Vowels in Reading [J].
Black, John W. .
JOURNAL OF SPEECH AND HEARING DISORDERS, 1949, 14 (03) :216-221
[6]  
Bladon A., 1982, REPRESENTATION SPEEC, P95
[7]   MODELING THE JUDGMENT OF VOWEL QUALITY DIFFERENCES [J].
BLADON, RAW ;
LINDBLOM, B .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1981, 69 (05) :1414-1422
[8]  
Carlson R., 1975, AUDITORY ANAL PERCEP, P55, DOI DOI 10.1016/B978-0-12-248550-3.50008-8
[9]   FREQUENCY AND TIME VARIATIONS OF THE 1ST FORMANT - PROPERTIES RELEVANT TO THE PERCEPTION OF VOWEL HEIGHT [J].
DIBENEDETTO, MG .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1989, 86 (01) :67-77
[10]   VOWEL REPRESENTATION - SOME OBSERVATIONS ON TEMPORAL AND SPECTRAL PROPERTIES OF THE 1ST FORMANT FREQUENCY [J].
DIBENEDETTO, MG .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1989, 86 (01) :55-66