Identification of frequency-shifted vowels

被引:28
作者
Assmann, Peter F. [1 ]
Nearey, Terrance M. [2 ]
机构
[1] Univ Texas Dallas, Sch Behav & Brain Sci, Richardson, TX 75083 USA
[2] Univ Alberta, Dept Linguist, Edmonton, AB T6G 2E7, Canada
基金
美国国家科学基金会;
关键词
D O I
10.1121/1.2980456
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Within certain limits, speech intelligibility is preserved with upward or downward scaling of the spectral envelope. To study these limits and assess their interaction with fundamental frequency (F0), vowels in /hVd/ syllables were processed using the STRAIGHT vocoder and presented to listeners for identification. Identification accuracy showed a gradual decline when the spectral envelope was scaled up or down in vowels spoken by men, women, and children. Upward spectral envelope shifts led to poorer identification of children's vowels compared to adults, while downward shifts had a greater impact on men's vowels compared to women and children. Coordinated shifts (F0 and spectral envelope shifted in the same direction) generally produced higher accuracy than conditions with F0 and spectral envelope shifted in opposite directions. Vowel identification was poorest in conditions with very high F0, consistent with suggestions from the literature that sparse sampling of the spectral envelope may be a factor in vowel identification. However, the gradual decline in accuracy as a function of both upward and downward spectral envelope shifts and the interaction between spectral envelope shifts and F0 suggests the additional operation of perceptual mechanisms sensitive to the statistical covariation of F0 and formant frequencies in natural speech. (C) 2008 Acoustical Society of America. [DOI: 10.1121/1.2980456]
引用
收藏
页码:3203 / 3212
页数:10
相关论文
共 40 条
[1]  
[Anonymous], 1991, Research design and statistical analysis
[2]   Relationship between fundamental and formant frequencies in voice preference [J].
Assmann, Peter F. ;
Nearey, Terrance M. .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 122 (02) :EL35-EL43
[3]   Synthesis fidelity and time-varying spectral change in vowels [J].
Assmann, PF ;
Katz, WF .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 117 (02) :886-895
[4]   Time-varying spectral change in the vowels of children and adults [J].
Assmann, PF ;
Katz, WF .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2000, 108 (04) :1856-1866
[5]  
ASSMANN PF, 2006, P 9 INT C SPOK LANG, P889
[6]  
Chiba T., 1941, The vowel, its nature and structure
[7]   Missing-data model of vowel identification [J].
de Cheveigné, A ;
Kawahara, H .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 105 (06) :3497-3508
[8]   On explaining certain male-female differences in the phonetic realization of vowel categories [J].
Diehl, RL ;
Lindblom, B ;
Hoemeke, KA ;
Fahey, RP .
JOURNAL OF PHONETICS, 1996, 24 (02) :187-208
[9]   Formant-frequency matching between sounds with different bandwidths and on different fundamental frequencies [J].
Dissard, P ;
Darwin, CJ .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 110 (01) :409-415
[10]   Recognition of spectrally degraded and frequency-shifted vowels in acoustic and electric hearing [J].
Fu, QJ ;
Shannon, RV .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 105 (03) :1889-1900