AUTOMATIC DETECTION OF PROSODIC BOUNDARIES IN SPEECH

Cited by: 10
Author
CAMPBELL, N
Affiliation
[1] Advanced Telecommunications Research Institute, Interpreting Telephony Research Laboratories, Kyoto
Keywords
SPEECH-SEGMENTATION; DURATION; SYLLABLES; PHRASING;
DOI
10.1016/0167-6393(93)90033-H
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper describes a method for automatic annotation of prosodic events in speech, using segmental duration information. It details a way of differentiating prominence-related lengthening from boundary-related lengthening, using durational clues alone, and discusses an anomaly in the phrasing characteristics of four speakers' readings of 200 phonetically-balanced sentences. An algorithm is described that uses syllable-level differences in normalised segmental duration measures to detect prosodic boundaries in a speech signal. Tests with read-speech data from four British-English RP speakers show high agreement between speakers with respect to the number of boundaries detected and the length of the phrases delimited by each pair of boundaries, but the correlation between speakers on actual boundary locations is low. There is particular disagreement between speakers in the case of a single function word linking two groups of content words. This discrepancy can be resolved if the boundary is taken to be at the function word location itself, rather than at one or other side of the word. These results are taken to indicate some freedom in the placement of prosodic boundaries in such cases, sometimes being cued by a syntactic boundary, and sometimes by a rhythmic one.
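The boundary-detection idea in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes that "normalised segmental duration" means a per-phone z-score, that a syllable's score is the mean of its segments' z-scores, and that a boundary is hypothesised after a syllable whose score exceeds the next syllable's by more than a threshold. The table `PHONE_STATS`, the function names, and the threshold value are all hypothetical, and the sketch makes no attempt to separate prominence-related from boundary-related lengthening.

```python
from statistics import mean

# Hypothetical per-phone duration statistics in ms: (mean, standard deviation).
# Real values would be estimated from a labelled speech corpus.
PHONE_STATS = {
    "a": (90.0, 30.0),
    "t": (60.0, 20.0),
    "s": (100.0, 25.0),
    "n": (65.0, 20.0),
}


def z_score(phone, duration_ms):
    """Normalise one segment's duration against its phone class (assumed z-score)."""
    mu, sigma = PHONE_STATS[phone]
    return (duration_ms - mu) / sigma


def syllable_scores(syllables):
    """Mean normalised duration for each syllable.

    Each syllable is a list of (phone, duration_ms) pairs.
    """
    return [mean([z_score(p, d) for p, d in syl]) for syl in syllables]


def detect_boundaries(syllables, threshold=1.0):
    """Return indices of syllables followed by a hypothesised boundary.

    A boundary is flagged after syllable i when its score exceeds that of
    syllable i + 1 by more than `threshold` (an assumed, untuned value).
    """
    scores = syllable_scores(syllables)
    return [
        i
        for i in range(len(scores) - 1)
        if scores[i] - scores[i + 1] > threshold
    ]


# Toy utterance: the vowel of the second syllable is strongly lengthened.
example = [
    [("t", 60.0), ("a", 90.0)],
    [("s", 100.0), ("a", 180.0)],
    [("n", 65.0), ("a", 90.0)],
]
print(detect_boundaries(example))
```

In this toy input only the second syllable is lengthened relative to its neighbour, so it is the only one flagged. Comparing adjacent syllables rather than thresholding absolute scores makes the sketch less sensitive to overall speaking rate, although that design choice is also an assumption here.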
Pages: 343-354
Number of pages: 12