Analysis of speech segment duration with the lognormal distribution: A basis for unification and comparison

被引:23
作者
Rosen, KM [1 ]
机构
[1] Univ Wisconsin, Waisman Ctr, Madison, WI 53705 USA
关键词
D O I
10.1016/j.wocn.2005.02.001
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
This study re-examines published data with the lognormal distribution (LND) and presents a basis for the unification of many previous measurements of speech segment duration in connected speech. The application of the LND was motivated by the connection between previous speech models and the law of proportionate effects, which is known to generate LNDs. Distributions of speech segment length in previous studies [Psycholinguistics: Experiments in Spontaneous Speech, 1968; Language and Speech 25 (1982) 11-28; Journal of the Acoustical Society of America 72 (1982) 705-716; Journal of the Acoustical Society of America 83 (1988a) 1553-1573; Journal of the Acoustical Society of America 83 (1988b) 1574-1585; Speech Communication 19 (1996) 161-176] were re-plotted onto lognormal cumulative plots. With the exceptions of stressed consonants and the phoneme /f/, the data were consistent with the LND, based on the results of the Kolmogorov-Smirnov test and root mean square error of the least-squares fit. Aside from the exceptions, the results indicate that (1) the duration of pauses, vowels and consonant classes can be effectively modeled with two parameters (geometric mean and geometric standard deviation), and (2) linguistic and non-linguistic effects are proportionate to duration and combine multiplicatively. Analysis with the LND revealed specific characteristics in some of the distributions that were not observed in the original analysis with linear-scaled distributions. Examples of how the LND may be used to detect heterogeneous groups in data sets, to determine outliers, and to reveal differences in underlying processes (e.g., existence of incompressible portions) are given. Advantages of using LND parameters (i.e., geometric mean, geometric standard deviation) over linear parameters (e.g., coefficient of variation) are also discussed. (c) 2005 Elsevier Ltd. All rights reserved.
引用
收藏
页码:411 / 426
页数:16
相关论文
共 50 条
[21]  
Goldman-Eisler F., 1968, PSYCHOLINGUISTICS EX
[22]   From physical time to the first and second moments of psychological time [J].
Grondin, S .
PSYCHOLOGICAL BULLETIN, 2001, 127 (01) :22-44
[23]  
JOHNSON NL, 1970, CONTINUOUS UNIVARIAN
[24]   THE LOG TRANSFORMATION IS SPECIAL [J].
KEENE, ON .
STATISTICS IN MEDICINE, 1995, 14 (08) :811-819
[25]   SPEECH SEGMENT DURATIONS IN SENTENCE RECITATIONS BY CHILDREN AND ADULTS [J].
KENT, RD ;
FORNER, LL .
JOURNAL OF PHONETICS, 1980, 8 (02) :157-168
[26]  
KILLEEN PR, 1984, TIMING TIME PERCEPTI, P515
[27]  
KIRSNER K, 2003, GOTHENBURG PAPERS TH, P13
[28]   DURATION OF [S] IN ENGLISH WORDS [J].
KLATT, D .
JOURNAL OF SPEECH AND HEARING RESEARCH, 1974, 17 (01) :51-63
[29]  
Klatt D.H., 1975, J PHONETICS, V3, P129, DOI [DOI 10.1016/S0095-4470(19)31360-9, 10.1016/S0095-4470(19)31360-9]
[30]   INTERACTION BETWEEN 2 FACTORS THAT INFLUENCE VOWEL DURATION [J].
KLATT, DH .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1973, 54 (04) :1102-1104