Closure duration analysis of incomplete stop consonants due to stop-stop interaction

被引:14
作者
Ghosh, Prasanta Kumar [1 ]
Narayanan, Shrikanth S. [1 ]
机构
[1] Univ So Calif, Dept Elect Engn, Signal Anal & Interpretat Lab, Los Angeles, CA 90089 USA
关键词
linguistics; speech; RECOGNITION;
D O I
10.1121/1.3141876
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
An incomplete stop consonant is characterized either by an indistinguishable closure or a missing burst. If an incomplete stop happens due to a stop following another stop [stop-stop interaction (SSI)], its acoustics typically resemble that of a complete stop-one closure followed by a single burst. As a consequence, stop detectors would fail to distinguish an SSI from a complete stop. Analysis of the TIMIT corpus shows 35.04% incomplete stops (14.97% SSI). It is shown that by using automatically estimated (and hand-labeled) closure duration, complete stops can be distinguished from incomplete stops due to SSI with 69.66% (79.14%) accuracy.
引用
收藏
页码:EL1 / EL7
页数:7
相关论文
共 13 条
[1]   Acoustic-phonetic features for the automatic classification of stop consonants [J].
Ali, AMA ;
Van der Spiegel, J ;
Mueller, P .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (08) :833-841
[2]  
Browman C.P., 1991, Papers in Laboratory Phonology I: Between the Grammar and the Physics of Speech, P341, DOI DOI 10.1017/CBO9780511627736.019
[3]   THE DURATION OF AMERICAN-ENGLISH STOP CONSONANTS - AN OVERVIEW [J].
CRYSTAL, TH ;
HOUSE, AS .
JOURNAL OF PHONETICS, 1988, 16 (03) :285-294
[4]   Missing information in spoken word recognition: Nonreleased stop consonants [J].
Deelman, T ;
Connine, CM .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2001, 27 (03) :656-663
[5]   STOP-CONSONANT RECOGNITION - RELEASE BURSTS AND FORMANT TRANSITIONS AS FUNCTIONALLY EQUIVALENT, CONTEXT-DEPENDENT CUES [J].
DORMAN, MF ;
STUDDERTKENNEDY, M ;
RAPHAEL, LJ .
PERCEPTION & PSYCHOPHYSICS, 1977, 22 (02) :109-122
[6]  
Garofolo J. S., 1993, TIMIT ACOUSTIC PHONE
[7]   DURATIONAL RELATIONSHIP BETWEEN JAPANESE STOPS AND VOWELS [J].
HOMMA, Y .
JOURNAL OF PHONETICS, 1981, 9 (03) :273-281
[8]   Modeling the temporal dynamics of distinctive feature landmark detectors for speech recognition [J].
Jansen, Aren ;
Niyogi, Partha .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 124 (03) :1739-1758
[9]  
Malbos F., 1994, Proceedings of the IEEE-SP International Symposium on Time-Frequency and Time-Scale Analysis (Cat. No.94TH8007), P612, DOI 10.1109/TFSA.1994.467277
[10]  
MANUEL SY, 1992, P 2 C SPOK LANG PROC, P943