Closure duration analysis of incomplete stop consonants due to stop-stop interaction

被引：14

作者：

Ghosh, Prasanta Kumar ^{[1
]}

Narayanan, Shrikanth S. ^{[1
]}

机构：

[1] Univ So Calif, Dept Elect Engn, Signal Anal & Interpretat Lab, Los Angeles, CA 90089 USA

来源：

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2009年 / 126卷 / 01期

关键词：

linguistics; speech; RECOGNITION;

D O I：

10.1121/1.3141876

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

An incomplete stop consonant is characterized either by an indistinguishable closure or a missing burst. If an incomplete stop happens due to a stop following another stop [stop-stop interaction (SSI)], its acoustics typically resemble that of a complete stop-one closure followed by a single burst. As a consequence, stop detectors would fail to distinguish an SSI from a complete stop. Analysis of the TIMIT corpus shows 35.04% incomplete stops (14.97% SSI). It is shown that by using automatically estimated (and hand-labeled) closure duration, complete stops can be distinguished from incomplete stops due to SSI with 69.66% (79.14%) accuracy.

引用

页码：EL1 / EL7

页数：7

共 13 条

[1] Acoustic-phonetic features for the automatic classification of stop consonants [J].

Ali, AMA ;

Van der Spiegel, J ;

Mueller, P .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (08) :833-841

[2]

Browman C.P., 1991, Papers in Laboratory Phonology I: Between the Grammar and the Physics of Speech, P341, DOI DOI 10.1017/CBO9780511627736.019

[3] THE DURATION OF AMERICAN-ENGLISH STOP CONSONANTS - AN OVERVIEW [J].

CRYSTAL, TH ;

HOUSE, AS .

JOURNAL OF PHONETICS, 1988, 16 (03) :285-294

[4] Missing information in spoken word recognition: Nonreleased stop consonants [J].

Deelman, T ;

Connine, CM .

JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2001, 27 (03) :656-663

[5] STOP-CONSONANT RECOGNITION - RELEASE BURSTS AND FORMANT TRANSITIONS AS FUNCTIONALLY EQUIVALENT, CONTEXT-DEPENDENT CUES [J].

DORMAN, MF ;

STUDDERTKENNEDY, M ;

RAPHAEL, LJ .

PERCEPTION & PSYCHOPHYSICS, 1977, 22 (02) :109-122

[6]

Garofolo J. S., 1993, TIMIT ACOUSTIC PHONE

[7] DURATIONAL RELATIONSHIP BETWEEN JAPANESE STOPS AND VOWELS [J].

HOMMA, Y .

JOURNAL OF PHONETICS, 1981, 9 (03) :273-281

[8] Modeling the temporal dynamics of distinctive feature landmark detectors for speech recognition [J].

Jansen, Aren ;

Niyogi, Partha .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 124 (03) :1739-1758

[9]

Malbos F., 1994, Proceedings of the IEEE-SP International Symposium on Time-Frequency and Time-Scale Analysis (Cat. No.94TH8007), P612, DOI 10.1109/TFSA.1994.467277

[10]

MANUEL SY, 1992, P 2 C SPOK LANG PROC, P943

← 1 2 →