Acoustic-phonetic features for the automatic classification of stop consonants

被引:31
作者
Ali, AMA [1 ]
Van der Spiegel, J [1 ]
Mueller, P [1 ]
机构
[1] Texas Instruments Inc, Res & Dev, Warren, NJ 07059 USA
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2001年 / 9卷 / 08期
关键词
acoustic-phonetic; feature extraction; phoneme recognition; speech recognition; stop consonants;
D O I
10.1109/89.966086
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, the acoustic-phonetic characteristics of the American English stop consonants are investigated. Features studied in the literature are evaluated for their information content and new features are proposed. A statistically guided, knowledge-based, acoustic-phonetic system for the automatic classification of stops, in speaker independent continuous speech, is proposed. The system uses a new auditory-based front-end processing and incorporates new algorithms for the extraction and manipulation of the acoustic-phonetic features that proved to be rich in their information content. Recognition experiments are performed using hard decision algorithms on stops extracted from the TIMIT database continuous speech of 60 speakers (not used in the design process) from seven different dialects of American English. An accuracy of 96% is obtained for voicing detection, 90% for place of articulation detection and 86% for the overall classification of stops.
引用
收藏
页码:833 / 841
页数:9
相关论文
共 46 条
[31]   EFFECT OF BURST AMPLITUDE ON THE PERCEPTION OF STOP CONSONANT PLACE OF ARTICULATION [J].
OHDE, RN ;
STEVENS, KN .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1983, 74 (03) :706-714
[32]  
Potter Ralph, 1947, VISIBLE SPEECH
[33]  
RANGOUSSI M, 1995, P ICASSP, P792
[34]  
SAMUELIAN A, 1997, COMPUT SPEECH LANG, V11, P161
[35]  
SANDHU S, 1995, P IEEE ICASSP, P409
[36]   STOP CONSONANT DISCRIMINATION BASED ON HUMAN AUDITION [J].
SEARLE, CL ;
JACOBSON, JZ ;
RAYMENT, SG .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (03) :799-809
[37]   A JOINT SYNCHRONY MEAN-RATE MODEL OF AUDITORY SPEECH PROCESSING [J].
SENEFF, S .
JOURNAL OF PHONETICS, 1988, 16 (01) :55-76
[38]  
STERN RM, 1992, P DARPA SPEECH 5 NAT, P274
[39]   ROLE OF FORMANT TRANSITIONS IN VOICED-VOICELESS DISTINCTION FOR STOPS [J].
STEVENS, KN ;
KLATT, DH .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 55 (03) :653-659
[40]   INVARIANT CUES FOR PLACE OF ARTICULATION IN STOP CONSONANTS [J].
STEVENS, KN ;
BLUMSTEIN, SE .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 64 (05) :1358-1368