Modeling the temporal dynamics of distinctive feature landmark detectors for speech recognition

被引:13
作者
Jansen, Aren [1 ]
Niyogi, Partha [1 ]
机构
[1] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
关键词
D O I
10.1121/1.2956472
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper elaborates on a computational model for speech recognition that is inspired by several interrelated strands of research in phonology, acoustic phonetics, speech perception, and neuroscience. The goals are twofold: (i) to explore frameworks for recognition that may provide a viable alternative to. the current hidden Markov model (HMM) based speech recognition systems and (ii) to provide a computational platform that will facilitate engaging, quantifying, and testing various theories in the scientific traditions in phonetics, psychology, and neuroscience. This motivation leads to an approach that constructs a hierarchically structured point process representation based on distinctive feature landmark detectors and probabilistically integrates the firing patterns of these detectors to decode a phonological sequence. The accuracy of a broad class recognizer based on this framework is competitive with equivalent HMM-based systems. Various avenues for future development of the presented methodology are outlined. (C) 2008 Acoustical Society of America.
引用
收藏
页码:1739 / 1758
页数:20
相关论文
共 39 条
[1]   Robust acoustic object detection [J].
Amit, Y ;
Koloydenko, A ;
Niyogi, P .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 118 (04) :2634-2648
[2]  
[Anonymous], ACOUSTIC PHOENETICS
[3]  
BURGES C, 2002, TR200202 U CHIC COMP
[4]  
CHANG S, 2000, P ICSLP INT C SPOK L
[5]   Template-based spike pattern identification with linear convolution and dynamic time warping [J].
Chi, Zhiyi ;
Wu, Wei ;
Haga, Zach ;
Hatsopoulos, Nicholas G. ;
Margoliash, Daniel .
JOURNAL OF NEUROPHYSIOLOGY, 2007, 97 (02) :1221-1235
[6]  
Chomsky Noam., 1968, The sound pattern of English
[7]  
DEMIROGLU C, 2004, P ACSSC AS C SIGN SY
[8]  
DESHMUKH O, 2002, P ICASSP INT C AC SP
[9]  
ESPYWILSON C, 2007, P INT ANTW BELG AUG
[10]   ACOUSTIC MEASURES FOR LINGUISTIC FEATURES DISTINGUISHING THE SEMIVOWELS WJRL IN AMERICAN ENGLISH [J].
ESPYWILSON, CY .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1992, 92 (02) :736-757