Production models as a structural basis for automatic speech recognition

Cited by: 37
Authors
Deng, L [1]
Ramsay, G
Sun, D
Affiliations
[1] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
[2] AT&T Bell Labs, Murray Hill, NJ 07974 USA
Keywords
speech production; speech recognition; analysis by synthesis; stochastic modeling; nonlinear phonology; phonetic interface; articulatory features; articulatory dynamics; stochastic target model
DOI
10.1016/S0167-6393(97)00018-6
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
We postulate in this paper that highly structured speech production models will have much to contribute to the ultimate success of speech recognition in view of the weaknesses of the theoretical foundation underpinning current technology. These weaknesses are analyzed in terms of phonological modeling and of phonetic-interface modeling. We present two probabilistic speech recognition models with the structure designed based on approximations to human speech production mechanisms, and conclude by suggesting that many of the advantages to be gained from interaction between speech production and speech recognition communities will develop from integrating production models with the probabilistic analysis-by-synthesis strategy currently used by the technology community. (C) 1997 Elsevier Science B.V.
Pages: 93 - 111
Page count: 19
Related papers
82 in total
  • [1] NASAL CONSONANTS AND INTERNAL STRUCTURE OF SEGMENTS
    ANDERSON, SR
    [J]. LANGUAGE, 1976, 52 (02) : 326 - 344
  • [2] [Anonymous], 1995, P 13 INT C PHON SCI
  • [3] [Anonymous], AUTOMATIC SPEECH SPE, DOI 10.1007/978-1-4613-1367-0_1
  • [4] [Anonymous], THESIS MIT CAMBRIDGE
  • [5] [Anonymous], CONNECTIONIST SPEECH
  • [6] FORMANT TRAJECTORIES AS AUDIBLE GESTURES - AN ALTERNATIVE FOR SPEECH SYNTHESIS
    BAILLY, G
    LABOISSIERE, R
    SCHWARTZ, JL
    [J]. JOURNAL OF PHONETICS, 1991, 19 (01) : 9 - 23
  • [7] BAILLY G, 1995, P 13 INT C PHON SCI, V2, P230
  • [8] BAKIS R, 1993, FRONTIERS SPEECH PRO
  • [9] BAKIS R, 1991, P IEEE WORKSH AUT SP, P20
  • [10] Baum L.E., 1972, Inequalities III: Proceedings of the Third Symposium on Inequalities, V3, P1