Automatic speech recognition using articulatory features from subject-independent acoustic-to-articulatory inversion

被引:30
作者
Ghosh, Prasanta Kumar [1 ]
Narayanan, Shrikanth [1 ]
机构
[1] Univ So Calif, Dept Elect Engn, Signal Anal & Interpretat Lab, Los Angeles, CA 90089 USA
基金
美国国家科学基金会;
关键词
MODELS;
D O I
10.1121/1.3634122
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
An automatic speech recognition approach is presented which uses articulatory features estimated by a subject-independent acoustic-to-articulatory inversion. The inversion allows estimation of articulatory features from any talker's speech acoustics using only an exemplary subject's articulatory-to-acoustic map. Results are reported on a broad class phonetic classification experiment on speech from English talkers using data from three distinct English talkers as exemplars for inversion. Results indicate that the inclusion of the articulatory information improves classification accuracy but the improvement is more significant when the speaking style of the exemplar and the talker are matched compared to when they are mismatched. (C) 2011 Acoustical Society of America
引用
收藏
页码:EL251 / EL257
页数:7
相关论文
共 19 条
  • [1] ATTIAS H, 2003, P INT C AC SPEECH SI, V1
  • [2] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [3] Production models as a structural basis for automatic speech recognition
    Deng, L
    Ramsay, G
    Sun, D
    [J]. SPEECH COMMUNICATION, 1997, 22 (2-3) : 93 - 111
  • [4] FRANKEL J, 2001, P EUR DENM, P599, DOI DOI 10.1109/TSA.2005.851910
  • [5] Garofolo J. S., 1993, TIMIT ACOUSTIC PHONE
  • [6] Ghosh PK, 2011, INT CONF ACOUST SPEE, P4624
  • [7] A generalized smoothness criterion for acoustic-to-articulatory inversion
    Ghosh, Prasanta Kumar
    Narayanan, Shrikanth
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 128 (04) : 2162 - 2172
  • [8] Goldstein L.M., 1986, PHONOLOGY YB, V3, P219, DOI [DOI 10.1017/S0952675700000658, 10.1017/S0952675700000658]
  • [9] Accurate recovery of articulator positions from acoustics: New conclusions based on human data
    Hogden, J
    Lofqvist, A
    Gracco, V
    Zlokarnik, I
    Rubin, P
    Saltzman, E
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 100 (03) : 1819 - 1834
  • [10] Hollander M., 1973, Nonparametric statistical methods