A Chinese sign language recognition system based on SOFM/SRN/HMM

被引:71
作者
Gao, W
Fang, GL [1 ]
Zhao, DB
Chen, YQ
机构
[1] Harbin Inst Technol, Dept Comp Sci & Engn, Harbin 150001, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Beijing 100080, Peoples R China
基金
中国国家自然科学基金;
关键词
sign language recognition; Chinese sign language; hidden Markov model; self-organizing feature map; simple recurrent network;
D O I
10.1016/S0031-3203(04)00165-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
In sign language recognition (SLR), the major challenges now are developing methods that solve signer-independent continuous sign problems. In this paper, SOFM/HMM is first presented for modeling signer-independent isolated signs. The proposed method uses the self-organizing feature maps (SOFM) as different signers' feature extractor for continuous hidden Markov models (HMM) so as to transform input signs into significant and low-dimensional representations that can be well modeled by the emission probabilities of HMM. Based on these isolated sign models, a SOFM/SRN/HMM model is then proposed for signer-independent continuous SLR. This model applies the improved simple recurrent network (SRN) to segment continuous sign language in terms of transformed SOFM representations, and the outputs of SRN are taken as the HMM states in which the lattice Viterbi algorithm is employed to search the best matched word sequence. Experimental results demonstrate that the proposed system has better performance compared with conventional HMM system and obtains a word recognition rate of 82.9% over a 5113-sign vocabulary and an accuracy of 86.3% for signer-independent continuous SLR. (C) 2004 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:2389 / 2402
页数:14
相关论文
共 43 条
[1]
Hybrid HMM-NN modeling of stationary-transitional units for continuous speech recognition [J].
Albesano, D ;
Gemello, R ;
Mana, F .
INFORMATION SCIENCES, 2000, 123 (1-2) :3-11
[2]
[Anonymous], 1986, 8604 ICS U CAL
[3]
Assan M., 1997, P GEST WORKSH, P97
[4]
Bauer B., 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580), P440, DOI 10.1109/AFGR.2000.840672
[5]
Bauer B., 2001, International Gesture Workshop, V2298, P64
[6]
GLOBAL OPTIMIZATION OF A NEURAL NETWORK-HIDDEN MARKOV MODEL HYBRID [J].
BENGIO, Y ;
DEMORI, R ;
FLAMMIA, G ;
KOMPE, R .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1992, 3 (02) :252-259
[7]
CONTINUOUS SPEECH RECOGNITION BY CONNECTIONIST STATISTICAL-METHODS [J].
BOURLARD, H ;
MORGAN, N .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1993, 4 (06) :893-909
[8]
CORRADINI A, 2001, P INT GEST WORKSH, P34
[9]
Corrivetti G, 2000, CONFIN CEPHALALGICA, V9, P177
[10]
FINDING STRUCTURE IN TIME [J].
ELMAN, JL .
COGNITIVE SCIENCE, 1990, 14 (02) :179-211