Large-vocabulary continuous sign language recognition based on transition-movement models

Cited by: 70
Authors
Fang, Gaolin
Gao, Wen
Zhao, Debin
Affiliations
[1] Harbin Inst Technol, Dept Comp Sci, Harbin 150001, Peoples R China
[2] Inst Comp Technol, Beijing 100080, Peoples R China
Source
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS | 2007 / Vol. 37 / No. 1
Funding
National Natural Science Foundation of China
Keywords
Chinese sign language (CSL); dynamic time warping (DTW); hidden Markov model (HMM); sign language recognition (SLR); temporal clustering algorithm;
DOI
10.1109/TSMCA.2006.886347
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The major challenge that sign language recognition (SLR) now faces is developing methods that solve large-vocabulary continuous sign problems. In this paper, transition-movement models (TMMs) are proposed to handle the transition parts between two adjacent signs in large-vocabulary continuous SLR. To tackle the large number of transition movements that arise from a large vocabulary, a temporal clustering algorithm, improved from k-means by using dynamic time warping as its distance measure, is proposed to cluster them dynamically. An iterative segmentation algorithm is then presented for automatically segmenting transition parts from continuous sentences and training the TMMs through a bootstrap process. Owing to their excellent generalization, the clustered TMMs are well suited to large-vocabulary continuous SLR. Lastly, the TMMs, together with the sign models, are used as candidates in the Viterbi search algorithm for recognizing continuous sign language. Experiments demonstrate that continuous SLR based on TMMs performs well over a large vocabulary of 5113 Chinese signs and obtains an average accuracy of 91.9%.
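The abstract names dynamic time warping (DTW) as the distance measure inside a k-means-style temporal clustering step. The following is a minimal sketch of that idea, not the authors' implementation: it shows a textbook DTW distance between two variable-length feature sequences and a nearest-center assignment step. The function names `dtw_distance` and `assign_to_clusters`, the Euclidean frame distance, and the representation of a segment as a list of feature tuples are all assumptions made for illustration. How cluster centers are updated is not described in the abstract, so that step is omitted.

```python
import math


def dtw_distance(a, b):
    """Classic DTW distance between two sequences of feature vectors.

    Each sequence is a list of equal-dimension tuples (one tuple per frame).
    The per-frame cost is Euclidean distance (an assumption of this sketch).
    """
    n, m = len(a), len(b)
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = d + min(
                cost[i - 1][j],      # frame of a aligned to a repeated frame of b
                cost[i][j - 1],      # frame of b aligned to a repeated frame of a
                cost[i - 1][j - 1],  # both sequences advance
            )
    return cost[n][m]


def assign_to_clusters(segments, centers):
    """Assign each segment to the index of its DTW-nearest center.

    This mirrors only the assignment step of k-means, with DTW replacing
    the usual Euclidean distance between fixed-length vectors.
    """
    return [
        min(range(len(centers)), key=lambda k: dtw_distance(seg, centers[k]))
        for seg in segments
    ]
```

For example, two hypothetical transition segments of different lengths can be compared directly, since DTW warps the time axis before accumulating frame distances:

```python
segments = [[(0.0,), (1.0,)], [(10.0,), (11.0,), (12.0,)]]
centers = [[(0.0,), (1.0,), (1.0,)], [(10.0,), (12.0,)]]
labels = assign_to_clusters(segments, centers)
```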
Pages: 1-9
Page count: 9