Modelling and segmenting subunits for sign language recognition based on hand motion analysis

被引:73
作者
Han, Junwei [1 ]
Awad, George [1 ]
Sutherland, Alistair [1 ]
机构
[1] Dublin City Univ, Sch Comp, Dublin 9, Ireland
关键词
Sign language recognition; Subunit; Phoneme; Hand motion; Dynamic time warping;
D O I
10.1016/j.patrec.2008.12.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modelling and segmenting subunits is one of the important topics in sign language study. Many scholars have proposed the functional definition to subunits from the view of linguistics while the problem of efficiently implementing it using computer vision techniques is a challenge. On the other hand, a number of subunit segmentation work has been investigated for the task of vision-based sign language recognition whereas their subunits either somewhat lack the linguistic support or are improper. In this paper, we attempt to define and segment subunits using computer vision techniques, which also can be basically explained by sign language linguistics. A subunit is firstly defined as one continuous visual hand action in time and space, which comprises a series of interrelated consecutive frames. Then, a simple but efficient solution is developed to detect the Subunit boundary using hand motion discontinuity. Finally, temporal clustering by dynamic time warping is adopted to merge similar segments and refine the results. The presented work does not need prior knowledge of the types of signs or number of subunits and is more robust to signer behaviour variation. Furthermore, it correlates highly with the definition of syllables in sign language while sharing characteristics of syllables in spoken languages, A set of comprehensive experiments on real-world signing videos demonstrates the effectiveness of the proposed model. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:623 / 633
页数:11
相关论文
共 20 条
  • [1] [Anonymous], 1989, Sign Language Studies
  • [2] Awad G, 2006, INT C PATT RECOG, P239
  • [3] Bauer B., 2001, INT GESTURE WORKSHOP, P64
  • [4] Brentari Diane., 1998, A prosodic model of sign language phonology
  • [5] A novel approach to automatically extracting basic units from Chinese sign language
    Fang, GL
    Gao, XJ
    Gao, W
    Chen, YQ
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, 2004, : 454 - 457
  • [6] Han JW, 2006, PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION - PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE, P237
  • [7] Data clustering: A review
    Jain, AK
    Murty, MN
    Flynn, PJ
    [J]. ACM COMPUTING SURVEYS, 1999, 31 (03) : 264 - 323
  • [8] Klima E.S., 1979, SIGNS LANGUAGE
  • [9] Lee C., 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065), P2796, DOI 10.1109/ROBOT.2000.846451
  • [10] A real-time continuous gesture recognition system for sign language
    Liang, RH
    Ouhyoung, M
    [J]. AUTOMATIC FACE AND GESTURE RECOGNITION - THIRD IEEE INTERNATIONAL CONFERENCE PROCEEDINGS, 1998, : 558 - 567