A database-based framework for gesture recognition

被引:17
作者
Athitsos, Vassilis [1 ]
Wang, Haijing [1 ]
Stefan, Alexandra [1 ]
机构
[1] Univ Texas Arlington, Comp Sci & Engn Dept, Arlington, TX 76019 USA
基金
美国国家科学基金会;
关键词
Gesture recognition; Hand pose estimation; Embeddings; American Sign Language; Indexing methods; Image and video databases; SIMILARITY SEARCH; SPACES;
D O I
10.1007/s00779-009-0276-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
080201 [机械制造及其自动化];
摘要
Gestures are an important modality for human machine communication. Computer vision modules performing gesture recognition can be important components of intelligent homes, assistive environments, and human computer interfaces. A key problem in recognizing gestures is that the appearance of a gesture can vary widely depending on variables such as the person performing the gesture, or the position and orientation of the camera. This paper presents a database-based approach for addressing this problem. The large variability in appearance among different examples of the same gesture is addressed by creating large gesture databases, that store enough exemplars from each gesture to capture the variability within that gesture. This database-based approach is applied to two gesture recognition problems: handshape categorization and motion-based recognition of American Sign Language signs. A key aspect of our approach is the use of database indexing methods, in order to address the challenge of searching large databases without violating the time constraints of an online interactive system, where system response times of over a few seconds are oftentimes considered unacceptable. Our experiments demonstrate the benefits of the proposed database-based framework, and the feasibility of integrating large gesture databases into online interacting systems.
引用
收藏
页码:511 / 526
页数:16
相关论文
共 68 条
[1]
ALON J, 2005, IEEE MOT WORKSH, P254
[2]
[Anonymous], KNOWL INF SYST
[3]
[Anonymous], 1977, PROC IJCAI
[4]
Athitsos V, 2003, PROC CVPR IEEE, P432
[5]
ATHITSOS V, 2008, IEEE WORKSH COMP VIS
[6]
ATHITSOS V, 2005, INT WORKSH AUD VIS C
[7]
BoostMap: An embedding method for efficient nearest neighbor retrieval [J].
Athitsos, Vassilis ;
Alon, Jonathan ;
Sclaroff, Stan ;
Kollios, George .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (01) :89-104
[8]
Query-sensitive embeddings [J].
Athitsos, Vassilis ;
Hadjieleftheriou, Marios ;
Kollios, George ;
Sclaroff, Stan .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 2007, 32 (02)
[9]
BAUER B, 2001, GEST WORKSH, P64
[10]
Shape matching and object recognition using shape contexts [J].
Belongie, S ;
Malik, J ;
Puzicha, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) :509-522