Hand gesture recognition using ensembles of radial basis function (RBF) networks and decision trees

被引:2
作者
Gutta, S
Imam, IF
Wechsler, H
机构
[1] GEORGE MASON UNIV,CTR PARALLEL & DISTRIBUTED COMPUTAT,DEPT COMP SCI,FAIRFAX,VA 22030
[2] GEORGE MASON UNIV,MACHINE LEARNING & INFERENCE LAB,DEPT COMP SCI,FAIRFAX,VA 22030
关键词
hybrid systems; hybrid learning; decision trees (AQDT); hand gesture recognition; ensembles of radial basis function (ERBF) networks;
D O I
10.1142/S021800149700038X
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hand gestures are the natural form of communication among people, yet human-computer interaction is still limited to mice movements. The use of hand gestures in the field of human-computer interaction has attracted renewed interest in the past several years. Special glove-based devices have been developed to analyze finger and hand motion and use them to manipulate and explore virtual worlds. To further enrich the naturalness of the interaction, different computer vision-based techniques have been developed. At the same time the need for more efficient systems has resulted in new gesture recognition approaches. In this paper we present an hybrid intelligent system for hand gesture recognition. The hybrid approach consists of an ensemble of connectionist networks radial basis functions (RBF) - and inductive decision trees (AQDT). Cross Validation (CV) experimental results yield a false negative rate of 1.7% and a false positive rate of 1% while the evaluation takes place on a data base including 150 images corresponding to 15 gestures of 5 subjects. In order to assess the robustness of the system, the vocabulary of the gestures has been increased from 15 to 25 and the size of the database from 150 to 750 images corresponding now to 15 subjects. Cross Validation (CV) experimental results yield a false negative rate of 3.6% and a false positive rate of 1.8% respectively. The benefits of our hybrid architecture include (i) robustness via query by consensus as provided by ensembles of networks when facing the inherent variability of the image formation and data acquisition process, (ii) classifications made using decision trees, (iii) flexible and adaptive thresholds as opposed to ad hoc and hard thresholds and (iv) interpretability of the way classification and retrieval is eventually achieved.
引用
收藏
页码:845 / 872
页数:28
相关论文
共 33 条
[1]  
[Anonymous], P IEEE INT WORKSH AU
[2]  
[Anonymous], 1995, Hybrid Intelligent Systems
[3]   Shape analysis using hybrid learning [J].
Bala, J ;
Wechsler, H .
PATTERN RECOGNITION, 1996, 29 (08) :1323-1333
[4]  
BALA J, 1995, P 14 INT JOINT C ART, V1, P719
[5]   DEMOCRACY IN NEURAL NETS - VOTING SCHEMES FOR CLASSIFICATION [J].
BATTITI, R ;
COLLA, AM .
NEURAL NETWORKS, 1994, 7 (04) :691-707
[6]  
Bezdek J.C, 1994, COMPUTATIONAL INTELL, P1
[7]  
BRADSHAW G, 1991, ADV NEURAL INF P SYS, V2, P497
[8]  
CHANG EI, 1991, ADV NEURAL INF P SYS, V2, P380
[9]  
DOWNTON AC, 1991, P 6 INT C IMAG AN PR, V2, P637
[10]  
FLOCCHINI P, 1992, INT J PATTERN RECOGN, V6, P446