SAVI: an actively controlled teleconferencing system

被引:8
作者
Herpers, R
Derpanis, K
MacLean, WJ
Verghese, G
Jenkin, M
Milios, E
Jepson, A
Tsotsos, JK
机构
[1] Univ Appl Sci St Augustin, Fachhsch Bonn Rhein Sieg, Dept Appl Comp Sci, D-53757 St Augustin, Germany
[2] York Univ, Dept Comp Sci, York Ctr Vis Res, N York, ON M3J 1P3, Canada
[3] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 3G4, Canada
[4] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3G4, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
active vision interface; teleconferencing system; face and hand gesture recognition; Computer Vision System;
D O I
10.1016/S0262-8856(00)00107-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A Stereo Active Vision Interface (SAVI) is introduced which detects frontal faces in real world environments and performs particular active control tasks dependent on hand gestures given by the person the system attends to. The SAVI system is thought of as a smart user interface for teleconferencing, telemedicine, and distance learning applications. To reduce the search space in the visual scene the processing is started with the detection of connected skin colour regions applying a new radial scanline algorithm. Subsequently, in the most salient skin colour region facial features are searched for while the skin colour blob is actively kept in the centre of the visual field of the camera system. After a successful evaluation of the facial features the associated person is able to give control commands to the system. For this contribution only visual control commands are investigated but there is no limitation for voice or any other commands. These control commands can either effect the observing system itself or any other active or robotic system wired to the principle observing system via TCP/IP sockets. The system is designed as a perception-action-cycle (PAC), processing sensory data of different kinds and qualities. Both the vision module and the head motion control module work at frame rate on a PC platform. Hence, the system is able to react instantaneously to changing conditions in the visual scene. (C) 2001 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:793 / 804
页数:12
相关论文
共 23 条
  • [1] CAI J, 1998, INT WORKSH MULT MED
  • [2] DAVINVCI L, 1550, TRATISE PAINTING, V1
  • [3] Freeman William T., 1995, P IEEE INT WORKSH AU, P179
  • [4] THE DESIGN AND USE OF STEERABLE FILTERS
    FREEMAN, WT
    ADELSON, EH
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1991, 13 (09) : 891 - 906
  • [5] Computer vision for computer games
    Freeman, WT
    Tanaka, K
    Ohta, J
    Kyuma, K
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, 1996, : 100 - 105
  • [6] HAMDAM R, 1999, P IEEE COMP SOC C CO, V2, P98
  • [7] Herpers R., 1999, Proceedings International Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems. In Conjunction with ICCV'99 (Cat. No.PR00378), P96, DOI 10.1109/RATFG.1999.799230
  • [8] Edge and keypoint detection in facial regions
    Herpers, R
    Michaelis, M
    Lichtenauer, KHH
    Sommer, G
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, 1996, : 212 - 217
  • [9] HERPERS R, 1995, IEEE P INT WORKSH AU, P214
  • [10] HERPERS R, 1997, 9714 U KIEL