A local region based approach to lip tracking

Cited by: 21
Authors
Cheung, Yiu-ming [1]
Liu, Xin [1]
You, Xinge [2]
Affiliations
[1] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
[2] Huazhong Univ Sci & Technol, Elect & Informat Engn Dept, Wuhan 430074, Peoples R China
Keywords
Lip tracking; Localized color active contour model; Semi-ellipse; Local region; Deformable model; IMAGE SEGMENTATION; CONTOUR EXTRACTION; COLOR; FEATURES; MOTION;
DOI
10.1016/j.patcog.2012.02.024
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Lip tracking plays a significant role in lip reading systems. In this paper, we present a local region based approach to lip tracking, which consists of two phases: (i) lip contour extraction for the first lip frame, followed by (ii) lip tracking in the subsequent lip frames. Initially, we construct a localized color active contour model, provided that the foreground and background regions around the object are locally different in color space. In the first phase, we find a combined semi-ellipse around the lip as the initial evolving curve and compute the localized energies for curve evolution such that the lip image is separated into lip and non-lip regions. Then, we utilize a 16-point deformable model (Wang et al., 2004 [20]) with a geometric constraint to achieve lip contour extraction. In the second phase, we dynamically select the radius of the local regions associated with the lip contour extracted from the previous frame to realize lip tracking. The proposed approach not only adapts to lip movement, but is also robust against the appearance of teeth, tongue and black hole. Extensive experiments show the efficiency of the proposed lip tracking algorithm in comparison with existing methods. (C) 2012 Elsevier Ltd. All rights reserved.
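The abstract names a "combined semi-ellipse" as the initial evolving curve but gives no formula. As a rough illustration only, the sketch below assumes the curve is built from two semi-ellipses that share a centre and a horizontal semi-axis, with separate vertical semi-axes for the upper and lower lip. The function name, its parameters and the example values are hypothetical and are not taken from the paper.

```python
import numpy as np

def combined_semi_ellipse_mask(shape, center, a, b_upper, b_lower):
    """Boolean mask that is True inside a curve made of two semi-ellipses.

    Assumption (not from the paper): both halves share the centre and the
    horizontal semi-axis `a`; the upper half uses the vertical semi-axis
    `b_upper`, the lower half uses `b_lower`.
    """
    rows, cols = np.indices(shape)
    cy, cx = center
    dx = (cols - cx) / a
    dy = rows - cy
    # Image rows grow downward, so dy < 0 is the upper half of the curve.
    b = np.where(dy < 0, b_upper, b_lower)
    return dx**2 + (dy / b)**2 <= 1.0

# Hypothetical 60x80 lip image with the mouth centred at row 30, column 40.
mask = combined_semi_ellipse_mask((60, 80), (30, 40), a=30, b_upper=10, b_lower=15)
```

A mask of this kind could serve as the initial inside/outside partition for a region-based active contour; the localized energy computation and the 16-point deformable model fit described in the abstract are not reproduced here.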
Pages: 3336-3347
Number of pages: 12
Related papers
39 in total
[1]   Discriminative analysis of lip motion features for speaker identification and speech-reading [J].
Cetinguel, H. Ertan ;
Yemez, Yuecel ;
Erzin, Engin ;
Tekalp, A. Murat .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (10) :2879-2891
[2]   Lipreading from color video [J].
Chiou, GI ;
Hwang, JN .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1997, 6 (08) :1192-1195
[3]   A review of statistical approaches to level set segmentation: Integrating color, texture, motion and shape [J].
Cremers, Daniel ;
Rousson, Mikael ;
Deriche, Rachid .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 72 (02) :195-215
[4]   Automatic Snakes for robust lip boundaries extraction [J].
Delmas, P ;
Coulon, PY ;
Fristot, V .
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, :3069-3072
[5]  
Delmas P., 2002, 2002 7th International Conference on Control, Automation, Robotics and Vision (IEEE Cat. No.02EX649), P1421
[6]   Accurate and quasi-automatic lip tracking [J].
Eveno, N ;
Caplier, A ;
Coulon, PY .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2004, 14 (05) :706-715
[7]  
Eveno N, 2002, IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, pA125
[8]   Contour tracking in clutter: A subset approach [J].
Freedman, D ;
Brandstein, MS .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2000, 38 (02) :173-186
[9]  
HOU HS, 1978, IEEE T ACOUST SPEECH, V26, P508
[10]  
Jian YD, 2006, LECT NOTES COMPUT SC, V3851, P653