Segmentation and tracking of multiple video objects

Cited by: 42
Authors
Colombari, A. [1]
Fusiello, A. [1]
Murino, V. [1]
Affiliations
[1] Univ Verona, Dipartimento Informat, I-37134 Verona, Italy
Keywords
content-based representation; MPEG; video coding; video sequence analysis; mosaicing; motion segmentation
DOI
10.1016/j.patcog.2006.07.008
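Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 [Pattern recognition and intelligent systems]; 0812 [Computer science and technology]; 0835 [Software engineering]; 1405 [Intelligence science and technology]
Abstract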
This paper describes a technique that produces a content-based representation of a video shot composed of a background (still) mosaic and one or more foreground moving objects. Segmentation of moving objects is based on ego-motion compensation and on background modelling using tools from robust statistics. Region matching is carried out by an algorithm that operates on the Mahalanobis distance between region descriptors in two consecutive frames and uses singular value decomposition to compute a set of correspondences satisfying both the principle of proximity and the principle of exclusion. The sequence is represented as a layered graph, and specific techniques are introduced to cope with crossing and occlusion. Examples of MPEG-4 (main profile) encoding are reported. © 2006 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
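The region-matching step in the abstract can be illustrated with a minimal sketch. It assumes a Scott and Longuet-Higgins style pairing, which the abstract does not confirm: a Gaussian affinity is built from squared Mahalanobis distances, the singular values of that matrix are replaced by ones, and a pair is accepted only when its entry dominates both its row and its column. The function name `match_regions`, the shared covariance `cov`, and the scale `sigma` are hypothetical and are not taken from the paper.

```python
import numpy as np

def match_regions(desc_a, desc_b, cov, sigma=1.0):
    """Pair rows of desc_a (frame t) with rows of desc_b (frame t+1).

    Hypothetical helper: desc_a is (m, d), desc_b is (n, d), and cov is an
    assumed (d, d) descriptor covariance shared by all regions.
    """
    # Squared Mahalanobis distance between every pair of descriptors.
    diff = desc_a[:, None, :] - desc_b[None, :, :]            # shape (m, n, d)
    cov_inv = np.linalg.inv(cov)
    d2 = np.einsum("mnd,de,mne->mn", diff, cov_inv, diff)     # shape (m, n)

    # Proximity: nearby descriptors receive an affinity close to 1.
    G = np.exp(-d2 / (2.0 * sigma ** 2))

    # Exclusion: setting the singular values to 1 yields the matrix with
    # orthonormal rows or columns that is closest to G, which favours
    # one-to-one pairings.
    U, _, Vt = np.linalg.svd(G, full_matrices=False)
    P = U @ Vt

    # Accept (i, j) only if P[i, j] is the maximum of row i and of column j.
    pairs = []
    for i in range(P.shape[0]):
        j = int(np.argmax(P[i]))
        if int(np.argmax(P[:, j])) == i:
            pairs.append((i, j))
    return pairs
```

Regions left unpaired by the row-and-column test would be candidates for appearing, disappearing, or occluded objects; how the paper handles those cases through its layered graph is not specified in the abstract.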
Pages: 1307-1317
Number of pages: 11