Design and use of linear models for image motion analysis

被引:83
作者
Fleet, DJ [1 ]
Black, MJ
Yacoob, Y
Jepson, AD
机构
[1] Queens Univ, Dept Comp & Informat Sci, Kingston, ON K7L 3N6, Canada
[2] Xerox Corp, Palo Alto Res Ctr, Palo Alto, CA 94304 USA
[3] Univ Maryland, Comp Vis Lab, College Pk, MD 20742 USA
[4] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 1A4, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
optical flow; motion discontinuities; occlusion; steerable filters; learning; eigenspace methods; motion-based recognition; non-rigid and articulated motion;
D O I
10.1023/A:1008156202475
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
Linear parameterized models of optical flow, particularly affine models, have become widespread in image motion analysis. The linear model coefficients are straightforward to estimate, and they provide reliable estimates of the optical flow of smooth surfaces. Here we explore the use of parameterized motion models that represent much more varied and complex motions. Our goals are threefold: to construct linear bases for complex motion phenomena; to estimate the coefficients of these linear models; and to recognize or classify image motions from the estimated coefficients. We consider two broad classes of motions: i) generic "motion features" such as motion discontinuities and moving bars; and ii) non-rigid, object-specific, motions such as the motion of human mouths. For motion features we construct a basis of steerable flow fields that approximate the motion features. For object-specific motions we construct basis flow fields from example motions using principal component analysis. In both cases, the model coefficients can be estimated directly from spatiotemporal image derivatives with a robust, multi-resolution scheme. Finally, we show how these model coefficients can be use to detect and recognize specific motions such as occlusion boundaries and facial expressions.
引用
收藏
页码:171 / 193
页数:23
相关论文
共 64 条
[1]
[Anonymous], 1992, THEORY
[2]
[Anonymous], P EUR C COMP VIS ECC
[3]
AYER S, 1995, FIFTH INTERNATIONAL CONFERENCE ON COMPUTER VISION, PROCEEDINGS, P777, DOI 10.1109/ICCV.1995.466859
[4]
Robust optic flow computation [J].
Bab-Hadiashar, A ;
Suter, D .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1998, 29 (01) :59-77
[5]
PERFORMANCE OF OPTICAL-FLOW TECHNIQUES [J].
BARRON, JL ;
FLEET, DJ ;
BEAUCHEMIN, SS .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1994, 12 (01) :43-77
[6]
BAUMBERG A, 1994, LECT NOTES COMPUTER, V800, P299
[7]
The frequency structure of one-dimensional occluding image signals [J].
Beauchemin, SS ;
Barron, JL .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (02) :200-206
[8]
The ''independent components'' of natural scenes are edge filters [J].
Bell, AJ ;
Sejnowski, TJ .
VISION RESEARCH, 1997, 37 (23) :3327-3338
[9]
A 3-FRAME ALGORITHM FOR ESTIMATING 2-COMPONENT IMAGE MOTION [J].
BERGEN, JR ;
BURT, PJ ;
HINGORANI, R ;
PELEG, S .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1992, 14 (09) :886-896
[10]
Feature correspondence by interleaving shape and texture computations [J].
Beymer, D .
1996 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1996, :921-928