Multibody grouping from motion images

被引:120
作者
Gear, CW [1 ]
机构
[1] NEC Res Inst, Princeton, NJ 08540 USA
关键词
clustering; rigid body motion; vision;
D O I
10.1023/A:1008026310903
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We want to deduce, from a sequence of noisy two-dimensional images of a scene of several rigid bodies moving independently in three dimensions, the number of bodies and the grouping of given feature points in the images to the bodies. Prior processing is assumed to have identified features or points common to all frames and the images are assumed to be created by orthographic projection (i.e., perspective effects are minimal). We describe a computationally inexpensive algorithm that can determine which points or features belong to which rigid body using the fact that, with exact observations in orthographic projection, points on a single body lie in a three or less dimensional linear manifold of frame space. If there are enough observations and independent motions, these manifolds can be viewed as a set linearly independent, four or less dimensional subspaces. We show that the row echelon canonical form provides direct information on the grouping of points to these subspaces. Treatment of the noise is the most difficult part of the problem. This paper uses a statistical approach to estimate the grouping of points to subspaces in the presence of noise by computing which partition has the maximum likelihood. The input data is assumed to be contaminated with independent Gaussian noise. The algorithm can base its estimates on a user-supplied standard deviation of the noise, or it can estimate the noise from the data. The algorithm can also be used to estimate the probability of a user-specified partition so that the hypothesis can be combined with others using Bayesian statistics.
引用
收藏
页码:133 / 150
页数:18
相关论文
共 17 条
[1]  
Boult T. E., 1991, Proceedings of the IEEE Workshop on Visual Motion (Cat. No.91TH0390-5), P179, DOI 10.1109/WVM.1991.212809
[2]  
COSTEIRA J, 1995, FIFTH INTERNATIONAL CONFERENCE ON COMPUTER VISION, PROCEEDINGS, P1071, DOI 10.1109/ICCV.1995.466815
[3]  
DEMSTER AP, 1977, P ROY STAT SOC B, V39, P1
[4]  
GEAR CW, 1993, TR93099 NEC RES I
[5]  
GEAR CW, 1993, P 1994 IEEE WORKSH M, P214
[6]  
Golub G.H., 1996, Matrix Computations, Vthird
[7]   MOTION AND STRUCTURE FROM FEATURE CORRESPONDENCES - A REVIEW [J].
HUANG, TS ;
NETRAVALI, AN .
PROCEEDINGS OF THE IEEE, 1994, 82 (02) :252-268
[8]  
Jacobs D. W., 1994, Proceedings of the 1994 IEEE Workshop on Motion of Non-Rigid and Articulated Objects (Cat. No.94TH0671-8), P96, DOI 10.1109/MNRAO.1994.346249
[9]  
JACOBS DW, 1994, INT C PATT RECOG, P650
[10]  
KUNG SY, 1996, IEEE INT C AC SPEECH