Automatic acquisition and initialization of articulated models

Cited by: 15

Authors:

Krahnstoever, N [1]

Yeasin, M [1]

Sharma, R [1]

Affiliations:

[1] Penn State Univ, Dept Comp Sci & Engn, Pond Lab 220, University Pk, PA 16802 USA

Source:

MACHINE VISION AND APPLICATIONS | 2003 / Volume 14 / Issue 04

Keywords:

model assembly; model-based visual tracking; joint detection; model acquisition; articulated motion;

DOI:

10.1007/s00138-002-0081-2

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Tracking, classification and visual analysis of articulated motion are challenging because of the difficulties involved in separating noise and variabilities caused by appearance, size and viewpoint fluctuations from task-relevant variations. By incorporating powerful domain knowledge, model-based approaches are able to overcome these problems to a great extent and are actively explored by many researchers. However, model acquisition, initialization and adaptation are still relatively under-investigated problems, especially for the case of single-camera systems. In this paper, we address the problem of automatic acquisition and initialization of articulated models from monocular video without any prior knowledge of shape and kinematic structure. The framework is applied in a human-computer interaction context where articulated shape models have to be acquired from unknown users for subsequent limb tracking. Bayesian motion segmentation is used to extract and initialize articulated models from visual data. Image sequences are decomposed into rigid components that can undergo parametric motion. The relative motion of these components is used to obtain joint information. The resulting components are assembled into an articulated kinematic model which is then used for visual tracking, eliminating the need for manual initialization or adaptation. The efficacy of the method is demonstrated on synthetic as well as natural image sequences. The accuracy of the joint estimation stage is verified on ground truth data.

Pages: 218-228

Page count: 11

References (47 in total):

[1] AYER S, 1995, FIFTH INTERNATIONAL CONFERENCE ON COMPUTER VISION, PROCEEDINGS, P777, DOI 10.1109/ICCV.1995.466859

[2] Estimating anthropometry and pose from a single uncalibrated image [J]. Barrón, C; Kakadiaris, IA. COMPUTER VISION AND IMAGE UNDERSTANDING, 2001, 81 (03): 269-284

[3] Motion segmentation by multistage affine classification [J]. Borshukov, GD; Bozdagi, G; Altunbasak, Y; Tekalp, AM. IEEE TRANSACTIONS ON IMAGE PROCESSING, 1997, 6 (11): 1591-1594

[4] Tracking people with twists and exponential maps [J]. Bregler, C; Malik, J. 1998 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1998: 8-15

[5] Cheung GKM, 2000, PROC CVPR IEEE, P714, DOI 10.1109/CVPR.2000.854944

[6] Covell MM, 2000, PROC CVPR IEEE, P438, DOI 10.1109/CVPR.2000.854875

[7] *CUR LAB, 2001, POS 4 0

[8] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J]. DEMPSTER, AP; LAIRD, NM; RUBIN, DB. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): 1-38

[9] DEUTSCHER J, 2000, PROC CVPR IEEE, P126, DOI 10.1109/CVPR.2000.854758

[10] Doucet A., 2001, SEQUENTIAL MONTE CAR
