Shape Similarity for 3D Video Sequences of People

被引：57

作者：

Huang, Peng ^{[1
]}

Hilton, Adrian ^{[1
]}

Starck, Jonathan ^{[1
]}

机构：

[1] Univ Surrey, CVSSP, Guildford GU2 7XH, Surrey, England

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2010年 / 89卷 / 2-3期

基金：

英国工程与自然科学研究理事会;

关键词：

Temporal shape similarity; 3D video; Surface motion capture; Human motion; OBJECT RECOGNITION; MOTION SYNTHESIS; RETRIEVAL; SIGNATURES;

D O I：

10.1007/s11263-010-0319-9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a performance evaluation of shape similarity metrics for 3D video sequences of people with unknown temporal correspondence. Performance of similarity measures is compared by evaluating Receiver Operator Characteristics for classification against ground-truth for a comprehensive database of synthetic 3D video sequences comprising animations of fourteen people performing twenty-eight motions. Static shape similarity metrics shape distribution, spin image, shape histogram and spherical harmonics are evaluated using optimal parameter settings for each approach. Shape histograms with volume sampling are found to consistently give the best performance for different people and motions. Static shape similarity is extended over time to eliminate the temporal ambiguity. Time-filtering of the static shape similarity together with two novel shape-flow descriptors are evaluated against temporal ground-truth. This evaluation demonstrates that shape-flow with a multi-frame alignment of motion sequences achieves the best performance, is stable for different people and motions, and overcome the ambiguity in static shape similarity. Time-filtering of the static shape histogram similarity measure with a fixed window size achieves marginally lower performance for linear motions with the same computational cost as static shape descriptors. Performance of the temporal shape descriptors is validated for real 3D video sequence of nine actors performing a variety of movements. Time-filtered shape histograms are shown to reliably identify frames from 3D video sequences with similar shape and motion for people with loose clothing and complex motion.

引用

页码：362 / 381

页数：20

共 50 条

[1]

[Anonymous], ICCV 03

[2]

[Anonymous], SSD 99

[3] Motion synthesis from annotations [J].

Arikan, O ;

Forsyth, DA ;

O'Brien, JF .

ACM TRANSACTIONS ON GRAPHICS, 2003, 22 (03) :402-408

[4] Shape matching and object recognition using shape contexts [J].

Belongie, S ;

Malik, J ;

Puzicha, J .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) :509-522

[5] The recognition of human movement using temporal templates [J].

Bobick, AF ;

Davis, JW .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (03) :257-267

[6] Content-based 3D object retrieval [J].

Bustos, Benjamin ;

Keim, Daniel ;

Saupe, Dietmar ;

Schreck, Tobias .

IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2007, 27 (04) :22-27

[7] Free-viewpoint video of human actors [J].

Carranza, J ;

Theobalt, C ;

Magnor, MA ;

Seidel, HP .

ACM TRANSACTIONS ON GRAPHICS, 2003, 22 (03) :569-577

[8] On visual similarity based 3D model retrieval [J].

Chen, DY ;

Tian, XP ;

Shen, YT ;

Ming, OY .

COMPUTER GRAPHICS FORUM, 2003, 22 (03) :223-232

[9] Point signatures: A new representation for 3D object recognition [J].

Chua, CS ;

Jarvis, R .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 1997, 25 (01) :63-85

[10] Coarse filters for shape matching [J].

Corney, J ;

Rea, H ;

Clark, D ;

Pritchard, J ;

Breaks, M ;

MacLeod, R .

IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2002, 22 (03) :65-74

← 1 2 3 4 5 →