Face recognition on large-scale video in the wild with hybrid Euclidean-and-Riemannian metric learning

Cited by: 86

Authors:

Huang, Zhiwu [1, 2]

Wang, Ruiping [1]

Shan, Shiguang [1]

Chen, Xilin [1]

Affiliations:

[1] Chinese Acad Sci, Key Lab Intelligent Informat Proc, Inst Comp Technol, Beijing 100190, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

Source:

PATTERN RECOGNITION | 2015 / Vol. 48 / Issue 10

Keywords:

Face recognition; Large-scale video; Multiple heterogeneous statistics; Hybrid Euclidean-and-Riemannian metric learning; KERNEL; CLASSIFICATION;

DOI:

10.1016/j.patcog.2015.03.011

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Face recognition on large-scale video in the wild is becoming increasingly important due to the ubiquity of video data captured by surveillance cameras, handheld devices, Internet uploads, and other sources. By treating each video as one image set, set-based methods have recently achieved great success in video-based face recognition. In the wild, videos often contain extremely complex data variations, which makes set modeling a major challenge for set-based methods. In this paper, we propose a novel Hybrid Euclidean-and-Riemannian Metric Learning (HERML) method to fuse multiple statistics of an image set. Specifically, we represent each image set simultaneously by its mean, covariance matrix, and Gaussian distribution, which generally complement each other in set modeling. However, fusing them is not trivial, since the mean, covariance matrix, and Gaussian model typically lie in multiple heterogeneous spaces equipped with Euclidean or Riemannian metrics. Therefore, we first implicitly map the original statistics into high-dimensional Hilbert spaces by exploiting Euclidean and Riemannian kernels. With a LogDet divergence-based objective function, the hybrid kernels are then fused by our hybrid metric learning framework, which can efficiently perform the fusion on large-scale videos. The proposed method is evaluated on four public and challenging large-scale video face datasets. Extensive experimental results demonstrate that our method clearly outperforms state-of-the-art set-based methods for large-scale video-based face recognition. (C) 2015 Elsevier Ltd. All rights reserved.
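As a rough illustration of the ingredients named in the abstract, the sketch below computes the three set statistics for a toy image set, evaluates one Euclidean and one Riemannian kernel on them, and computes a LogDet divergence between two positive definite matrices. The abstract does not say which kernels or which Gaussian representation the authors use, so the linear kernel, the Log-Euclidean kernel, and the Gaussian-to-SPD embedding here are assumptions chosen for illustration, and all function names are hypothetical. The sketch does not implement the HERML learning step itself.

```python
import numpy as np


def set_statistics(frames, eps=1e-3):
    """Mean and regularized covariance of one image set (rows = frame features)."""
    mean = frames.mean(axis=0)
    cov = np.cov(frames, rowvar=False)
    # Small ridge keeps the covariance strictly positive definite.
    cov = cov + eps * np.eye(frames.shape[1])
    return mean, cov


def gaussian_embedding(mean, cov):
    """Represent N(mean, cov) as a (d+1)x(d+1) SPD matrix (assumed embedding)."""
    d = mean.shape[0]
    top = np.hstack([cov + np.outer(mean, mean), mean[:, None]])
    bottom = np.append(mean, 1.0)[None, :]
    block = np.vstack([top, bottom])
    _, logdet = np.linalg.slogdet(cov)
    return np.exp(-logdet / (d + 1)) * block


def spd_log(mat):
    """Matrix logarithm of a symmetric positive definite matrix."""
    eigvals, eigvecs = np.linalg.eigh(mat)
    return (eigvecs * np.log(eigvals)) @ eigvecs.T


def linear_kernel(m1, m2):
    """Euclidean kernel on set means (assumed choice)."""
    return float(m1 @ m2)


def log_euclidean_kernel(s1, s2):
    """Riemannian kernel on SPD matrices: tr(log(S1) log(S2)) (assumed choice)."""
    return float(np.trace(spd_log(s1) @ spd_log(s2)))


def logdet_divergence(a, a0):
    """LogDet divergence: tr(A A0^-1) - log det(A A0^-1) - d."""
    d = a.shape[0]
    prod = a @ np.linalg.inv(a0)
    _, logdet = np.linalg.slogdet(prod)
    return float(np.trace(prod) - logdet - d)


# Two toy "videos": 50 and 60 frames of 4-dimensional features (synthetic data).
rng = np.random.default_rng(0)
set_a = rng.normal(size=(50, 4))
set_b = rng.normal(loc=0.5, size=(60, 4))

mean_a, cov_a = set_statistics(set_a)
mean_b, cov_b = set_statistics(set_b)
gauss_a = gaussian_embedding(mean_a, cov_a)
gauss_b = gaussian_embedding(mean_b, cov_b)

k_mean = linear_kernel(mean_a, mean_b)
k_cov = log_euclidean_kernel(cov_a, cov_b)
k_gauss = log_euclidean_kernel(gauss_a, gauss_b)
div_ab = logdet_divergence(cov_a, cov_b)
```

Each kernel value compares the two sets through a different statistic. A hybrid method in the spirit of the abstract would learn how to combine such kernel representations rather than use any one of them alone.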


Pages: 3113-3124

Page count: 12
