Context-aware Deep Feature Compression for High-speed Visual Tracking

被引：195

作者：

Choi, Jongwon ^{[1
]}

Chang, Hyung Jin ^{[2
,3
]}

Fischer, Tobias ^{[2
]}

Yun, Sangdoo ^{[1
,4
]}

Lee, Kyuewang ^{[1
]}

Jeong, Jiyeoup ^{[1
]}

Demiris, Yiannis ^{[2
]}

Choi, Jin Young ^{[1
]}

机构：

[1] Seoul Natl Univ, ECE, ASRI, Seoul, South Korea

[2] Imperial Coll London, Personal Robot Lab, EEE, London, England

[3] Univ Birmingham, Sch Comp Sci, Birmingham, W Midlands, England

[4] NAVER Corp, Clova AI Res, Seoul, South Korea

来源：

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年

关键词：

D O I：

10.1109/CVPR.2018.00057

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a new context-aware correlation filter based tracking framework to achieve both high computational speed and state-of-the-art performance among real-time trackers. The major contribution to the high computational speed lies in the proposed deep feature compression that is achieved by a context-aware scheme utilizing multiple expert auto-encoders; a context in our framework refers to the coarse category of the tracking target according to appearance patterns. In the pre-training phase, one expert auto-encoder is trained per category. In the tracking phase, the best expert auto-encoder is selected for a given target, and only this auto-encoder is used. To achieve high tracking performance with the compressed feature map, we introduce extrinsic denoising processes and a new orthogonality loss term for pre-training and fine-tuning of the expert auto encoders. We validate the proposed context-aware framework through a number of experiments, where our method achieves a comparable performance to state-of-the-art trackers which cannot run in real-time, while running at a significantly fast speed of over 100 fps.

引用

页码：479 / 488

页数：10

共 43 条

[1]

[Anonymous], 2008, ICML 08, DOI 10.1145/1390156.1390294

[2]

[Anonymous], 2007, Caltech-256 Object Category Dataset

[3]

[Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.465

[4]

Babu Sam D., 2017, CVPR

[5] Fully-Convolutional Siamese Networks for Object Tracking [J].

Bertinetto, Luca ;

Valmadre, Jack ;

Henriques, Joao F. ;

Vedaldi, Andrea ;

Torr, Philip H. S. .

COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 :850-865

[6]

Bolme DS, 2010, PROC CVPR IEEE, P2544, DOI 10.1109/CVPR.2010.5539960

[7] The devil is in the details: an evaluation of recent feature encoding methods [J].

Chatfield, Ken ;

Lempitsky, Victor ;

Vedaldi, Andrea ;

Zisserman, Andrew .

PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,

[8] Attentional Correlation Filter Network for Adaptive Visual Tracking [J].

Choi, Jongwon ;

Chang, Hyung Jin ;

Yun, Sangdoo ;

Fischer, Tobias ;

Demiris, Yiannis ;

Choi, Jin Young .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4828-4837

[9] Visual Tracking Using Attention-Modulated Disintegration and Integration [J].

Choi, Jongwon ;

Chang, Hyung Jin ;

Jeong, Jiyeoup ;

Demiris, Yiannis ;

Choi, Jin Young .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4321-4330

[10]

Danelljan M., 2016, ICCV WORKSH

← 1 2 3 4 5 →