50 Years of object recognition: Directions forward

被引:219
作者
Andreopoulos, Alexander [1 ]
Tsotsos, John K. [2 ]
机构
[1] IBM Res Almaden, 650 Hany Rd, San Jose, CA 95120 USA
[2] York Univ, Ctr Vis Res, Dept Comp Sci & Engn, Toronto, ON M3J 1P3, Canada
关键词
Active vision; Object recognition; Object representations; Object learning; Dynamic vision; Cognitive vision systems; IMAGE RETRIEVAL; COMPUTATIONAL MODEL; HIERARCHICAL-MODELS; DIFFERING VIEWS; VISUAL-SEARCH; 3-D OBJECTS; INVARIANT; INFORMATION; REPRESENTATION; FEATURES;
D O I
10.1016/j.cviu.2013.04.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object recognition systems constitute a deeply entrenched and omnipresent component of modern intelligent systems. Research on object recognition algorithms has led to advances in factory and office automation through the creation of optical character recognition systems, assembly-line industrial inspection systems, as well as chip defect identification systems. It has also led to significant advances in medical imaging, defence and biometrics. In this paper we discuss the evolution of computer-based object recognition systems over the last fifty years, and overview the successes and failures of proposed solutions to the problem. We survey the breadth of approaches adopted over the years in attempting to solve the problem, and highlight the important role that active and attentive approaches must play in any solution that bridges the semantic gap in the proposed object representations, while simultaneously leading to efficient learning and inference algorithms. From the earliest systems which dealt with the character recognition problem, to modern visually-guided agents that can purposively search entire rooms for objects, we argue that a common thread of all such systems is their fragility and their inability to generalize as well as the human visual system can. At the same time, however, we demonstrate that the performance of such systems in strictly controlled environments often vastly outperforms the capabilities of the human visual system. We conclude our survey by arguing that the next step in the evolution of object recognition algorithms will require radical and bold steps forward in terms of the object representations, as well as the learning and inference algorithms used. (c) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:827 / 891
页数:65
相关论文
共 385 条
[1]  
ALOIMONOS J, 1987, INT J COMPUT VISION, V1, P333
[2]   A computational model for visual selection [J].
Amit, Y ;
Geman, D .
NEURAL COMPUTATION, 1999, 11 (07) :1691-1715
[3]   License plate recognition from still images and video sequences: A survey [J].
Anagnostopoulos, Christos-Nikolaos E. ;
Anagnostopoulos, Ioannis E. ;
Psoroulas, Ioannis D. ;
Loumos, Vassili ;
Kayafas, Eleftherios .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2008, 9 (03) :377-391
[4]   Efficient and generalizable statistical models of shape and appearance for analysis of cardiac MRI [J].
Andreopoulos, Alexander ;
Tsotsos, John K. .
MEDICAL IMAGE ANALYSIS, 2008, 12 (03) :335-357
[5]   A Computational Learning Theory of Active Object Recognition Under Uncertainty [J].
Andreopoulos, Alexander ;
Tsotsos, John K. .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2013, 101 (01) :95-142
[6]   On Sensor Bias in Experimental Methods for Comparing Interest-Point, Saliency, and Recognition Algorithms [J].
Andreopoulos, Alexander ;
Tsotsos, John K. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (01) :110-126
[7]   Active 3D Object Localization Using a Humanoid Robot [J].
Andreopoulos, Alexander ;
Hasler, Stephan ;
Wersing, Heiko ;
Janssen, Herbert ;
Tsotsos, John K. ;
Koerner, Edgar .
IEEE TRANSACTIONS ON ROBOTICS, 2011, 27 (01) :47-64
[8]  
[Anonymous], TECH REP
[9]  
[Anonymous], CLASSIFICATION AIDED
[10]  
[Anonymous], TECH REP