Deep learning for visual understanding: A review

被引:1447
作者
Guo, Yanming [1 ,3 ]
Liu, Yu [1 ]
Oerlemans, Ard [2 ]
Lao, Songyang [3 ]
Wu, Song [1 ]
Lew, Michael S. [1 ]
机构
[1] Leiden Univ, LIACS Media Lab, Niels Bohrweg 1, Leiden, Netherlands
[2] VDG Secur BV, Zoetermeer, Netherlands
[3] Natl Univ Def Technol, Coll Informat Syst & Management, Changsha, Hunan, Peoples R China
关键词
Deep learning; Computer vision; Developments; Applications; Trends; Challenges; NEURAL-NETWORKS; REPRESENTATIONS;
D O I
10.1016/j.neucom.2015.09.116
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning algorithms are a subset of the machine learning algorithms, which aim at discovering multiple levels of distributed representations. Recently, numerous deep learning algorithms have been proposed to solve traditional artificial intelligence problems. This work aims to review the state-of-the-art in deep learning algorithms in computer vision by highlighting the contributions and challenges from over 210 recent research papers. It first gives an overview of various deep learning approaches and their recent developments, and then briefly describes their applications in diverse vision tasks, such as image classification, object detection, image retrieval, semantic segmentation and human pose estimation. Finally, the paper summarizes the future trends and challenges in designing and training deep neural networks. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:27 / 48
页数:22
相关论文
共 213 条
[1]   Measuring the Objectness of Image Windows [J].
Alexe, Bogdan ;
Deselaers, Thomas ;
Ferrari, Vittorio .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) :2189-2202
[2]  
[Anonymous], P ICASSP
[3]  
[Anonymous], P AISTATS
[4]  
[Anonymous], 2008, P NIPS
[5]  
[Anonymous], 2015, ARXIV150401013
[6]  
[Anonymous], P 3D VIS 3DV
[7]  
[Anonymous], 4 7 STAT MODELING PH
[8]  
[Anonymous], 2013, P ICML
[9]  
[Anonymous], 2010, ARXIV10103467
[10]  
[Anonymous], 2005, AISTATS BRIDGETOWN B