Unsupervised learning to detect loops using deep neural networks for visual SLAM system

被引:170
作者
Gao, Xiang [1 ]
Zhang, Tao [1 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
关键词
Simultaneous localization and mapping (SLAM); Loop closure detection; Stacked denoising auto-encoder; Deep neural network; SIMULTANEOUS LOCALIZATION; LARGE-SCALE; FAB-MAP; FEATURES; TIME; RECOGNITION; ASSOCIATION; CLOSURE; SPACE;
D O I
10.1007/s10514-015-9516-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper is concerned of the loop closure detection problem for visual simultaneous localization and mapping systems. We propose a novel approach based on the stacked denoising auto-encoder (SDA), a multi-layer neural network that autonomously learns an compressed representation from the raw input data in an unsupervised way. Different with the traditional bag-of-words based methods, the deep network has the ability to learn the complex inner structures in image data, while no longer needs to manually design the visual features. Our approach employs the characteristics of the SDA to solve the loop detection problem. The workflow of training the network, utilizing the features and computing the similarity score is presented. The performance of SDA is evaluated by a comparison study with Fab-map 2.0 using data from open datasets and physical robots. The results show that SDA is feasible for detecting loops at a satisfactory precision and can therefore provide an alternative way for visual SLAM systems.
引用
收藏
页码:1 / 18
页数:18
相关论文
共 56 条
[31]  
Kummerle Rainer, 2011, IEEE International Conference on Robotics and Automation, P3607
[32]   Building 3D visual maps of interior space with a new hierarchical sensor fusion architecture [J].
Kwon, Hyukseong ;
Yousef, Khalil M. Ahmad ;
Kak, Avinash C. .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2013, 61 (08) :749-767
[33]   Appearance-Based Loop Closure Detection for Online Large-Scale and Long-Term Operation [J].
Labbe, Mathieu ;
Michaud, Francois .
IEEE TRANSACTIONS ON ROBOTICS, 2013, 29 (03) :734-745
[34]   Robust loop closing over time for pose graph SLAM [J].
Latif, Yasir ;
Cadena, Cesar ;
Neira, Jose .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (14) :1611-1626
[35]   Keypoint recognition using randomized trees [J].
Lepetit, Vincent ;
Fua, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (09) :1465-1479
[36]   Autoencoder for words [J].
Liou, Cheng-Yuan ;
Cheng, Wei-Chen ;
Liou, Jiun-Wei ;
Liou, Daw-Ran .
NEUROCOMPUTING, 2014, 139 :84-96
[37]   Distinctive image features from scale-invariant keypoints [J].
Lowe, DG .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) :91-110
[38]  
Lu XG, 2013, INTERSPEECH, P436
[39]   A Comparative Study of Registration Methods for RGB-D Video of Static Scenes [J].
Morell-Gimenez, Vicente ;
Saval-Calvo, Marcelo ;
Azorin-Lopez, Jorge ;
Garcia-Rodriguez, Jose ;
Cazorla, Miguel ;
Orts-Escolano, Sergio ;
Fuster-Guillo, Andres .
SENSORS, 2014, 14 (05) :8547-8576
[40]  
Muja M, 2009, VISAPP 2009: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, P331