Development of an Image Data Set of Construction Machines for Deep Learning Object Detection

Cited by: 94
Authors
Xiao, Bo [1]
Kang, Shih-Chung [1]
Affiliations
[1] Univ Alberta, Dept Civil & Environm Engn, Edmonton, AB T6G 2R3, Canada
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC)
Keywords
Deep learning; Object detection; Algorithm analysis; Construction machines; Image data set; ACTION RECOGNITION; SAFETY; EQUIPMENT; INFORMATION; FEATURES; TRACKING; WORKERS;
DOI
10.1061/(ASCE)CP.1943-5487.0000945
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Deep learning object detection algorithms have proven their capacity to identify a variety of objects in images and videos at near real-time speed. The construction industry can potentially benefit from this machine intelligence by linking algorithms with construction videos to automatically analyze productivity and monitor activities from a safety perspective. However, an effective image data set of construction machines for training deep learning object detection algorithms is not currently available, owing to the limited accessibility of construction images, the time and labor required for manual annotation, and the expertise required in both construction and deep learning. This research presents a case study on developing an image data set specifically for construction machines, named the Alberta Construction Image Data Set (ACID). For ACID, 10,000 images covering 10 types of construction machines were manually collected and annotated with the machine types and their corresponding positions in the images. To validate the feasibility of ACID, we train four existing deep learning object detection algorithms on the data set: YOLO-v3, Inception-SSD, R-FCN-ResNet101, and Faster-RCNN-ResNet101. The mean average precision (mAP) is 83.0% for Inception-SSD, 87.8% for YOLO-v3, 88.8% for R-FCN-ResNet101, and 89.2% for Faster-RCNN-ResNet101. The average detection speed of the four algorithms is 16.7 frames per second (fps), which satisfies the needs of most studies in the field of automation in construction.
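The abstract reports results as mean average precision (mAP) but does not describe the evaluation protocol. As a rough illustration only, the sketch below shows one common way such a metric is computed: bounding-box overlap (IoU) decides whether a detection is a hit, average precision (AP) is the area under the interpolated precision-recall curve for one class, and mAP is the mean of the per-class APs. The function names, the all-point interpolation, and the sample values are assumptions made for illustration; they are not taken from the paper.

```python
from typing import List, Tuple

# Hypothetical box format (an assumption): (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def average_precision(is_true_positive: List[bool], num_ground_truth: int) -> float:
    """AP for one class, using all-point interpolation.

    `is_true_positive` lists the detections of that class sorted by
    descending confidence; each flag says whether the detection matched
    a ground-truth box (for example, IoU above a chosen threshold).
    """
    if num_ground_truth == 0:
        return 0.0
    precisions: List[float] = []
    recalls: List[float] = []
    hits = 0
    for rank, hit in enumerate(is_true_positive, start=1):
        hits += int(hit)
        precisions.append(hits / rank)
        recalls.append(hits / num_ground_truth)
    # Replace each precision with the best precision at any higher recall.
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])
    # Sum the area under the resulting step curve.
    ap = 0.0
    previous_recall = 0.0
    for precision, recall in zip(precisions, recalls):
        ap += (recall - previous_recall) * precision
        previous_recall = recall
    return ap


def mean_average_precision(per_class_ap: List[float]) -> float:
    """mAP is the unweighted mean of the per-class AP values."""
    return sum(per_class_ap) / len(per_class_ap)


if __name__ == "__main__":
    # Made-up values for illustration; not results from the paper.
    print(iou((0, 0, 2, 2), (1, 1, 3, 3)))
    print(average_precision([True, False, True], num_ground_truth=2))
    print(mean_average_precision([1.0, 0.5]))
```

Benchmarks differ in the IoU threshold and interpolation scheme they use, so values from this sketch are comparable to the reported figures only if the same protocol is applied.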
Pages: 18