Dataset and benchmark for detecting moving objects in construction sites

被引:116
作者
An Xuehui [1 ]
Zhou Li [1 ]
Liu Zuguang [1 ]
Wang Chengzhi [2 ]
Li Pengfei [2 ]
Li Zhiwei [3 ]
机构
[1] Tsinghua Univ, Dept Hydraul Engn, Beijing 10084, Peoples R China
[2] Chongqingjiaotong Univ, Sch River & Ocean Engn, Chongqing 400074, Peoples R China
[3] SinoHydro Bur 8 Co Ltd, Changsha 410004, Peoples R China
关键词
Dataset; Deep neural networks; Construction site; Benchmark; Object detection;
D O I
10.1016/j.autcon.2020.103482
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Detecting workers and equipment through images/videos can assist in safety monitoring, quality control, and productivity management at construction sites. Currently, the dominant method for detecting is Deep Neural Networks (DNNs). To apply this method, the DNNs always need to be trained on image datasets that contain objects at the construction site. However, a large-scale and publicly available image dataset for detecting objects at construction sites is still absent, and this hinders research in this field. In this study, the Moving Objects in Construction Sites (MOCS) image dataset is presented. The dataset contains 41,668 images collected from 174 different construction sites. Thirteen categories of moving objects found in construction sites were annotated. Furthermore, the objects were precisely annotated using per-pixel segmentations to assist in precise object localization. A detailed statistical analysis was performed in this study. Finally, a benchmark containing 15 different DNN-based detectors was made using the MOCS dataset. The results show that all detectors trained on the dataset could detect objects at construction sites precisely and robustly.
引用
收藏
页数:18
相关论文
共 57 条
[1]  
[Anonymous], 2018, EUROCITY PERSONS DAT
[2]  
[Anonymous], 2000, Opencv. Dr. Dobb's J. Softw. Tools
[3]  
Arnab A., 2016, ARXIV160902583, DOI 10.5244/C.30.19
[4]   Image segmentation of underfloor scenes using a mask regions convolutional neural network with two-stage transfer learning [J].
Atkinson, Gary A. ;
Zhang, Wenhao ;
Hansen, Mark F. ;
Holloway, Mathew L. ;
Napier, Ashley A. .
AUTOMATION IN CONSTRUCTION, 2020, 113
[5]   Segmentation and recognition of roadway assets from car-mounted camera video streams using a scalable non-parametric image parsing method [J].
Balali, Vahid ;
Golparvar-Fard, Mani .
AUTOMATION IN CONSTRUCTION, 2015, 49 :27-39
[6]   Hybrid Task Cascade for Instance Segmentation [J].
Chen, Kai ;
Pang, Jiangmiao ;
Wang, Jiaqi ;
Xiong, Yu ;
Li, Xiaoxiao ;
Sun, Shuyang ;
Feng, Wansen ;
Liu, Ziwei ;
Shi, Jianping ;
Ouyang, Wanli ;
Loy, Chen Change ;
Lin, Dahua .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4969-4978
[7]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[8]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[9]   The PASCAL Visual Object Classes Challenge: A Retrospective [J].
Everingham, Mark ;
Eslami, S. M. Ali ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) :98-136
[10]   A deep learning-based method for detecting non-certified work on construction sites [J].
Fang, Qi ;
Li, Heng ;
Luo, Xiaochun ;
Ding, Lieyun ;
Rose, Timothy M. ;
An, Wangpeng ;
Yu, Yantao .
ADVANCED ENGINEERING INFORMATICS, 2018, 35 :56-68