Vision-based action recognition of earthmoving equipment using spatio-temporal features and support vector machine classifiers

被引:209
作者
Golparvar-Fard, Mani [1 ]
Heydarian, Arsalan [2 ]
Carlos Niebles, Juan [3 ]
机构
[1] Univ Illinois, Dept Civil & Environm Engn, Dept Comp Sci, Urbana, IL 61801 USA
[2] Virginia Tech, Charles E Via Dept Civil & Environm Engn, Blacksburg, VA 24061 USA
[3] Univ Norte, Dept Elect & Elect Engn, Barranquilla, Colombia
关键词
Computer vision; Action recognition; Construction productivity; Activity analysis; Time-studies; Operational efficiency; TRACKING; WORKERS; LOCATION; SPACE;
D O I
10.1016/j.aei.2013.09.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video recordings of earthmoving construction operations provide understandable data that can be used for benchmarking and analyzing their performance. These recordings further support project managers to take corrective actions on performance deviations and in turn improve operational efficiency. Despite these benefits, manual stopwatch studies of previously recorded videos can be labor-intensive, may suffer from biases of the observers, and are impractical after substantial period of observations. This paper presents a new computer vision based algorithm for recognizing single actions of earthmoving construction equipment. This is particularly a challenging task as equipment can be partially occluded in site video streams and usually come in wide variety of sizes and appearances. The scale and pose of the equipment actions can also significantly vary based on the camera configurations. In the proposed method, a video is initially represented as a collection of spatio-temporal visual features by extracting space-time interest points and describing each feature with a Histogram of Oriented Gradients (HOG). The algorithm automatically learns the distributions of the spatio-temporal features and action categories using a multi-class Support Vector Machine (SVM) classifier. This strategy handles noisy feature points arisen from typical dynamic backgrounds. Given a video sequence captured from a fixed camera, the multi-class SVM classifier recognizes and localizes equipment actions. For the purpose of evaluation, a new video dataset is introduced which contains 859 sequences from excavator and truck actions. This dataset contains large variations of equipment pose and scale, and has varied backgrounds and levels of occlusion. The experimental results with average accuracies of 86.33% and 98.33% show that our supervised method outperforms previous algorithms for excavator and truck action recognition. The results hold the promise for applicability of the proposed method for construction activity analysis. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:652 / 663
页数:12
相关论文
共 77 条
  • [51] Assessing Effects of Operational Efficiency on Pollutant Emissions of Nonroad Diesel Construction Equipment
    Lewis, Phil
    Leming, Michael
    Frey, H. Christopher
    Rasdorf, William
    [J]. TRANSPORTATION RESEARCH RECORD, 2011, (2233) : 11 - 18
  • [52] Liu Q, 2008, IEEE IC COMP COM NET, P1
  • [53] Marszalek M, 2009, PROC CVPR IEEE, P2921, DOI 10.1109/CVPRW.2009.5206557
  • [54] Automated 2D detection of construction equipment and workers from site video streams using histograms of oriented gradients and colors
    Memarzadeh, Milad
    Golparvar-Fard, Mani
    Carlos Niebles, Juan
    [J]. AUTOMATION IN CONSTRUCTION, 2013, 32 : 24 - 37
  • [55] Assessing research issues in automated project performance control (APPC)
    Navon, Ronie
    Sacks, Rafael
    [J]. AUTOMATION IN CONSTRUCTION, 2007, 16 (04) : 474 - 484
  • [56] Unsupervised learning of human action categories using spatial-temporal words
    Niebles, Juan Carlos
    Wang, Hongcheng
    Fei-Fei, Li
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2008, 79 (03) : 299 - 318
  • [57] Niebles JC, 2010, LECT NOTES COMPUT SC, V6312, P392, DOI 10.1007/978-3-642-15552-9_29
  • [58] Three-Dimensional Tracking of Construction Resources Using an On-Site Camera System
    Park, Man-Woo
    Koch, Christian
    Brilakis, Ioannis
    [J]. JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2012, 26 (04) : 541 - 549
  • [59] Construction worker detection in video frames for initializing vision trackers
    Park, Man-Woo
    Brilakis, Ioannis
    [J]. AUTOMATION IN CONSTRUCTION, 2012, 28 : 15 - 25
  • [60] Powers D.M., 2020, EVALUATION PRECISION