SRGAN-enhanced unsafe operation detection and classification of heavy construction machinery using cascade learning

Bubryur Kim, Eui Jung An, Sungho Kim, K. R. Sri Preethaa, Dong Eun Lee, R. R. Lukacs

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

In the inherently hazardous construction industry, where injuries are frequent, the unsafe operation of heavy construction machinery significantly contributes to the injury and accident rates. To reduce these risks, this study introduces a novel framework for detecting and classifying these unsafe operations for five types of construction machinery. Utilizing a cascade learning architecture, the approach employs a Super-Resolution Generative Adversarial Network (SRGAN), Real-Time Detection Transformers (RT-DETR), self-DIstillation with NO labels (DINOv2), and Dilated Neighborhood Attention Transformer (DiNAT) models. The study focuses on enhancing the detection and classification of unsafe operations in construction machinery through upscaling low-resolution surveillance footage and creating detailed high-resolution inputs for the RT-DETR model. This enhancement, by leveraging temporal information, significantly improves object detection and classification accuracy. The performance of the cascaded pipeline yielded an average detection and first-level classification precision of 96%, a second-level classification accuracy of 98.83%, and a third-level classification accuracy of 98.25%, among other metrics. The cascaded integration of these models presents a well-rounded solution for near-real-time surveillance in dynamic construction environments, advancing surveillance technologies and significantly contributing to safety management within the industry.

Original languageEnglish
Article number206
JournalArtificial Intelligence Review
Volume57
Issue number8
DOIs
StatePublished - Aug 2024

Keywords

  • Safety management
  • Smart construction sites
  • Super-resolution generative adversarial network
  • Transformer
  • Unsafe operation detection

Fingerprint

Dive into the research topics of 'SRGAN-enhanced unsafe operation detection and classification of heavy construction machinery using cascade learning'. Together they form a unique fingerprint.

Cite this