A deep hybrid learning model to detect unsafe behavior: Integrating convolution neural networks and long short-term memory

Computer vision and pattern recognition approaches have been applied to determine unsafe behaviors on construction sites. Such approaches have been reliant on the computation of artificially complex image features that utilize a cumbersome parameter re-adjustment process. The creation of image featu...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Automation in construction 2018-02, Vol.86, p.118-124
Hauptverfasser:	Ding, Lieyun, Fang, Weili, Luo, Hanbin, Love, Peter E.D., Zhong, Botao, Ouyang, Xi
Format:	Artikel
Sprache:	eng
Schlagworte:	Building information modeling Complexity Computer memory Computer vision Construction sites Convolution Convolution neural network Deep learning Feature extraction Feature recognition Long short-term memory Machine learning Model accuracy Model testing Neural networks Occupational safety Pattern recognition Process parameters Safety Studies Unsafe actions Video surveillance
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Computer vision and pattern recognition approaches have been applied to determine unsafe behaviors on construction sites. Such approaches have been reliant on the computation of artificially complex image features that utilize a cumbersome parameter re-adjustment process. The creation of image features that can recognize unsafe actions, however, poses a significant research challenge on construction sites. This due to the prevailing complexity of spatio-temporal features, lighting, and the array of viewpoints that are required to identify an unsafe action. Considering these challenges, a new hybrid deep learning model that integrates a convolution neural network (CNN) and long short-term memory (LSTM) that automatically recognizes workers' unsafe actions is developed. The proposed hybrid deep learning model is used to: (1) identify unsafe actions; (2) collect motion data and site videos; (3) extract the visual features from videos using a CNN model; and (4) sequence the learning features that are enabled by the use of LSTM models. An experiment is used to test the model's ability to detect unsafe actions. The results reveal that the developed hybrid model (CNN+LSTM) is able to accurately detect safe/unsafe actions conducted by workers on-site. The model's accuracy exceeds the current state-of-the-art descriptor-based methods for detecting points of interest on images. •A hybrid deep learning model is proposed to automatically recognize workers’ unsafe actions.•A sequence of feature representations of unsafe acts from a series of action videos can be automatically extracted.•The model’s ability to detect workers’ unsafe actions is validated.•Outperforms comparable results to current state-of-the-art descriptor methods to detect unsafe action by an average of 10%.
ISSN:	0926-5805 1872-7891
DOI:	10.1016/j.autcon.2017.11.002