The classification and localization of crack using lightweight convolutional neural network with CBAM

•The combination of MobileNetV3-Large and CBAM is suitable for crack classification with satisfactory accuracy.•Focal Loss is employed to solve the imbalance of dataset categories and hard samples.•The trained MobileNetV3-Large-CBAM and moving windows are used to classify, locate and identify cracks...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Engineering structures 2023-01, Vol.275, p.115291, Article 115291
Hauptverfasser:	Chen, Liujie, Yao, Haodong, Fu, Jiyang, Tai Ng, Ching
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Artificial neural networks CBAM (Convolution Block Attention Module) Classification Coexistence Computer applications Cracks Datasets Focal Loss Illumination Image classification Image detection and localization Image reconstruction Imbalance dataset Lightweight Localization MobileNetV3-Large Neural networks Object recognition Sliding Sliding window
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	•The combination of MobileNetV3-Large and CBAM is suitable for crack classification with satisfactory accuracy.•Focal Loss is employed to solve the imbalance of dataset categories and hard samples.•The trained MobileNetV3-Large-CBAM and moving windows are used to classify, locate and identify cracks in three random reconstruction images.•The proposed model has satisfactory performance in crack detection and localization.•The crack detection is computationally fast and has strong generalization in mobile scenarios with limited calculation conditions. Convolutional Neural Networks (CNNs) are currently often used for crack detection. However, the crack datasets collected in real engineering are imbalanced datasets and are affected by interference factors such as different illumination issues and the coexistence of various material crack images. Therefore, the generalization ability of the model itself and the ability to face imbalanced datasets is critical. In addition, a real engineering environment is usually low computational power environment. Therefore, it is undoubtedly more beneficial for the model to have a lightweight feature for practical applications. To address the above challenges in crack detection, MobileNetV3-Large is employed as the backbone combined with CBAM (Convolutional Block Attention Module) to gain MobileNetV3-Large-CBAM in this study. The classification and identification of crack are studied by using the open-source bridge crack dataset. MobileNetV3-Large-CBAM is compared with cutting-edge CNNs, and it verifies that the proposed model combined with the preferred Focal Loss has good performance in dealing with imbalanced datasets and hard samples. To verify the generalization ability of the proposed model, this paper further studies the crack datasets with various material and huge-width cracks under different illumination issues. Finally, the sliding window is adopted to perform crack detection and localization on the three randomly reconstructed crack images. The research results show that, compared with other CNNs, the proposed lightweight MobileNetV3-Large-CBAM combined with the preferred Focal Loss has better comprehensive performance, and the model size is 16.6 MB. I. For imbalanced datasets, the proposed model obtains the best results for crack classification. The Overall Accuracy (OA), F1-score, training speed, and classification speed of MobileNetV3-Large-CBAM are 95.90 %, 95.89 %, 101 images/second and 48 images/second, res
ISSN:	0141-0296 1873-7323
DOI:	10.1016/j.engstruct.2022.115291