Automatic road crack detection and classification using multi-tasking faster RCNN

Automatic road crack detection is a prominent challenging task, in view of that, a novel approach is proposed using multi-tasking Faster-RCNN to detect and classify road cracks. In this present study, we have collected the road images (a dataset of 19300 images) from the Outer Ring Road of Chennai,...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of intelligent & fuzzy systems 2021-01, Vol.41 (6), p.6615-6628
Hauptverfasser: Sekar, Aravindkumar, Perumal, Varalakshmi
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Automatic road crack detection is a prominent challenging task, in view of that, a novel approach is proposed using multi-tasking Faster-RCNN to detect and classify road cracks. In this present study, we have collected the road images (a dataset of 19300 images) from the Outer Ring Road of Chennai, Tamil Nadu, India. The collected road images were pre-processed using various conventional image processing techniques to identify the ground-truth label of the bounding boxes for the cracks. We present a novel multi-tasking Faster-RCNN based approach using the Global Average Pooling(GAP) and Region of Interest (RoI) Align techniques to detect the road cracks. The RoI Align is used to avoid quantizing the stride. So that the information loss can be minimized and the bi-linear interpolation can be used to map the proposal to the input image. The resulting features from RoI Align are given as input to the GAP layer which drastically reduces the multi-dimension features into a single feature map. The output of the GAP layer is given to the fully connected layer for classification (softmax) and also to a regression model for predicting the crack location using a bounding box. F1-measure, precision, and recall were used to evaluate the results of classification and detection. The proposed model achieves the accuracy-97.97%, precision-99.12%, and recall-97.25% for classification using the MIT-CHN-ORR dataset. The experimental results show, that the proposed approach outperforms the other state-of-the-art methods.
ISSN:1064-1246
1875-8967
DOI:10.3233/JIFS-210475