CrackSegDiff: Diffusion Probability Model-based Multi-modal Crack Segmentation
Integrating grayscale and depth data in road inspection robots could enhance the accuracy, reliability, and comprehensiveness of road condition assessments, leading to improved maintenance strategies and safer infrastructure. However, these data sources are often compromised by significant backgroun...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Integrating grayscale and depth data in road inspection robots could enhance
the accuracy, reliability, and comprehensiveness of road condition assessments,
leading to improved maintenance strategies and safer infrastructure. However,
these data sources are often compromised by significant background noise from
the pavement. Recent advancements in Diffusion Probabilistic Models (DPM) have
demonstrated remarkable success in image segmentation tasks, showcasing potent
denoising capabilities, as evidenced in studies like SegDiff. Despite these
advancements, current DPM-based segmentors do not fully capitalize on the
potential of original image data. In this paper, we propose a novel DPM-based
approach for crack segmentation, named CrackSegDiff, which uniquely fuses
grayscale and range/depth images. This method enhances the reverse diffusion
process by intensifying the interaction between local feature extraction via
DPM and global feature extraction. Unlike traditional methods that utilize
Transformers for global features, our approach employs Vm-unet to efficiently
capture long-range information of the original data. The integration of
features is further refined through two innovative modules: the Channel Fusion
Module (CFM) and the Shallow Feature Compensation Module (SFCM). Our
experimental evaluation on the three-class crack image segmentation tasks
within the FIND dataset demonstrates that CrackSegDiff outperforms
state-of-the-art methods, particularly excelling in the detection of shallow
cracks. Code is available at https://github.com/sky-visionX/CrackSegDiff. |
---|---|
DOI: | 10.48550/arxiv.2410.08100 |