An adaptive multichannel DeepLabv3 + for semantic segmentation of aerial images using improved Beluga Whale Optimization Algorithm

Semantic segmentation of aerial images plays a pivotal role in extracting detailed information about land cover, infrastructure, and natural features. Traditional single-channel segmentation models struggle to harness the rich information present in multi-channel aerial images, such as multi-spectra...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Multimedia tools and applications 2024-05, Vol.83 (15), p.46439-46478
Hauptverfasser: Anilkumar, P., Venugopal, P.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Semantic segmentation of aerial images plays a pivotal role in extracting detailed information about land cover, infrastructure, and natural features. Traditional single-channel segmentation models struggle to harness the rich information present in multi-channel aerial images, such as multi-spectral or hyperspectral data. While the DeepLabV3 + architecture has shown remarkable success in semantic segmentation tasks by exploiting multi-scale context and atrous convolutions, its performance on aerial images remains suboptimal due to the unique challenges of this domain. With the motive of addressing the difficulties in the existing semantic segmentation techniques for aerial images, the adoption of deep learning techniques is utilized. A multi-objective derived Adaptive Multichannel deeplabv3 + (AMC-Deeplabv3 +) with a new meta-heuristic algorithm called Improved Beluga whale optimization (IBWO) algorithm is proposed in this paper. Here, the hyperparameters of Multichannel deeplabv3 + are optimized by the IBWO algorithm and this model intends to set new benchmarks in the accuracy and contextual understanding of aerial image segmentation by integrating multi-channel data processing techniques and preserving spatial context. The proposed model attains improved accuracies of 98.65% & 98.72% for dataset 1 and 2 respectively and also achieves the dice coefficient of 98.73% & 98.85% respectively, with a computation time of 113.0123 s. The evolutional outcomes of the proposed model show significantly better than the state-of-the-art techniques.
ISSN:1573-7721
1380-7501
1573-7721
DOI:10.1007/s11042-023-17247-z