Swin Transformer UNet for Very High Resolution Image Dehazing

Bibliographic Details
Published in:	Sensors and materials 2022-01, Vol.34 (11), p.4029
Main authors:	Bian, Yuxin; Zhang, Enguang; Wang, Jiayan; Xie, Rixin; Jiang, Shenlu
Format:	Article
Language:	English
Subjects:	Data structures; Earthquakes; Haze; High resolution; Image acquisition; Image resolution; Parallel processing; Remote sensing; Transformers; Unmanned aerial vehicles; Workflow
Online access:	Full text

Description
Abstract:	Rapid image acquisition for a region affected by an earthquake is important for managing rescue operations. Using an unmanned aerial vehicle (UAV) to rapidly survey an affected region and obtain very high resolution (VHR) images is highly advantageous. However, haze is a problem in many UAV aerial images, especially when UAVs fly through clouds. In this paper, we present a parallel prediction workflow that works together with Swin Transformer UNet (ST-UNet) for this task. ST-UNet uses the Swin Transformer in place of convolutional (CNN) layers, which greatly increases processing speed without loss of accuracy. The prediction workflow employs parallel processing and a suitable data structure to make full use of the available computing resources for rapid processing. To demonstrate the advantages of the proposed workflow, we evaluated it on three public remote sensing datasets, on which the proposed ST-UNet obtained the highest accuracy and speed. Furthermore, the high dehazing performance of ST-UNet was demonstrated on a real post-earthquake scene.
ISSN:	0914-4935, 2435-0869
DOI:	10.18494/SAM4059