Lightweight two-stage transformer for low-light image enhancement and object detection

In low-light conditions, due to the loss of image details the visual tasks are challenging. To achieve real-time image enhancement and improve the accuracy of object detection task, we propose a lightweight two-stage Transformer. First, we use dynamic convolution to improve the adaptability of the n...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Digital signal processing 2024-07, Vol.150, p.104521, Article 104521
Hauptverfasser: Kou, Kangkang, Yin, Xiangchen, Gao, Xin, Nie, Fuhui, Liu, Jing, Zhang, Guoying
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In low-light conditions, due to the loss of image details the visual tasks are challenging. To achieve real-time image enhancement and improve the accuracy of object detection task, we propose a lightweight two-stage Transformer. First, we use dynamic convolution to improve the adaptability of the network to different samples and preliminarily enhance the image through predicting the multiplicative and additive maps of the least squares method. In the second stage, we propose a FFT-Guidance Block (FGB) to obtain frequency components for explicit modeling, guiding the recovery of image potential information. In addition, joint our model with YOLOv3 to build a dark object detection framework, and we only use normal detection loss to simplify the training process. On the LOLv2 dataset, our model achieves advanced results. The enhanced model maintains good performance while the parameters are only 0.050M, and increase the accuracy of the downstream object detection task. The detection framework reaches 77.9% and 60.2 in mAP and FPS respectively on ExDark dataset, which can be better robust in dark conditions.
ISSN:1051-2004
1095-4333
DOI:10.1016/j.dsp.2024.104521