High-low level task combination for object detection in foggy weather conditions

For the object detection task in foggy weather conditions, image dehazing network is often used as preprocessing method to get a clear input. However, there is not strictly a strong positive correlation between image dehazing task and object detection task. Moreover, the preprocessing module can inc...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of visual communication and image representation 2024-02, Vol.98, p.104042, Article 104042
Hauptverfasser: Hu, Ke, Wu, Fei, Zhan, Zhenfei, Luo, Jun, Pu, Huayan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:For the object detection task in foggy weather conditions, image dehazing network is often used as preprocessing method to get a clear input. However, there is not strictly a strong positive correlation between image dehazing task and object detection task. Moreover, the preprocessing module can increase the inference time of the whole model to a certain extent. To alleviate these problems, we propose a novel High-Low level task combination network (HLNet) based on multitask learning, which can learn both high-level and low-level tasks. Specially, instead of restoring the features to clear pixel-wise feature space like common image dehazing method, we opt to perform a restoration in feature level to mitigate the influence of the Batch Normalization (BN) layer of encoder on dehazing task. HLNet jointly learn dehazing task and detection task in an end-to-end fashion, which ensures that the weather-specific information in latent feature space is suppressed. Moreover, we applied the HLNet framework on three different object detection networks, including RetinaNnet, YOLOv3 and YOLOv5s network, and achieved improvements of 1.7 percent, 2.3 percent, and 1.2 percent in mAP respectively. The experimental results demonstrate the effectiveness and generalization ability of our proposed HLNet framework in real foggy scenarios. •The proposed HLNet initially explores how to combine high-level and low-level tasks and improves the detection performance of the Retinanet on real foggy test dataset without increasing the inference time.•The application of the YOLOv3 model and YOLOv5s model also prove the effectiveness and generalization performance of the strategy in this article.•The contrastive loss is used to enhance the learning of task-relevant factors.
ISSN:1047-3203
1095-9076
DOI:10.1016/j.jvcir.2023.104042