CTFCD: Channel transformer based on full convolutional decoder for single image deraining

Although convolutional neural network and visual transformer have been successfully applied in various field of computer vision, there is little work combining them to construct an efficient network model to solve image deraining tasks. Convolutional neural network is utilized to extract the feature...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of visual communication and image representation 2024-02, Vol.98, p.103992, Article 103992
Hauptverfasser: Tan, Shaohan, Chen, Hui, Zhu, Songhao
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Although convolutional neural network and visual transformer have been successfully applied in various field of computer vision, there is little work combining them to construct an efficient network model to solve image deraining tasks. Convolutional neural network is utilized to extract the features from each region, while visual transformer is utilized to extract the context information between local features. Due to the limitations in computational resources and processing time, it is difficult for visual transformer to process high-resolution images, which hinders the application of visual transformer in devices with limited hardware resources. The purpose of this article is to utilize the advantages of to design a lightweight encoder-decoder network for real-time image deraining. Firstly, a novel channel Transformer module is designed to obtain global contextual information, where deep separable convolution is utilized to extract multi-scale local features and a Transformer encoder is constructed by stacking Transformer modules. Secondly, a decoder based on a fully convolution is designed to adopt mask attention and inverted bottleneck convolution to achieve progressive feature fusion and feature reconstruction, which significantly reduces computational complexity and memory requirement. A large number of experimental results have verified that the proposed method has superior performance compared with other state-of-the art methods, while the computational cost and parameter quantity are much smaller than those of similar methods.
ISSN:1047-3203
1095-9076
DOI:10.1016/j.jvcir.2023.103992