Learning super-resolution and pyramidal convolution residual network for vehicle re-identification

Vehicle re-identification (Vehicle Re-ID) aims at retrieving and tracking the specified target vehicle with multiple other cameras, which can provide help in checking violations and catching fugitives, but there are still the following problems that need to be solved urgently. First, the existing co...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Scientific reports 2024-11, Vol.14 (1), p.26531-14, Article 26531
Hauptverfasser: Liu, Mengxue, Min, Weidong, Han, Qing, Xiang, Hongyue, Zhu, Meng
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Vehicle re-identification (Vehicle Re-ID) aims at retrieving and tracking the specified target vehicle with multiple other cameras, which can provide help in checking violations and catching fugitives, but there are still the following problems that need to be solved urgently. First, the existing collected Vehicle Re-ID data often have low resolution and blur in local regions, so that the Vehicle Re-ID algorithm cannot accurately extract subtle feature representations. In addition, small features are easy to cause the disappearance of features under the operation of a large convolution kernel, which makes the model unable to capture and learn subtle features, resulting in inaccurate judgment of vehicles. In this study, we propose a Vehicle Re-ID method based on super resolution and pyramidal convolution residual network. Firstly, a super-resolution image generation network leveraging generative adversarial networks (GANs) is proposed. This network employs both content loss and adversarial loss as optimization criteria, ensuring an efficient transformation from a low-resolution image into a super-resolution counterpart, while meticulously preserving intricate high-frequency details. Then, multi levels of pyramidal convolution operations are designed to generate multi-scale features, which can capture information on different scales. Moreover, the concept of residual learning is applied between the multi levels of pyramidal convolution operations to expedite model optimization and enhance recognition capabilities. Ultimately, the double pyramidal convolutions are meticulously employed on both the original image and the super-resolution image, yielding low-noise feature representations and intricate semantic information respectively. By seamlessly fusing these two diverse sources of information, the resultant combined features exhibit heightened discrimination capabilities and significantly bolster the robustness of image features. In order to verify the effectiveness of the proposed method, extensive experiments are carried out on VeRi-776 and VehicleID datasets. The experimental results show that the method proposed in this paper effectively captures the detail information of vehicle images, accurately distinguishes the subtle differences between different vehicles of the same type, and is superior to state-of-the-art methods.
ISSN:2045-2322
2045-2322
DOI:10.1038/s41598-024-77973-8