TransU2-Net: A hybrid Transformer Architecture for Image Splicing Forgery Detection

In recent years, various convolutional neural network (CNN) based frameworks have been presented to detect forged regions in images. However, most of the existing models can not obtain satisfactory performance due to tampered areas with various sizes, especially for objects with large-scale. In orde...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE access 2023-01, Vol.11, p.1-1
Hauptverfasser:	Yan, Caiping, Li, Shuyuan, Li, Hong
Format:	Artikel
Sprache:	eng
Schlagworte:	Artificial neural networks Coders Convolutional neural network Convolutional neural networks Cross-attention Datasets Decoding Feature extraction Forgery Image splicing forgery detection Location awareness Self-attention Semantics Spatial dependencies Splicing Streaming media Tampered region localization
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In recent years, various convolutional neural network (CNN) based frameworks have been presented to detect forged regions in images. However, most of the existing models can not obtain satisfactory performance due to tampered areas with various sizes, especially for objects with large-scale. In order to obtain an accurate object-level forgery localization result, we propose a novel hybrid transformer architecture, which exhibits both advantages of spatial dependencies and contextual information from different scales, namely, TransU 2 -Net. Specifically, long-range semantic dependencies are captured by the last block of encoder to locate large-scale tampered areas more completely . Meanwhile, non-semantic features are filtered out by enhancing low-level features under the guidance of high-level semantic information in the skip connections to achieve more refined spatial recovery. Therefore, our hybrid model can locate spliced forgeries with various sizes without requiring large data set pre-training. In comparison with other existing CNN-based methods, our framework achieves better performance over state-of-the-art methods.
ISSN:	2169-3536
DOI:	10.1109/ACCESS.2023.3264014