Tri-Path Backbone Network for Image Manipulation Localization

We propose a novel Tri-Path Backbone Network (TPB-Net) and train it end-to-end to effectively detect multiple types of image manipulations. The key challenge for image manipulation localization lies in the difficulty and diversity of extracting forgery features. To address this, we adopt a Triple-pa...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE access 2024, Vol.12, p.83217-83227
Hauptverfasser: Song, H., Lin, Baichuan, Ye, D.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:We propose a novel Tri-Path Backbone Network (TPB-Net) and train it end-to-end to effectively detect multiple types of image manipulations. The key challenge for image manipulation localization lies in the difficulty and diversity of extracting forgery features. To address this, we adopt a Triple-path Interconnected Backbone (TIB) scheme as the feature extractor, which enables the strong feature detection capabilities. Furthermore, we design and introduce the Dual-path Compressed Sensing Attention (DCSA) module, that incorporates a dual-path attention mechanism. The DCSA module intelligently compresses channels in the spatial path and spatial information in the channel path. These compression operations lead to improved learning efficiency, enhanced representation effectiveness, and increased model robustness. TPB-Net offers an end-to-end framework comprising trainable modules, facilitating joint optimization and enabling the achievement of optimal performance. Through rigorous experiments conducted on four standard image manipulation datasets, we demonstrate the superior performance of our method compared to previous state-of-the-art approaches.
ISSN:2169-3536
2169-3536
DOI:10.1109/ACCESS.2024.3410974