CSTAN: A Deepfake Detection Network with CST Attention for Superior Generalization

With the advancement of deepfake forgery technology, highly realistic fake faces have posed serious security risks to sensor-based facial recognition systems. Recent deepfake detection models mainly use binary classification models based on deep learning. Despite achieving high detection accuracy on...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Sensors (Basel, Switzerland) Switzerland), 2024-11, Vol.24 (22), p.7101
Hauptverfasser:	Yang, Rui, You, Kang, Pang, Cheng, Luo, Xiaonan, Lan, Rushi
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Analysis attention mechanism Automated Facial Recognition - methods Biometry Datasets Deep Learning Deepfake deepfake detection Design detection model Face - anatomy & histology Face - physiology Facial Recognition - physiology feature extraction Forgery Humans Image Processing, Computer-Assisted - methods Neural Networks, Computer Sensors
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	With the advancement of deepfake forgery technology, highly realistic fake faces have posed serious security risks to sensor-based facial recognition systems. Recent deepfake detection models mainly use binary classification models based on deep learning. Despite achieving high detection accuracy on intra-datasets, these models lack generalization ability when applied to cross-datasets. We propose a deepfake detection model named Channel-Spatial-Triplet Attention Network (CSTAN), which focuses on the difference between real and fake features, thereby enhancing the generality of the detection model. To enhance the feature-learning ability of the model for image forgery regions, we have designed the Channel-Spatial-Triplet (CST) attention mechanism, which extracts subtle local information by capturing feature channels and the spatial correlation of three different scales. Additionally, we propose a novel feature extraction method, OD-ResNet-34, by embedding ODConv into the feature extraction network to enhance its dynamic adaptability to data features. Trained on the FF++ dataset and tested on the Celeb-DF-v1 and Celeb-DF-v2 datasets, the experimental results show that our model has stronger generalization ability in cross-datasets than similar models.
ISSN:	1424-8220 1424-8220
DOI:	10.3390/s24227101