Novel Exploit Feature-Map-Based Detection of Adversarial Attacks

Bibliographic details
Published in:	Applied sciences 2022-05, Vol.12 (10), p.5161
Main authors:	Almuflih, Ali Saeed; Vyas, Dhairya; Kapdia, Viral V.; Qureshi, Mohamed Rafik Noor Mohamed; Qureshi, Karishma Mohamed Rafik; Makkawi, Elaf Abdullah
Format:	Article
Language:	English
Subjects:	Accuracy; adversarial attack; Algorithms; Approximation; Assaults; Classification; convolutional neural networks; Feature maps; feature-map; Learning algorithms; Machine learning; Methods; Network topologies; Neural networks; Noise; Noise prediction; ResNet50; VGGNet19; Wavelet transforms; white box
Online access:	Full text

Description
Summary:	In machine learning (ML), an adversarial attack (targeted or untargeted) in the presence of noise disturbs the model's prediction. This research suggests that adversarial perturbations of images introduce noise into the features constructed by neural networks. Adversarial attacks on image classification systems may therefore present both obstacles and opportunities for studying convolutional neural networks (CNNs). Motivated by this observation about pixel-level perturbations, we developed a novel exploit feature map that characterizes adversarial attacks through a visual description of each object's feature map. Specifically, a novel detection algorithm calculates the class activation map weight of each object and builds a combined activation map. When evaluated on networks such as VGGNet19 and ResNet50, in both white-box and black-box attack settings, the exploit feature map significantly improves on the state of the art in adversarial resilience. It also exposes attacks on ImageNet generated by algorithms such as the Fast Gradient Sign Method (FGSM), DeepFool, Projected Gradient Descent (PGD), and Backward Pass Differentiable Approximation (BPDA).
ISSN:	2076-3417
DOI:	10.3390/app12105161
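
The summary mentions computing a class activation map weight per object and merging the results into a combined activation map. The record does not give the authors' algorithm, so the following is only a minimal sketch of the general idea, assuming the classic class-activation-map formulation (a weighted sum of the last convolutional feature maps, using the classifier weights of one class) and assuming that per-class maps are merged with an element-wise maximum. The function names `class_activation_map` and `combined_activation_map`, the array shapes, and the merge rule are all hypothetical illustrations, not taken from the article.

```python
import numpy as np


def class_activation_map(feature_maps, class_weights):
    """Weighted sum of feature maps for one class (assumed CAM formulation).

    feature_maps: array of shape (C, H, W), the last conv-layer activations.
    class_weights: array of shape (C,), classifier weights for one class.
    Returns an (H, W) map rescaled to [0, 1].
    """
    cam = np.tensordot(class_weights, feature_maps, axes=([0], [0]))
    cam = np.maximum(cam, 0.0)  # keep only positive evidence for the class
    peak = cam.max()
    return cam / peak if peak > 0 else cam


def combined_activation_map(feature_maps, fc_weights, class_ids):
    """Merge the maps of several classes (assumption: element-wise maximum).

    fc_weights: array of shape (num_classes, C).
    class_ids: iterable of class indices, one per detected object.
    """
    cams = [class_activation_map(feature_maps, fc_weights[k]) for k in class_ids]
    return np.maximum.reduce(cams)


if __name__ == "__main__":
    # Toy input: 2 channels on a 2x2 grid, 2 classes.
    fmaps = np.array([[[1.0, 0.0], [0.0, 0.0]],
                      [[0.0, 0.0], [0.0, 2.0]]])
    weights = np.array([[1.0, 0.0],
                        [0.0, 1.0]])
    print(combined_activation_map(fmaps, weights, class_ids=[0, 1]))
```

In a detector of the kind the summary describes, one might compare such a combined map for a clean input against the map for a suspect input; how the article actually scores that difference is not stated in this record.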