METHOD FOR RECOGNIZING AN ADVERSARIAL DISTURBANCE IN INPUT DATA OF A NEURAL NETWORK


Bibliographic details
Main authors: SCHLICHT, Peter, VARGHESE, John Serin, KAPOOR, Nikhil
Format: Patent
Language: English
Description
Summary: A method for detecting an adversarial perturbation in input data of a neural network. During a training phase, a conditional generative adversarial network is trained: its generator network is trained to generate adversarial perturbations conditioned on input data of the neural network, and its discriminator network is trained at least to detect an adversarial perturbation in the input data generated by the generator network. During an application phase, the trained discriminator network detects an adversarial perturbation in input data of the neural network and provides a detection result. Also disclosed are a backend server, a detection device, and a system.
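The two phases described in the summary can be illustrated with a very small, standard-library-only sketch. It is not the patented implementation: the record gives no architecture, losses, or hyperparameters, so everything below is an assumption made for illustration. That includes the 4-dimensional Gaussian "input data", the linear toy target network `U`, the single-layer `Generator` and `Discriminator` classes, the attack weight `LAM`, the perturbation bound `EPS`, and the dictionary returned by `detect`. The generator produces a bounded perturbation conditioned on the input, the discriminator learns to tell perturbed inputs from clean ones, and only the trained discriminator is used in the application phase.

```python
import math
import random

# All constants are illustrative assumptions, not values from the patent.
DIM, EPS, LR, LAM = 4, 0.5, 0.05, 0.5
U = [1.0, -1.0, 0.5, 0.0]  # hypothetical fixed target network: logit = U . x


def sigmoid(t):
    """Numerically stable logistic function."""
    if t >= 0:
        return 1.0 / (1.0 + math.exp(-t))
    e = math.exp(t)
    return e / (1.0 + e)


def dot(a, b):
    return sum(p * q for p, q in zip(a, b))


class Generator:
    """Maps an input x to a perturbation bounded by EPS (conditioned on x)."""

    def __init__(self, rng):
        self.W = [[rng.gauss(0, 0.1) for _ in range(DIM)] for _ in range(DIM)]
        self.b = [0.0] * DIM

    def perturbation(self, x):
        return [EPS * math.tanh(dot(row, x) + bi)
                for row, bi in zip(self.W, self.b)]


class Discriminator:
    """Logistic detector: score near 1 means 'looks adversarially perturbed'."""

    def __init__(self):
        self.v = [0.0] * DIM
        self.c = 0.0

    def score(self, z):
        return sigmoid(dot(self.v, z) + self.c)

    def detect(self, z, threshold=0.5):
        p = self.score(z)
        return {"adversarial": p > threshold, "score": p}


def train(steps=2000, seed=0):
    """Training phase: alternate discriminator and generator SGD steps."""
    rng = random.Random(seed)
    gen, disc = Generator(rng), Discriminator()
    for _ in range(steps):
        x = [rng.gauss(0, 1) for _ in range(DIM)]      # clean input
        delta = gen.perturbation(x)
        z = [xi + di for xi, di in zip(x, delta)]      # perturbed input

        # Discriminator: binary cross-entropy, label 0 = clean, 1 = perturbed.
        for sample, label in ((x, 0.0), (z, 1.0)):
            err = disc.score(sample) - label           # dLoss/dlogit
            disc.v = [vi - LR * err * si for vi, si in zip(disc.v, sample)]
            disc.c -= LR * err

        # Generator: minimise -log(1 - D(z)) (evade detection) plus
        # LAM * (U . delta) (push the toy target network's logit down).
        p = disc.score(z)
        for i in range(DIM):
            dtanh = EPS * (1.0 - (delta[i] / EPS) ** 2)  # d delta_i / d pre-activation
            g = (p * disc.v[i] + LAM * U[i]) * dtanh
            gen.W[i] = [w - LR * g * xj for w, xj in zip(gen.W[i], x)]
            gen.b[i] -= LR * g
    return gen, disc


# Application phase: only the trained discriminator is needed.
gen, disc = train()
result = disc.detect([0.2, -0.1, 0.4, 0.0])
print(result)
```

The detection result here is a plain dictionary holding a boolean flag and the raw score. A real deployment would presumably use deep networks and image-like inputs, and the generator objective in the actual patent may differ from the toy attack term assumed above.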