A semi-supervised multiscale generalized-VAE framework for one-class classification

Deep-learning based approaches for unsupervised anomaly detection typically learn either a generative model of the inlier class or a decision boundary to encapsulate the inlier class. In addition to the training data from the inlier class, the availability of a small amount of training data from the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Neurocomputing (Amsterdam) 2025-03, Vol.620, p.129172, Article 129172
Hauptverfasser: Sharma, Renuka, Awate, Suyash P.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Deep-learning based approaches for unsupervised anomaly detection typically learn either a generative model of the inlier class or a decision boundary to encapsulate the inlier class. In addition to the training data from the inlier class, the availability of a small amount of training data from the outlier class can aid in refining the classifier model using principles of semi-supervised learning. This paper proposes a novel end-to-end deep semi-supervised variational framework for one-class classification of images, leveraging data-adaptive generalized-Gaussian (GG) models leading to effective modeling of distributions in both latent space and image space. The framework proposes a novel variational encoder that models a distribution on a multiscale (here, “scale” refers to spatial resolution) latent-space encoding, together with a generalized reparameterization scheme for the GG model’s sampling at each such scale. While the multiscale latent-space helps effective feature learning at coarse and fine spatial scales, the semi-supervision helps tune the feature learning to improve separability between the inliers and the outliers. Results on several publicly available industrial-imaging and medical-imaging datasets show the benefits of our framework’s novel components over existing approaches.
ISSN:0925-2312
DOI:10.1016/j.neucom.2024.129172