A semi-supervised multiscale generalized-VAE framework for one-class classification

Bibliographic details
Published in:	Neurocomputing (Amsterdam) 2025-03, Vol.620, p.129172, Article 129172
Main authors:	Sharma, Renuka; Awate, Suyash P.
Format:	Article
Language:	English
Subjects:	Generalized-Gaussian; Multiscale latent space; One-class classification; Robustness; Semi-supervision; Uncertainty; Variational autoencoder
Online access:	Full text

Description
Abstract:	Deep-learning-based approaches for unsupervised anomaly detection typically learn either a generative model of the inlier class or a decision boundary that encapsulates the inlier class. In addition to the training data from the inlier class, a small amount of training data from the outlier class can help refine the classifier model using principles of semi-supervised learning. This paper proposes a novel end-to-end deep semi-supervised variational framework for one-class classification of images, leveraging data-adaptive generalized-Gaussian (GG) models to model distributions effectively in both latent space and image space. The framework introduces a novel variational encoder that models a distribution on a multiscale latent-space encoding (here, “scale” refers to spatial resolution), together with a generalized reparameterization scheme for sampling from the GG model at each such scale. While the multiscale latent space supports effective feature learning at coarse and fine spatial scales, the semi-supervision tunes the feature learning to improve separability between the inliers and the outliers. Results on several publicly available industrial-imaging and medical-imaging datasets show the benefits of the framework’s novel components over existing approaches.
ISSN:	0925-2312
DOI:	10.1016/j.neucom.2024.129172
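
Note on GG sampling: the abstract refers to a generalized reparameterization scheme for sampling from the generalized-Gaussian model, but this record does not describe that scheme. As a rough illustration only, the sketch below uses a standard textbook transform (not necessarily the authors' method): for the density proportional to exp(-(|x - mu| / alpha)^beta), a draw can be written as mu + alpha * s * G^(1/beta), where G is Gamma-distributed with shape 1/beta and s is a random sign. The function name and the parameter names `mu`, `alpha`, and `beta` are assumptions made for this example.

```python
import numpy as np


def sample_generalized_gaussian(mu, alpha, beta, size, rng=None):
    """Draw samples from a generalized Gaussian (hypothetical helper).

    Density: p(x) proportional to exp(-(|x - mu| / alpha) ** beta).
    beta = 2 gives a Gaussian, beta = 1 gives a Laplace distribution.
    """
    rng = np.random.default_rng() if rng is None else rng
    # If G ~ Gamma(shape=1/beta, scale=1), then G ** (1/beta) has the
    # distribution of |x - mu| / alpha under the density above.
    g = rng.gamma(shape=1.0 / beta, scale=1.0, size=size)
    # A random sign makes the sample symmetric about mu.
    sign = rng.choice([-1.0, 1.0], size=size)
    # Location-scale form: mu and alpha enter only through this affine map.
    return mu + alpha * sign * g ** (1.0 / beta)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = sample_generalized_gaussian(mu=0.0, alpha=1.0, beta=2.0,
                                    size=100_000, rng=rng)
    # Theoretical variance is alpha^2 * Gamma(3/beta) / Gamma(1/beta).
    print(x.mean(), x.var())
```

This NumPy version only demonstrates the sampling transform. In a trainable model, the location `mu` and scale `alpha` can be differentiated through the affine map directly, whereas gradients with respect to the shape `beta` need additional care; how the paper handles that is not stated in this record.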