UniVAD: A Training-free Unified Model for Few-shot Visual Anomaly Detection
Visual Anomaly Detection (VAD) aims to identify abnormal samples in images that deviate from normal patterns, covering multiple domains, including industrial, logical, and medical fields. Due to the domain gaps between these fields, existing VAD methods are typically tailored to each domain, with sp...
Gespeichert in:
Hauptverfasser: | , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Visual Anomaly Detection (VAD) aims to identify abnormal samples in images
that deviate from normal patterns, covering multiple domains, including
industrial, logical, and medical fields. Due to the domain gaps between these
fields, existing VAD methods are typically tailored to each domain, with
specialized detection techniques and model architectures that are difficult to
generalize across different domains. Moreover, even within the same domain,
current VAD approaches often follow a "one-category-one-model" paradigm,
requiring large amounts of normal samples to train class-specific models,
resulting in poor generalizability and hindering unified evaluation across
domains. To address this issue, we propose a generalized few-shot VAD method,
UniVAD, capable of detecting anomalies across various domains, such as
industrial, logical, and medical anomalies, with a training-free unified model.
UniVAD only needs few normal samples as references during testing to detect
anomalies in previously unseen objects, without training on the specific
domain. Specifically, UniVAD employs a Contextual Component Clustering ($C^3$)
module based on clustering and vision foundation models to segment components
within the image accurately, and leverages Component-Aware Patch Matching
(CAPM) and Graph-Enhanced Component Modeling (GECM) modules to detect anomalies
at different semantic levels, which are aggregated to produce the final
detection result. We conduct experiments on nine datasets spanning industrial,
logical, and medical fields, and the results demonstrate that UniVAD achieves
state-of-the-art performance in few-shot anomaly detection tasks across
multiple domains, outperforming domain-specific anomaly detection models. The
code will be made publicly available. |
---|---|
DOI: | 10.48550/arxiv.2412.03342 |