Attentional prototype inference for few-shot segmentation
Published in: | Pattern Recognition 2023-10, Vol.142, p.109726, Article 109726 |
---|---|
Main authors: | , , , , , , |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
Summary: | • We provide a fully probabilistic framework for few-shot semantic segmentation. • We introduce a variational attention mechanism that enables the model to capture the appearance variation of an object. • We formulate the optimization as a variational inference problem to jointly estimate posteriors over latent variables.
This paper addresses few-shot segmentation. Existing prototype-based methods have achieved considerable success, but they suffer from the uncertainty and ambiguity caused by limited labeled examples. We propose attentional prototype inference (API), a probabilistic latent variable framework for few-shot segmentation. We define a global latent variable that represents the prototype of each object category and model it as a probability distribution. Modeling the prototype probabilistically improves the model’s generalization ability by handling the inherent uncertainty caused by limited data and intra-class variations of objects. To further enhance the model, we introduce a local latent variable that represents the attention map of each query image, which enables the model to attend to foreground objects while suppressing the background. We formulate the optimization of the proposed model as a variational Bayesian inference problem, which is implemented with amortized inference networks. We conduct extensive experiments on four benchmarks, where our method performs at least competitively with, and often better than, state-of-the-art prototype-based methods. We also provide comprehensive analyses and ablation studies to gain insight into the effectiveness of our method for few-shot segmentation. |
---|---|
ISSN: | 0031-3203, 1873-5142 |
DOI: | 10.1016/j.patcog.2023.109726 |
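
The summary describes the prototype as a distribution rather than a single vector. The toy sketch below illustrates that one idea only and is not the paper's API model. Everything in it is an assumption made for illustration: the function names, the 2-D features, the fixed standard deviation (the paper instead estimates posteriors with amortized inference networks), and the use of masked average pooling and cosine similarity, which are common choices in prototype-based segmentation but are not confirmed by this record. The local attention latent variable is omitted.

```python
import math
import random


def masked_average(features, mask):
    """Mean of the feature vectors whose mask entry is 1 (foreground)."""
    dim = len(features[0])
    total = [0.0] * dim
    count = 0
    for vec, m in zip(features, mask):
        if m:
            count += 1
            for i, v in enumerate(vec):
                total[i] += v
    if count == 0:
        raise ValueError("support mask has no foreground pixels")
    return [t / count for t in total]


def sample_prototype(mean, std, rng):
    """Reparameterized draw z = mean + std * eps, with eps ~ N(0, 1)."""
    return [m + std * rng.gauss(0.0, 1.0) for m in mean]


def cosine(a, b):
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0 else 0.0


def foreground_scores(query, mean, std, n_samples=20, seed=0):
    """Average cosine similarity of each query pixel to sampled prototypes."""
    rng = random.Random(seed)
    scores = [0.0] * len(query)
    for _ in range(n_samples):
        proto = sample_prototype(mean, std, rng)
        for i, vec in enumerate(query):
            scores[i] += cosine(vec, proto) / n_samples
    return scores


# Toy data (hypothetical): four support pixels and three query pixels.
support = [[1.0, 0.1], [0.9, 0.0], [0.0, 1.0], [0.1, 0.9]]
support_mask = [1, 1, 0, 0]  # first two support pixels are foreground
query = [[0.8, 0.1], [0.1, 1.0], [1.0, 0.2]]

# Prototype mean from the support set; std is a fixed, assumed value here.
mean = masked_average(support, support_mask)
scores = foreground_scores(query, mean, std=0.05)

# Threshold of 0.8 is arbitrary, chosen only for this toy example.
prediction = [1 if s > 0.8 else 0 for s in scores]
```

Averaging over several sampled prototypes is a simple Monte Carlo way to let prototype uncertainty influence the prediction; a learned model would predict the mean and variance per category instead of fixing them.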