On the Use of Anchoring for Training Vision Models
Anchoring is a recent, architecture-agnostic principle for training deep neural networks that has been shown to significantly improve uncertainty estimation, calibration, and extrapolation capabilities. In this paper, we systematically explore anchoring as a general protocol for training vision mode...
Gespeichert in:
Hauptverfasser: | , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Anchoring is a recent, architecture-agnostic principle for training deep
neural networks that has been shown to significantly improve uncertainty
estimation, calibration, and extrapolation capabilities. In this paper, we
systematically explore anchoring as a general protocol for training vision
models, providing fundamental insights into its training and inference
processes and their implications for generalization and safety. Despite its
promise, we identify a critical problem in anchored training that can lead to
an increased risk of learning undesirable shortcuts, thereby limiting its
generalization capabilities. To address this, we introduce a new anchored
training protocol that employs a simple regularizer to mitigate this issue and
significantly enhances generalization. We empirically evaluate our proposed
approach across datasets and architectures of varying scales and complexities,
demonstrating substantial performance gains in generalization and safety
metrics compared to the standard training protocol. |
---|---|
DOI: | 10.48550/arxiv.2406.00529 |