A Distinct Unsupervised Reference Model From The Environment Helps Continual Learning
The existing continual learning methods are mainly focused on fully-supervised scenarios and are still not able to take advantage of unlabeled data available in the environment. Some recent works tried to investigate semi-supervised continual learning (SSCL) settings in which the unlabeled data are...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The existing continual learning methods are mainly focused on
fully-supervised scenarios and are still not able to take advantage of
unlabeled data available in the environment. Some recent works tried to
investigate semi-supervised continual learning (SSCL) settings in which the
unlabeled data are available, but it is only from the same distribution as the
labeled data. This assumption is still not general enough for real-world
applications and restricts the utilization of unsupervised data. In this work,
we introduce Open-Set Semi-Supervised Continual Learning (OSSCL), a more
realistic semi-supervised continual learning setting in which
out-of-distribution (OoD) unlabeled samples in the environment are assumed to
coexist with the in-distribution ones. Under this configuration, we present a
model with two distinct parts: (i) the reference network captures
general-purpose and task-agnostic knowledge in the environment by using a broad
spectrum of unlabeled samples, (ii) the learner network is designed to learn
task-specific representations by exploiting supervised samples. The reference
model both provides a pivotal representation space and also segregates
unlabeled data to exploit them more efficiently. By performing a diverse range
of experiments, we show the superior performance of our model compared with
other competitors and prove the effectiveness of each component of the proposed
model. |
---|---|
DOI: | 10.48550/arxiv.2301.04506 |