Toward durable representations for continual learning

Bibliographic details
Published in: Advances in Computational Intelligence, 2022-02, Vol. 2 (1), p. 7, Article 7
Authors: El Khatib, Alaa; Karray, Fakhri
Format: Article
Language: English
Online access: Full text
Description
Abstract: Continual learning models are known to suffer from catastrophic forgetting. Existing regularization methods for countering forgetting operate by penalizing large changes to learned parameters. A significant downside to these methods, however, is that, by effectively freezing model parameters, they gradually suspend the capacity of a model to learn new tasks. In this paper, we explore an alternative approach to the continual learning problem that aims to circumvent this downside. In particular, we ask the question: instead of forcing continual learning models to remember the past, can we modify the learning process from the start, such that the learned representations are less susceptible to forgetting? To this end, we explore multiple methods that could potentially encourage durable representations. We demonstrate empirically that the use of unsupervised auxiliary tasks achieves a significant reduction in parameter re-optimization across tasks, and consequently reduces forgetting, without explicitly penalizing forgetting. Moreover, we propose a distance metric to track internal model dynamics across tasks, and use it to gain insight into the workings of our proposed approach, as well as other recently proposed methods.
ISSN: 2730-7794, 2730-7808
DOI: 10.1007/s43674-021-00022-8
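
The abstract above points to two concrete mechanisms: training the shared representation with an unsupervised auxiliary task alongside the supervised objective, and a distance metric that tracks how much parameters are re-optimized when a new task is learned. The Python sketch below illustrates that general pattern only; it is not the authors' implementation. The reconstruction objective used as the auxiliary task, the network sizes, and the normalized L2 parameter distance are assumptions chosen for illustration.

# Illustrative sketch (not the paper's code): a shared encoder trained with a
# supervised head plus an unsupervised reconstruction head as an auxiliary task,
# and a simple normalized parameter-distance probe between task checkpoints.
# The architecture, the choice of reconstruction as the auxiliary task, and the
# distance definition are assumptions made for illustration.

import copy
import torch
import torch.nn as nn
import torch.nn.functional as F


class SharedEncoderModel(nn.Module):
    def __init__(self, in_dim=784, hidden=256, num_classes=10):
        super().__init__()
        # Shared representation used by both the supervised and auxiliary heads.
        self.encoder = nn.Sequential(nn.Linear(in_dim, hidden), nn.ReLU())
        self.classifier = nn.Linear(hidden, num_classes)  # supervised head
        self.decoder = nn.Linear(hidden, in_dim)          # unsupervised head

    def forward(self, x):
        z = self.encoder(x)
        return self.classifier(z), self.decoder(z)


def training_step(model, optimizer, x, y, aux_weight=0.5):
    # One step of joint supervised + unsupervised auxiliary training.
    logits, recon = model(x)
    loss = F.cross_entropy(logits, y) + aux_weight * F.mse_loss(recon, x)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()


def parameter_distance(state_a, state_b):
    # Normalized L2 distance between two parameter snapshots (illustrative only).
    num, den = 0.0, 0.0
    with torch.no_grad():
        for name, p_a in state_a.items():
            p_b = state_b[name]
            num += (p_a - p_b).pow(2).sum().item()
            den += p_a.pow(2).sum().item()
    return (num / max(den, 1e-12)) ** 0.5


if __name__ == "__main__":
    torch.manual_seed(0)
    model = SharedEncoderModel()
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

    # Snapshot the parameters "after task A" (here: simply the initial weights).
    snapshot = copy.deepcopy(model.state_dict())

    # Train briefly on synthetic data standing in for "task B".
    x = torch.rand(64, 784)
    y = torch.randint(0, 10, (64,))
    for _ in range(20):
        training_step(model, optimizer, x, y)

    # How far did the shared parameters move while learning the new task?
    print("parameter distance:", parameter_distance(snapshot, model.state_dict()))

In an actual continual-learning run, the snapshot would be taken after each task and the distance computed per layer, to see where parameter re-optimization concentrates as new tasks arrive.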