Aitken-based acceleration methods for assessing convergence of multilayer neural networks


Bibliographic details
Published in: IEEE Transactions on Neural Networks, 2001-09, Vol. 12 (5), p. 998-1012
Authors: Pilla, R.S., Kamarthi, S.V., Lindsay, B.G.
Format: Article
Language: English
Description
Abstract: This paper first develops the ideas of the Aitken Δ² method to accelerate the rate of convergence of an error sequence (the value of the objective function at each step) obtained by training a neural network with a sigmoidal activation function via the backpropagation algorithm. The Aitken method is exact when the error sequence is exactly geometric. However, theoretical and empirical evidence suggests that the best possible rate of convergence obtainable for such an error sequence is log-geometric. This paper develops a new invariant extended-Aitken acceleration method for accelerating log-geometric sequences. The resulting accelerated sequence enables one to predict the final value of the error function. These predictions can in turn be used to assess the distance between the current and final solutions, and thereby provide a stopping criterion for a desired accuracy. Each of the techniques described is applicable to a wide range of problems. The invariant extended-Aitken acceleration approach shows improved acceleration as well as outstanding prediction of the final error in the practical problems considered.
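As background for the abstract above, the classical Aitken Δ² transformation it builds on can be sketched in a few lines. This is a minimal illustration of the standard textbook method only, not the paper's invariant extended-Aitken procedure; the function name and the synthetic geometric error sequence are assumptions for the example.

```python
def aitken_delta2(s):
    """Apply the classical Aitken Δ² transformation to a sequence s.

    For an exactly geometric error sequence s_n = L + c*r**n (|r| < 1),
    each transformed term recovers the limit L exactly, which is why the
    transformation can be used to predict the final value of the error
    function from a few early iterates.
    """
    out = []
    for n in range(len(s) - 2):
        d1 = s[n + 1] - s[n]                      # first forward difference
        d2 = s[n + 2] - 2 * s[n + 1] + s[n]       # second forward difference
        if d2 == 0:
            out.append(s[n + 1])                  # locally linear: no acceleration possible
        else:
            out.append(s[n] - d1 * d1 / d2)       # Aitken Δ² step
    return out

# Synthetic geometric error sequence converging to L = 0.25 with ratio r = 0.5
L, c, r = 0.25, 1.0, 0.5
errors = [L + c * r ** n for n in range(8)]
accelerated = aitken_delta2(errors)
# every accelerated term recovers the limit 0.25 (up to rounding)
```

For a training-error sequence that is only approximately geometric, the transformed terms no longer hit the limit exactly, which motivates the paper's extension to log-geometric sequences.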
ISSN: 1045-9227, 2162-237X, 1941-0093, 2162-2388
DOI:10.1109/72.950130